2001-09-15 01:43:17 +08:00
|
|
|
Copyright 2001 Red Hat Inc., Egor Duda
|
|
|
|
|
2001-10-10 02:12:51 +08:00
|
|
|
So, your favorite program has crashed? And did you say something about
|
2001-09-15 01:43:17 +08:00
|
|
|
'stackdump'? Or it just prints its output from left to right and upside-down?
|
|
|
|
Well, you can file an angry bug report and wait until some of the core
|
|
|
|
developers try to reproduce your problem, try to find what's the matter
|
|
|
|
with your program and cygwin and fix the bug, if any. But you can do something
|
|
|
|
better than that. You can debug the problem yourself, and even if you can't
|
2001-11-05 11:16:58 +08:00
|
|
|
fix it, your analysis may be very helpful. Here's the (incomplete) howto on
|
2001-09-15 01:43:17 +08:00
|
|
|
cygwin debugging.
|
|
|
|
|
2001-09-16 10:56:48 +08:00
|
|
|
1. The first thing you'll need to do is to build cygwin1.dll and your crashed
|
2001-09-15 01:43:17 +08:00
|
|
|
application from sources. To debug them you'll need debug information, which
|
2001-11-05 11:16:58 +08:00
|
|
|
is normally stripped from executables.
|
2001-09-15 01:43:17 +08:00
|
|
|
|
|
|
|
2. Create known-working cygwin debugging environment.
|
|
|
|
- create a separate directory, say, c:\cygdeb, and put known-working
|
2001-09-16 10:56:48 +08:00
|
|
|
cygwin1.dll and gdb.exe in it.
|
2001-09-15 01:43:17 +08:00
|
|
|
- create a wrapper c:\cygdeb\debug_wrapper.cmd:
|
|
|
|
|
|
|
|
========= debug_wrapper.cmd =========
|
2001-10-10 02:12:51 +08:00
|
|
|
rem setting CYGWIN_TESTING environment variable makes cygwin application
|
2001-09-15 01:43:17 +08:00
|
|
|
rem not to interfere with other already running cygwin applications.
|
|
|
|
set CYGWIN_TESTING=1
|
|
|
|
c:\cygdeb\gdb.exe -nw %1 %2
|
|
|
|
===================================
|
|
|
|
|
|
|
|
3. Try to use cygwin's JIT debugging facility:
|
|
|
|
- add 'error_start=c:\cygdeb\debug_wrapper.cmd' to CYGWIN environment
|
|
|
|
variable. When some application encounters critical error, cygwin will stop
|
|
|
|
it and execute debug_wrapper.cmd, which will run gdb and make it to attach to
|
|
|
|
the crashed application.
|
|
|
|
|
|
|
|
4. Strace.
|
|
|
|
You can run your program under 'strace' utility, described if user's manual.
|
2001-11-05 11:16:58 +08:00
|
|
|
If you know where the problem approximately is, you can add a bunch of
|
|
|
|
additional debug_printf()s in the source code and see what they print in
|
2001-09-15 01:43:17 +08:00
|
|
|
strace log. There's one common problem with this method, that some bugs
|
2001-10-10 02:12:51 +08:00
|
|
|
may mysteriously disappear once the program is run under strace. Then the
|
2001-09-15 01:43:17 +08:00
|
|
|
bug is likely a race condition. strace has two useful options to deal with
|
|
|
|
such situation: -b enables buffering of output and reduces additional
|
|
|
|
timeouts introduced by strace, and -m option allows you to mask certain
|
|
|
|
classes of *_printf() functions, reducing timeouts even more.
|
|
|
|
|
2001-11-05 11:16:58 +08:00
|
|
|
5. Problems at early startup.
|
2001-09-15 01:43:17 +08:00
|
|
|
Sometimes, something crashes at the very early stages of application
|
|
|
|
initialization, when JIT debugging facility is not yet active. Ok, there's
|
|
|
|
another environment variable that may help. Create program_wrapper.cmd:
|
|
|
|
|
|
|
|
========= program_wrapper.cmd =========
|
2001-10-10 02:12:51 +08:00
|
|
|
rem setting CYGWIN_SLEEP environment variable makes cygwin application
|
2001-09-15 01:43:17 +08:00
|
|
|
rem to sleep for x milliseconds at startup
|
|
|
|
set CYGWIN_SLEEP=20000
|
|
|
|
c:\some\path\bad_program.exe some parameters
|
|
|
|
===================================
|
2001-12-10 11:35:54 +08:00
|
|
|
|
2001-09-15 01:43:17 +08:00
|
|
|
Now, run program_wrapper.cmd. It should print running program pid.
|
|
|
|
After starting program_wrapper.cmd you've got 20 seconds to open another
|
|
|
|
window, cd to c:\cygdeb in it, run gdb there and in gdb prompt type
|
|
|
|
|
|
|
|
(gdb) attach <pid>
|
|
|
|
|
|
|
|
where <pid> is the pid that program_wrapper.cmd have printed.
|
|
|
|
After that you can normally step through the code in cygwin1.dll and
|
|
|
|
bad_program.exe
|
|
|
|
|
|
|
|
6. Heap corruption.
|
|
|
|
If your program crashes at malloc() or free() or when it references some
|
|
|
|
malloc()'ed memory, it looks like heap corruption. You can configure and
|
|
|
|
build special version of cygwin1.dll which includes heap sanity checking.
|
|
|
|
To do it, just add --enable-malloc-debugging option to configure. Be warned,
|
|
|
|
however, that this version of dll is _very_ slow (10-100 times slower than
|
2001-10-10 02:12:51 +08:00
|
|
|
normal), so use it only when absolutely necessary.
|
2001-11-05 11:16:58 +08:00
|
|
|
|
|
|
|
7. Program dies when running under strace.
|
|
|
|
If your program crashes when you run it using strace but runs ok (or has a
|
|
|
|
different problem) otherwise, then there may be a problem in one of the
|
|
|
|
strace *_printf statements. Usually this is caused by a change in arguments
|
|
|
|
resulting in a %s being used with something other than a pointer to a
|
|
|
|
string.
|
|
|
|
|
|
|
|
To debug this scenario, do something like this:
|
|
|
|
|
|
|
|
bash$ gdb -nw yourapp.exe
|
|
|
|
(gdb) dll cygwin1
|
|
|
|
(gdb) l dll_crt0_1
|
|
|
|
(gdb) bp <<first line in the function>>
|
|
|
|
(gdb) run
|
|
|
|
(gdb) set strace.active=1
|
|
|
|
(gdb) continue
|
|
|
|
|
|
|
|
The program will then run in "strace mode", calling each strace *_printf,
|
|
|
|
just like it does when run under the strace program. Eventually, the
|
|
|
|
program will crash, probably in small_printf. At that point, a 'bt'
|
|
|
|
command should show you the offending call to strace_printf with the
|
|
|
|
improper format string.
|