How to identify the full command that caused the crash from the core dump file

Keywords: crash, command, identify, core, dump, file      Updated: 2023-10-16

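I am having trouble identifying the full command from a core dump file using gdb. The command that crashed can be quite long, for example: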

myCommand -f log/SlaRunTimeReport.rep -I input/myFile.txt -t output/myFile.txt

But when I use gdb to read the command from the "Core was generated by" line, i.e. by running

gdb -c core.56536

Output:

GNU gdb (GDB) Red Hat Enterprise Linux 7.10-20.el7
….
Core was generated by `myCommand -f log/SlaRunTimeReport.rep -I input/myFile.t'.

As you can see, the full command (executable + arguments) is cut off in the middle:

`myCommand -f log/SlaRunTimeReport.rep -I input/myFile.t'

Running the strings command on the core file does not help identify the full command either:

strings core.56536 | grep PMRunTimeReport

Output:

myCommand 
myCommand -f log/SlaRunTimeReport.rep -I input/myFile.t

Is there a way to get the full command that caused the failure from the core dump file?

Thanks in advance.

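> Is there a way to get the full command that caused the failure from the core dump file?

There are several ways, but running strings is the wrong one.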

If you built the program with debug info, you should be able to simply run the up command until you reach main, and then examine argv[0] through argv[argc-1].
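The frame walk described above can also be scripted with gdb's Python API. The following is a minimal sketch, assuming a gdb built with Python support and a main that was compiled with debug info and declares parameters named argc and argv. find_frame and read_argv are hypothetical helper names, not part of gdb.

```python
def find_frame(frame, name="main"):
    """Walk outward (like repeated `up`) until a frame named `name`; None if absent."""
    while frame is not None and frame.name() != name:
        frame = frame.older()
    return frame


def read_argv(frame):
    """Return argv[0] .. argv[argc-1] of the given frame as Python strings."""
    argc = int(frame.read_var("argc"))
    argv = frame.read_var("argv")
    return [argv[i].string() for i in range(argc)]


try:
    import gdb  # only importable when the script runs inside gdb
except ImportError:
    gdb = None

if gdb is not None:
    # Assumes the executable and the core file are already loaded.
    main_frame = find_frame(gdb.newest_frame())
    if main_frame is not None:
        print(read_argv(main_frame))
```

To try it, save the script under any name (print_argv.py is used here only as an example), load the executable and core in gdb, and run `source print_argv.py`.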

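If your main was not built with debug info, or it does not use argc and argv, you should be able to recover that information from the __libc_argc and __libc_argv variables. Example: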

$ ./a.out foo bar baz $(python -c 'print("a" * 500)')
Aborted (core dumped)
$ gdb -q ./a.out core
Core was generated by `./a.out foo bar baz aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa'.

Note that the "generated by" string is truncated: it comes from a fixed-length array inside struct prpsinfo, which is saved in the NT_PRPSINFO ELF note in the core.
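The effect of that fixed-length array can be reproduced with a short sketch. It assumes the Linux value of 80 bytes for the array (ELF_PRARGSZ) and only approximates how the field is filled in. psargs_from_argv is a hypothetical name used for illustration.

```python
ELF_PRARGSZ = 80  # assumed size of the pr_psargs array on Linux


def psargs_from_argv(argv):
    """Approximate the string stored in pr_psargs for a given argument list."""
    # In process memory the arguments sit back to back, each NUL-terminated.
    raw = b"".join(arg.encode() + b"\0" for arg in argv)
    # Only ELF_PRARGSZ - 1 bytes are kept; the last byte holds the terminating NUL.
    kept = raw[:ELF_PRARGSZ - 1]
    # The NUL separators are shown as spaces.
    return kept.replace(b"\0", b" ").decode(errors="replace")


argv = ["./a.out", "foo", "bar", "baz", "a" * 500]
truncated = psargs_from_argv(argv)
print(len(truncated), truncated)
```

Whatever the real command line was, at most 79 characters of it survive in this note, which is why the full arguments have to be recovered from the process memory instead.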

Program terminated with signal SIGABRT, Aborted.
#0  0x00007fab38cfcf2b in raise () from /lib64/libc.so.6
Missing separate debuginfos, use: dnf debuginfo-install glibc-2.27-15.fc28.x86_64
(gdb) p (int)__libc_argc
$1 = 5
(gdb) p ((char**)__libc_argv)[0]@5
$2 = {0x7ffede43289f "./a.out", 0x7ffede4328a7 "foo", 0x7ffede4328ab "bar",
0x7ffede4328af "baz", 
0x7ffede4328b3 'a' <repeats 200 times>...}

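The last line is misleading: 'a' is actually repeated 500 times, not 200. By default gdb stops printing a string after 200 characters (the "print elements" limit) and marks the cut with a trailing "...".

We can fix it like this: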

(gdb) set print elem 0
(gdb) p ((char**)__libc_argv)[0]@5
$3 = {0x7ffede43289f "./a.out", 0x7ffede4328a7 "foo", 0x7ffede4328ab "bar",
0x7ffede4328af "baz", 
0x7ffede4328b3 'a' <repeats 500 times>}

Voila: we now have the complete command.
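The same two gdb commands can also be run non-interactively, which is handy when many core files need checking. This is a sketch under stated assumptions: gdb is on PATH, its -q, -batch and -ex options are used, and the array length after @ is written as the expression (int)__libc_argc instead of the literal 5 shown above. build_gdb_command is a hypothetical helper name.

```python
import subprocess
import sys


def build_gdb_command(executable, core):
    """Build the argument list for a batch-mode gdb run that prints the full argv."""
    return [
        "gdb", "-q", "-batch",
        # Lift the default 200-element print limit so long arguments are not cut.
        "-ex", "set print elements 0",
        # Assumption: the length operand of @ may be an expression, not only a literal.
        "-ex", "print ((char**)__libc_argv)[0]@(int)__libc_argc",
        executable, core,
    ]


if __name__ == "__main__" and len(sys.argv) == 3:
    # Usage: python this_script.py ./a.out core
    result = subprocess.run(build_gdb_command(sys.argv[1], sys.argv[2]),
                            capture_output=True, text=True)
    print(result.stdout)
```

Passing the commands as a list avoids shell quoting problems with the parentheses and the @ in the gdb expression.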

Finally, if you have debug info for GLIBC installed, you can simply look at the frame for __libc_start_main (which calls your main):

(gdb) set backtrace past-main
(gdb) bt
#0  __GI_raise (sig=sig@entry=6) at ../sysdeps/unix/sysv/linux/raise.c:50
#1  0x00007fab38ce7561 in __GI_abort () at abort.c:79
#2  0x00000000004004ef in main () at foo.c:3
#3  0x00007fab38ce918b in __libc_start_main (main=0x4004e6 <main>, argc=5, argv=0x7ffede431118, 
init=<optimized out>, fini=<optimized out>, rtld_fini=<optimized out>, stack_end=0x7ffede431108)
at ../csu/libc-start.c:308
#4  0x000000000040042a in _start ()

Here you can clearly see argc and argv in frame 3, and you can examine argv like this:

(gdb) fr 3
#3  0x00007fab38ce918b in __libc_start_main (main=0x4004e6 <main>, argc=5, argv=0x7ffede431118, 
init=<optimized out>, fini=<optimized out>, rtld_fini=<optimized out>, stack_end=0x7ffede431108)
at ../csu/libc-start.c:308
308       result = main (argc, argv, __environ MAIN_AUXVEC_PARAM);
(gdb) p argv[0]@5
$1 = {0x7ffede43289f "./a.out", 0x7ffede4328a7 "foo", 0x7ffede4328ab "bar",
0x7ffede4328af "baz", 
0x7ffede4328b3 'a' <repeats 500 times>}