[Bug 680120] New: Segmentation fault during "make oldconfig" for a Linux kernel 2.6.38
https://bugzilla.novell.com/show_bug.cgi?id=680120
https://bugzilla.novell.com/show_bug.cgi?id=680120#c0

           Summary: Segmentation fault during "make oldconfig" for a Linux kernel 2.6.38
    Classification: openSUSE
           Product: openSUSE 11.4
           Version: Final
          Platform: x86-64
        OS/Version: Other
            Status: NEW
          Severity: Normal
          Priority: P5 - None
         Component: Kernel
        AssignedTo: kernel-maintainers@forge.provo.novell.com
        ReportedBy: Markus.Elfring@web.de
         QAContact: qa@suse.de
          Found By: ---
           Blocker: ---

I would like to inform you about the following situation after I have retrieved the source files for a current kernel by the command "git clone".

Sonne:/usr/src # cd linux-2.6.38.y && cp /boot/config-2.6.37.y .config && make oldconfig
  HOSTCC scripts/basic/fixdep
/bin/sh: line 1: 7856 Segmentation fault scripts/basic/fixdep scripts/basic/.fixdep.d scripts/basic/fixdep 'gcc -Wp,-MD,scripts/basic/.fixdep.d -Wall -Wmissing-prototypes -Wstrict-prototypes -O2 -fomit-frame-pointer -o scripts/basic/fixdep scripts/basic/fixdep.c ' > scripts/basic/.fixdep.tmp
make[1]: *** [scripts/basic/fixdep] Error 139
make: *** [scripts_basic] Error 2

Would you like to correct any invalid memory accesses here?

Is this the same open issue that I reported yesterday as bug #679865?

--
Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.
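
The "Error 139" that make prints in the report above can be read as a signal number. The following is a minimal sketch, assuming the common shell convention that a command killed by signal N is reported with exit status 128 + N; the helper names are invented for this illustration and do not come from the kernel build system.

```c
#include <signal.h>

/* Hypothetical helper: if `status` follows the shell's 128+N convention,
 * return the signal number N, otherwise return 0. */
static int signal_from_exit_status(int status)
{
    return (status > 128) ? status - 128 : 0;
}

/* Hypothetical helper: non-zero when the status corresponds to SIGSEGV. */
static int looks_like_segfault(int status)
{
    return signal_from_exit_status(status) == SIGSEGV;
}
```

Under that assumption, 139 - 128 = 11, which is SIGSEGV on Linux and matches the "Segmentation fault" message printed by /bin/sh.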
https://bugzilla.novell.com/show_bug.cgi?id=680120#c1
Jiri Slaby
> Is this the same open issue that I reported yesterday as bug #679865?

Perhaps. Could you run:
* ulimit -c unlimited
* make oldconfig V=1 2>&1|tee log
and attach log? Also it should dump a core file. So also attach the output of:
file core*
https://bugzilla.novell.com/show_bug.cgi?id=680120#c2
Markus Elfring
https://bugzilla.novell.com/show_bug.cgi?id=680120#c3
Jiri Slaby
> Created an attachment (id=419819) --> (http://bugzilla.novell.com/attachment.cgi?id=419819) [details]
> small make log
>
> I do not see that a core file is generated in my situation.

Weird, did you change the ulimit and run make from within the same shell?

Does this crash (it should not)?
gcc -Wp,-MD,scripts/basic/.fixdep.d -Wall -Wmissing-prototypes -Wstrict-prototypes -O2 -fomit-frame-pointer -o scripts/basic/fixdep scripts/basic/fixdep.c

and this (it should)?
./scripts/basic/fixdep scripts/basic/.fixdep.d scripts/basic/fixdep 'gcc -Wp,-MD,scripts/basic/.fixdep.d -Wall -Wmissing-prototypes -Wstrict-prototypes -O2 -fomit-frame-pointer -o scripts/basic/fixdep scripts/basic/fixdep.c '

Could you prepend 'gdb --args ' before the crashing command and, after it crashes, append here the output of bt full?
https://bugzilla.novell.com/show_bug.cgi?id=680120#c4
--- Comment #4 from Markus Elfring
https://bugzilla.novell.com/show_bug.cgi?id=680120#c5
--- Comment #5 from Jiri Slaby
> Created an attachment (id=419825) --> (http://bugzilla.novell.com/attachment.cgi?id=419825) [details]
> small core dump

Ok, it's fixdep that crashes. Could you also attach the binary scripts/basic/fixdep?
https://bugzilla.novell.com/show_bug.cgi?id=680120#c6
--- Comment #6 from Markus Elfring
https://bugzilla.novell.com/show_bug.cgi?id=680120#c7
Markus Elfring
https://bugzilla.novell.com/show_bug.cgi?id=680120#c8
Jiri Slaby
> Created an attachment (id=419870) --> (http://bugzilla.novell.com/attachment.cgi?id=419870) [details]
> /usr/src/linux-2.6.38.y/scripts/basic/fixdep
>
> Can you reproduce the program crash that I reported here also on your test systems?

No. It dies in:
=> 0x00000000006022d0 <+0>: movd %esi,%xmm1

What processor is that? (attach /proc/cpuinfo). I doubt it's something older than Pentium III? You definitely should have XMM registers. So I see no reason for it to die.
https://bugzilla.novell.com/show_bug.cgi?id=680120#c9
Markus Elfring
https://bugzilla.novell.com/show_bug.cgi?id=680120#c10
Jiri Slaby
https://bugzilla.novell.com/show_bug.cgi?id=680120#c11
Markus Elfring
> No.

Does the unexpected behaviour also happen on systems which are managed by Richard Guenther?
https://bugzilla.novell.com/show_bug.cgi?id=679865#c5
https://bugzilla.novell.com/show_bug.cgi?id=680120#c12
Markus Elfring
https://bugzilla.novell.com/show_bug.cgi?id=680120#c13
--- Comment #13 from Markus Elfring
https://bugzilla.novell.com/show_bug.cgi?id=680120#c14
Jiri Slaby
> [ 1856.591497] fixdep[5047]: segfault at 6022d0 ip 00000000006022d0 sp 00007fff846555d8 error 15 in fixdep[602000+1000]

Ah, let's revoke what I wrote. error 15h (10101b) means, from the lowest bit upwards:
1 The fault was caused by a page-level protection violation.
0 The access causing the fault was a read.
1 The access causing the fault originated when the processor was executing in user mode.
0 The fault was not caused by reserved bit violation.
1 The fault was caused by an instruction fetch.

This means the code for strchr was mapped without the X bit.

Does your gdb already work? I need the output of /proc/<PID>/maps obtained by:
gdb --args ./scripts/basic/fixdep scripts/basic/.fixdep.d scripts/basic/fixdep 'gcc -Wp,-MD,scripts/basic/.fixdep.d -Wall -Wmissing-prototypes -Wstrict-prototypes -O2 -fomit-frame-pointer -o scripts/basic/fixdep scripts/basic/fixdep.c '

In GDB type:
(gdb) run
(gdb) info proc
process SOME_PID

In shell:
$ cat /proc/SOME_PID/maps > maps

And attach maps.

(In reply to comment #13)
> Which source files are required for the generation of the program "fixdep"?

scripts/basic/fixdep.c

> I had no problems with the compilation of Linux 2.6.37.2 by the tool "GCC 4.5.2" on my own computer (hardware/physical machine). Was anything changed in the meantime that is relevant to the reported software situation?

What's the meantime? What versions do you have now?

BTW while Richard is here: is there a way to dig out the shared libs map from a core file?
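
The bit-by-bit reading of "error 15" in comment #14 can be sketched as a small decoder. This is only an illustration, assuming the x86 page-fault error-code layout listed in that comment (bit 0 protection violation, bit 1 write access, bit 2 user mode, bit 3 reserved bit, bit 4 instruction fetch) and that the kernel log prints the value in hexadecimal. The struct and function names are made up for this example.

```c
#include <stdbool.h>

/* Hypothetical type: one flag per error-code bit discussed in comment #14. */
struct pf_error {
    bool protection;   /* bit 0: page-level protection violation */
    bool write;        /* bit 1: access was a write (0 = read) */
    bool user;         /* bit 2: fault originated in user mode */
    bool reserved;     /* bit 3: reserved bit violation */
    bool instr_fetch;  /* bit 4: fault caused by an instruction fetch */
};

/* Split the raw error code into its individual flags. */
static struct pf_error decode_pf_error(unsigned long code)
{
    struct pf_error e;

    e.protection  = (code & 0x01) != 0;
    e.write       = (code & 0x02) != 0;
    e.user        = (code & 0x04) != 0;
    e.reserved    = (code & 0x08) != 0;
    e.instr_fetch = (code & 0x10) != 0;
    return e;
}

/* True for the pattern seen here: a user-mode instruction fetch from a
 * page that is present but not executable. */
static bool is_exec_protection_fault(unsigned long code)
{
    struct pf_error e = decode_pf_error(code);

    return e.protection && !e.write && e.user && e.instr_fetch;
}
```

Calling is_exec_protection_fault(0x15) returns true, which corresponds to the conclusion above that the faulting address was in a mapping without execute permission.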
https://bugzilla.novell.com/show_bug.cgi?id=680120#c15
Markus Elfring
> Does your gdb already work?

Unfortunately not yet.
- I am still affected by the "usual" challenges from another distribution upgrade.
  Example: http://forums.opensuse.org/english/get-technical-help-here/install-boot-logi...
- I do not know at the moment when I will be able to update some remaining packages from the available repositories.
- Will I need to install an alternative python version in my directory "/usr/local"?

> What's the meantime?

Software releases after the following kernels (where I can currently switch between them from the boot menu):
- Linux 2.6.37.2 (generated on my own)
- Linux 2.6.37.1-1.2 (distributed by openSUSE 11.4)

Do compiler versions matter here?
https://bugzilla.novell.com/show_bug.cgi?id=680120#c16
Jiri Slaby
> Example:
> http://forums.opensuse.org/english/get-technical-help-here/install-boot-logi...
> - I do not know at the moment when I will be able to update some remaining packages from the available repositories.

Is there a bug entry somewhere here? I cannot deduce anything from that example... I'm running 11.4 on several machines and have no problem with that.

> - Will I need to install an alternative python version in my directory "/usr/local"?

Ok, probably the strace output would be enough. Try that instead of gdb --args.

> Do compiler versions matter here?

It should not.
https://bugzilla.novell.com/show_bug.cgi?id=680120#c17
Markus Elfring
> Is there a bug entry somewhere here?

By the way: How do you think about an information by Mario Schulze?
https://bugzilla.novell.com/show_bug.cgi?id=677225#c1

> I cannot deduce anything from that example...

I would appreciate if I can use the current tool "zypper" for improved software management. I hope that it will make further updates for Perl and Python dependencies easier on my system.

> Ok, probably the strace output would be enough.

How do you think about the following excerpt?

..
close(3) = 0
stat("oldconfig", 0x7fffbf6b15b0) = -1 ENOENT (No such file or directory)
pipe([3, 4]) = 0
close(4) = 0
fcntl(3, F_SETFD, FD_CLOEXEC) = 0
pipe([4, 5]) = 0
vfork() = 6704
close(5) = 0
read(4, "", 200) = 0
--- SIGCHLD (Child exited) @ 0 (0) ---
rt_sigreturn(0xffffffff) = 0
close(4) = 0
wait4(-1, [{WIFEXITED(s) && WEXITSTATUS(s) == 1}], 0, NULL) = 6704
rt_sigprocmask(SIG_BLOCK, [HUP INT QUIT TERM XCPU XFSZ], NULL, 8) = 0
vfork() = 6706
rt_sigprocmask(SIG_SETMASK, [], NULL, 8) = 0
wait4(-1, HOSTCC scripts/basic/fixdep
/bin/sh: line 1: 6713 Segmentation fault scripts/basic/fixdep scripts/basic/.fixdep.d scripts/basic/fixdep 'gcc -Wp,-MD,scripts/basic/.fixdep.d -Wall -Wmissing-prototypes -Wstrict-prototypes -O2 -fomit-frame-pointer -o scripts/basic/fixdep scripts/basic/fixdep.c ' > scripts/basic/.fixdep.tmp
make[1]: *** [scripts/basic/fixdep] Error 139
[{WIFEXITED(s) && WEXITSTATUS(s) == 2}], 0, NULL) = 6706
--- SIGCHLD (Child exited) @ 0 (0) ---
..

Is the incomplete checking of return values a relevant implementation detail in the source file (e.g. function "do_config_file") for the reported situation?
http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=blob;f=sc...
https://bugzilla.novell.com/show_bug.cgi?id=680120#c18
Jiri Slaby
> How do you think about the following excerpt?

Unfortunately useless. I need the start of the trace, and not the strace of make but the strace of fixdep. I.e.:
strace -o fixdep_strace ./scripts/basic/fixdep scripts/basic/.fixdep.d scripts/basic/fixdep 'gcc -Wp,-MD,scripts/basic/.fixdep.d -Wall -Wmissing-prototypes -Wstrict-prototypes -O2 -fomit-frame-pointer -o scripts/basic/fixdep scripts/basic/fixdep.c '
and attach fixdep_strace.

> Is the incomplete checking of return values a relevant implementation detail in the source file (e.g. function "do_config_file") for the reported situation?

I don't know. It might matter. Try to add the checking like:
..
		exit(2);
	}
	if (fstat(fd, &st)) {
		perror("fstat");
		return;
	}
	if (st.st_size == 0) {
..
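
The kind of checking suggested in comment #18 can be tried outside of the kernel tree with a small stand-alone helper. The following is a sketch, not the actual fixdep.c code: the function name file_size_checked is invented for illustration, and it only shows the pattern of testing the return values of open() and fstat() before st_size is used.

```c
#include <fcntl.h>
#include <stdio.h>
#include <sys/stat.h>
#include <sys/types.h>
#include <unistd.h>

/* Hypothetical helper (not part of fixdep.c): returns the size of the
 * file at `path`, or -1 after printing a diagnostic if open() or
 * fstat() fails. */
static off_t file_size_checked(const char *path)
{
    struct stat st;
    int fd = open(path, O_RDONLY);

    if (fd < 0) {
        perror(path);
        return -1;
    }
    if (fstat(fd, &st) != 0) {
        perror("fstat");
        close(fd);      /* do not leak the descriptor on the error path */
        return -1;
    }
    close(fd);
    return st.st_size;
}
```

A caller can then treat -1 as "could not inspect the file" and 0 as "file is empty" instead of reading an uninitialised struct stat when fstat() fails.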
https://bugzilla.novell.com/show_bug.cgi?id=680120#c19
Markus Elfring
https://bugzilla.novell.com/show_bug.cgi?id=680120#c20
Jiri Slaby
https://bugzilla.novell.com/show_bug.cgi?id=680120#c21
Petr Baudis
https://bugzilla.novell.com/show_bug.cgi?id=680120#c
Jiri Slaby
https://bugzilla.novell.com/show_bug.cgi?id=680120#c22
Markus Elfring
https://bugzilla.novell.com/show_bug.cgi?id=680120#c23
--- Comment #23 from Markus Elfring
https://bugzilla.novell.com/show_bug.cgi?id=680120#c24
--- Comment #24 from Markus Elfring
https://bugzilla.novell.com/show_bug.cgi?id=680120#c25
--- Comment #25 from Markus Elfring
https://bugzilla.novell.com/show_bug.cgi?id=680120#c26
Jiri Slaby
> Why has the program that was generated by "my GCC 4.5.2" got problems?

Perhaps a bug in the compiler. As you use your own compiler, you're on your own.
https://bugzilla.novell.com/show_bug.cgi?id=680120#c27
--- Comment #27 from Markus Elfring 2011-03-29 14:46:21 UTC ---
(In reply to comment #26)

How do you think about the following result?

Sonne:/usr/src/linux-2.6.38.y # git pull
..
From git://git.kernel.org/pub/scm/linux/kernel/git/stable/linux-2.6.38.y
   521cb40..cf6013b master -> origin/master
 * [new tag] v2.6.38.2 -> v2.6.38.2
From git://git.kernel.org/pub/scm/linux/kernel/git/stable/linux-2.6.38.y
 * [new tag] v2.6.38.1 -> v2.6.38.1
Updating 521cb40..cf6013b
..
Sonne:/usr/src/linux-2.6.38.y # make CC:=/usr/bin/gcc V:=1 oldconfig
make -f scripts/Makefile.build obj=scripts/basic
  gcc -Wp,-MD,scripts/basic/.fixdep.d -Wall -Wmissing-prototypes -Wstrict-prototypes -O2 -fomit-frame-pointer -o scripts/basic/fixdep scripts/basic/fixdep.c
/bin/sh: line 1: 2958 Segmentation fault (core dumped) scripts/basic/fixdep scripts/basic/.fixdep.d scripts/basic/fixdep 'gcc -Wp,-MD,scripts/basic/.fixdep.d -Wall -Wmissing-prototypes -Wstrict-prototypes -O2 -fomit-frame-pointer -o scripts/basic/fixdep scripts/basic/fixdep.c ' > scripts/basic/.fixdep.tmp
make[1]: *** [scripts/basic/fixdep] Error 139
make: *** [scripts_basic] Error 2
https://bugzilla.novell.com/show_bug.cgi?id=680120#c28
--- Comment #28 from Jiri Slaby
> Sonne:/usr/src/linux-2.6.38.y # make CC:=/usr/bin/gcc V:=1 oldconfig
> make -f scripts/Makefile.build obj=scripts/basic
> gcc -Wp,-MD,scripts/basic/.fixdep.d -Wall -Wmissing-prototypes

You also have to set HOSTCC for this to take effect.
https://bugzilla.novell.com/show_bug.cgi?id=680120#c29
--- Comment #29 from Markus Elfring
https://bugzilla.novell.com/show_bug.cgi?id=680120#c30
--- Comment #30 from Jiri Slaby
https://bugzilla.novell.com/show_bug.cgi?id=680120#c31
--- Comment #31 from Markus Elfring
https://bugzilla.novell.com/show_bug.cgi?id=680120#c32
--- Comment #32 from Markus Elfring