[Bug 233126] New: Occasional kernel panic
https://bugzilla.novell.com/show_bug.cgi?id=233126 Summary: Occasional kernel panic Product: openSUSE 10.2 Version: Final Platform: 32bit OS/Version: Linux Status: NEW Severity: Critical Priority: P5 - None Component: Kernel AssignedTo: kernel-maintainers@forge.provo.novell.com ReportedBy: win@huber-und-boehm.de QAContact: qa@suse.de Hi there, I run a fully patched OpenSuSE 10.2, kernel 2.6.18.2-34-default, and get occasional kernel crashs. The machine seems to be frozen afterward - no ping answer. Can't figure out any action that causes the crash. Uptime varies from minutes to days. Crashs occurs by night (idle machine), on editing, browsing, ... To get more info about this I attached a serial console. I attach the crash message below. More crash logs to follow :-(( Specs: Hardware: Asus P5AD2 Premium, Pentium 4 3.2 GHz SATA: 3 discs, DVD Plextor PX-712A IDE: DVD Plextor PX-716A SCSI: Adaptec 2940 Ultra SCSI adapter, DAT+Exabyte tape USB: some hubs, HP 970cxi Printer, ext. HP DAT-72 tape, all switched off additional sound card + TV card I suspected the not supported Fritz Card USB kernel modules ... fcusb-kmp-default-0.1_2.6.18.2_34-0.i586.rpm and fcusb2-kmp-default-0.1_2.6.18.2_34-0.i586.rpm .. so I deinstalled them and disconnected the Fritz!Card USB - HylaFax is down now :-( But this did not help. The machine still crashs. Right after booting the taint value is 64 - can't figure out which kernel module taints my kernel. I looked for kernel modules lacking the "supported: yes" tag and found bt878, hwmon, it821x and w83627ehf. But they are part of the kernel package, not any weird modules. So I think the kernel should be clean. After the crash more tainted flags are set. Any help will be greatly appreciated, and I will be glad to provide more info as needed. Winfried Huber -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #1 from win@huber-und-boehm.de 2007-01-10 03:57 MST ------- Created an attachment (id=112175) --> (https://bugzilla.novell.com/attachment.cgi?id=112175&action=view) Crash output on serial console Messages generated by the crash on a serial console. Catched on a windows machine. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #2 from win@huber-und-boehm.de 2007-01-10 04:00 MST ------- Created an attachment (id=112179) --> (https://bugzilla.novell.com/attachment.cgi?id=112179&action=view) boot messages on serial console This is the output on a serial console from the boot before the crash - IMHO nothing suspicous. But I want to make this accessible anyway so you can look it up if desired. Output captured on a windows machine - thus CR/LF -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 lmb@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- AssignedTo|kernel- |npiggin@novell.com |maintainers@forge.provo.nove| |ll.com | ------- Comment #3 from lmb@novell.com 2007-01-10 04:34 MST ------- This looks like a crash in the scheduler. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #4 from win@huber-und-boehm.de 2007-01-11 13:50 MST ------- Created an attachment (id=112546) --> (https://bugzilla.novell.com/attachment.cgi?id=112546&action=view) Crash#2 output on serial console Maybe this provides additional information - I will keep on adding more crash logs. As the suspected (because unsupported) kernel modules fcusb and fcusb2 are obviously not guilty: Would you mind if I reinstall them? This would make by fax machine work again. But I don't want to make the crash logs worthlsss ... so I ask ... Cheers, Winfried -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 npiggin@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |NEEDINFO Info Provider| |win@huber-und-boehm.de ------- Comment #5 from npiggin@novell.com 2007-01-11 22:48 MST -------
From the looks of your first oops, it crashed in the first loop in kernel/sched.c:rebalance_tick() -- scale went to 0, which seems to be impossible (but let me know if it crashes there again).
Your second crash is somewhere different, so this could point towards a hardware error, or maybe a random memory scribble. Could you try running memtest86 overnight, and also post a few more oops messages? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #6 from win@huber-und-boehm.de 2007-01-12 00:51 MST ------- Created an attachment (id=112586) --> (https://bugzilla.novell.com/attachment.cgi?id=112586&action=view) Crash#3 output on serial console -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #7 from win@huber-und-boehm.de 2007-01-12 01:09 MST ------- Created an attachment (id=112591) --> (https://bugzilla.novell.com/attachment.cgi?id=112591&action=view) Older Crash output on serial console (see Comment) As this crash occured there was the Fritz!Card USB v2.1 attached to the machine (and probably the unsupported kernel modules fcusb and fcusb2 loaded) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #8 from win@huber-und-boehm.de 2007-01-12 01:21 MST ------- Created an attachment (id=112593) --> (https://bugzilla.novell.com/attachment.cgi?id=112593&action=view) One more old Crash output on serial console (see Comment) As this crash occured there was the Fritz!Card USB v2.1 attached to the machine (and probably the unsupported kernel modules fcusb and fcusb2 loaded) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #9 from win@huber-und-boehm.de 2007-01-12 01:39 MST ------- Created an attachment (id=112604) --> (https://bugzilla.novell.com/attachment.cgi?id=112604&action=view) Crash log (digital camera image) At this time (Jan 3rd) the console was a DEC VT510 terminal. Got a digital camera shot and reworked it with gimp to make it smaller and give better contrast). -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #10 from win@huber-und-boehm.de 2007-01-12 01:42 MST ------- Created an attachment (id=112606) --> (https://bugzilla.novell.com/attachment.cgi?id=112606&action=view) Crash#4 output on serial cosole a new one again :-( -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 win@huber-und-boehm.de changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |NEW Info Provider|win@huber-und-boehm.de | ------- Comment #11 from win@huber-und-boehm.de 2007-01-12 01:49 MST ------- Added two new crash logs (Crash#3 output and Crash#4 output) and some older ones (of the last days). I ran memtest86 a few days ago for 2 full cycles - all fine. Will give it another chance (and more time) next night. The machine was running SuSE-9.3 before - uptime for weeks, no problem. I will keep on providing new crash logs as they occur - the machine makes it possible (Sigh...) Cheers, Winfried -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 npiggin@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |ASSIGNED ------- Comment #12 from npiggin@novell.com 2007-01-12 03:18 MST ------- No scrap that, your latest batch of crashes do show that there is indeed a pattern, so it is probably a bug in the scheduler code. Tomorrow I'll make up a patch for you to that might help narrow it down. Thanks.. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #13 from win@huber-und-boehm.de 2007-01-13 01:22 MST ------- Created an attachment (id=112844) --> (https://bugzilla.novell.com/attachment.cgi?id=112844&action=view) Crash#5 output on serial console one more ... -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #14 from win@huber-und-boehm.de 2007-01-14 03:21 MST ------- Created an attachment (id=112903) --> (https://bugzilla.novell.com/attachment.cgi?id=112903&action=view) Crash#6 output on serial console -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #15 from win@huber-und-boehm.de 2007-01-15 02:28 MST ------- Created an attachment (id=112941) --> (https://bugzilla.novell.com/attachment.cgi?id=112941&action=view) Crash#7 output on serial console looks somewhat different -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #16 from win@huber-und-boehm.de 2007-01-16 02:19 MST ------- Created an attachment (id=113065) --> (https://bugzilla.novell.com/attachment.cgi?id=113065&action=view) Crash#8 output on serial console The machine keeps on happy crashing... BTW: As the unsupported kernel modules fcusb and fcusb2 are obiously not guilty and I needed my fax machine I reinstalled them and plugged my "Fritz!CARD USB v2.1" so Hylafax gets back to work. This is the first "Crash#?" output with fcusb+fcusb2 back to work. Hope you don't mind. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #17 from win@huber-und-boehm.de 2007-01-16 05:29 MST ------- Created an attachment (id=113100) --> (https://bugzilla.novell.com/attachment.cgi?id=113100&action=view) Crash#9 output on serial console double oops again... @nick: Can I do anything to make the machine more stable for now? I won't mind losing performance! -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #18 from win@huber-und-boehm.de 2007-01-17 15:30 MST ------- Created an attachment (id=113510) --> (https://bugzilla.novell.com/attachment.cgi?id=113510&action=view) Crash#10 output on serial console -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #19 from win@huber-und-boehm.de 2007-01-17 18:47 MST ------- Created an attachment (id=113533) --> (https://bugzilla.novell.com/attachment.cgi?id=113533&action=view) Crash#11 output on serial console Any clue what I can do to make the machine more stable: Or perhaps to make it boot automatically after a crash? TIA, Winfried -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #20 from win@huber-und-boehm.de 2007-01-18 01:30 MST ------- Created an attachment (id=113559) --> (https://bugzilla.novell.com/attachment.cgi?id=113559&action=view) Crash#12 output on serial console -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #21 from win@huber-und-boehm.de 2007-01-18 01:31 MST ------- Created an attachment (id=113560) --> (https://bugzilla.novell.com/attachment.cgi?id=113560&action=view) Crash#13 output on serial console -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #22 from win@huber-und-boehm.de 2007-01-18 01:34 MST ------- Created an attachment (id=113562) --> (https://bugzilla.novell.com/attachment.cgi?id=113562&action=view) Crash#14 output on serial console lots of crashes... I uninstalled the unsupported modules fcusb-kmp-default-0.1_2.6.18.2_34-0 and fcusb2-kmp-default-0.1_2.6.18.2_34-0 ... next crash log will be widout them. Hopefully the uptime will increase now -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 win@huber-und-boehm.de changed: What |Removed |Added ---------------------------------------------------------------------------- Attachment #113559|0 |1 is obsolete| | ------- Comment #23 from win@huber-und-boehm.de 2007-01-18 05:39 MST ------- (From update of attachment 113559) This obviously is a duplicate of the last crash log. Sorry for the noise! Winfried -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #24 from win@huber-und-boehm.de 2007-01-18 05:44 MST ------- Created an attachment (id=113619) --> (https://bugzilla.novell.com/attachment.cgi?id=113619&action=view) the machine keeps on crashing :-( argghhhh ... my wife starts argueing I should have sticked to SuSE 9.3 ... -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #25 from npiggin@novell.com 2007-01-21 22:33 MST ------- Sorry for the delay :( I was away last week. Still looking at your crash logs... can't make too much sense of them yet, but they could point to some low level bootup code not doing the right thing... Can you try booting with maxcpus=1 as a command line option, and see whether that makes your system more stable? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #26 from win@huber-und-boehm.de 2007-02-02 02:14 MST ------- No, maxcpus=1 seems to make no difference. Two weeks ago I hoped the system got more stable and looked for recently installed patches. The only one I could imagine to be the reason was hal. But this did not hold. Following your suggestion I added maxcpus=1 to ny kernel cmd line. The system still crashs, uptime is about 24..60 hours. So I abandoned maxcpus=1 after the last crash (this morning), No more crash logs because I need the only serial interface for an old fax modem I recently disinterred. - would more crash logs be useful? - should I uploat boot.msg (kernel log level boosted)? Would it be helpful to provide some more crash logs? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #27 from npiggin@novell.com 2007-02-20 08:51 MST ------- Can you get a crash log for the maxcpus=1 case? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #28 from win@huber-und-boehm.de 2007-02-23 10:43 MST ------- intermediate report: As you requested I booted into maxcpus=1 and serial console. The System runs stable now, uptime more than 3 days, still running. The system never was up for such a long time since I installed openSuSE 10.2. Although the machine is somewhat sluggish now this is way better than a crash after less than 60 hours. I will keep on reporting. If you don't hear anything from me I'm still up with maxcpus=1. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #29 from win@huber-und-boehm.de 2007-02-28 03:57 MST ------- intermediate report#2: The machine was dead this morning - uptime was 7 days 15 hours. Unfortunately no crash log on the serial console, and sysrq did not work either. So I was unable to get any information what was going wrong. Maybe this is related to a sky2 driver issue - occasionally one of my 2 Marvell onboard networks is down - can't ping any device attached to this ring, e.g. thin clients. Running ... "rcnetwork down; rmmod sky2; modprobe sky2;rcnetwork start" .. solves this problem, no reboot needed. The thin clients are back to work, no need to log in, the X11 session is still running, no work is lost. But this problem is rare (maybe once a week). Kind regards, Winfried -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #30 from win@huber-und-boehm.de 2007-02-28 17:36 MST ------- Created an attachment (id=121699) --> (https://bugzilla.novell.com/attachment.cgi?id=121699&action=view) Crash#15: sysrq trace of frozen kernel (maxcpus=1) Hi Nick, the machine was frozen, running with "maxcpus=1". The last messages on the serial console were "capidrv-1: controller dead ??" and "capidrv-1: listen_change_state state=3 event=1 ????". sysrq's were still OK, so I got a ... - register dump showing EIP at "lock_kernel+..." - show state - show memory .. captured on the serial console, attached below. Hope this is not a completely different issue... Kind regards, Winfried -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #31 from win@huber-und-boehm.de 2007-03-04 05:08 MST ------- Created an attachment (id=122222) --> (https://bugzilla.novell.com/attachment.cgi?id=122222&action=view) Crash#15 output on serial console captured crash log on serial console (maxcpus=1) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #32 from npiggin@novell.com 2007-03-07 04:53 MST ------- OK the problem is still a mystery to me, but, I might have another similar bug on OpenSUSE, that isn't present in mainline kernel. Are you able to test your system with the latest 2.6.20 kernel from kernel.org? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #33 from win@huber-und-boehm.de 2007-03-07 12:14 MST ------- OK, I will grab the new kernel. But I need some time to do this. I will report... -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #34 from npiggin@novell.com 2007-03-08 04:52 MST ------- Thanks a lot. Sorry you are having so much trouble... -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 npiggin@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |NEEDINFO Info Provider| |win@huber-und-boehm.de -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 win@huber-und-boehm.de changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |ASSIGNED Info Provider|win@huber-und-boehm.de | ------- Comment #35 from win@huber-und-boehm.de 2007-03-24 16:13 MST ------- Created an attachment (id=126362) --> (https://bugzilla.novell.com/attachment.cgi?id=126362&action=view) Crash log new kernel 2.6.20.4 - better info! Hi Nick, sorry for the delay. Few time for the moment. Last Weekend I tried to get 2.6.20.3 to work. But I encountered severe poblems, file system issues. Today I grabbed 2.6.20.4 and managed to get it running reasonable. Due to fiddling the kernel hacking config options I guess this crash log is much better. Note: I get complaints about missing modules: ip6tables_filter, ip6tables_mangle and dm_mod. But the firewal is OK, including NAT. Did not figure out which config options are missing yet. Hope this is not related to the crash. Cheers, Winfried -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #36 from win@huber-und-boehm.de 2007-03-24 19:16 MST ------- Created an attachment (id=126363) --> (https://bugzilla.novell.com/attachment.cgi?id=126363&action=view) one more crash log produced by the 2.6.20.4 kernel At least I got a crash log on my serial console. This is not always the case; from time to time the kernel simply freezes, no ping, no sysctl. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #37 from win@huber-und-boehm.de 2007-03-25 23:41 MST ------- Created an attachment (id=126397) --> (https://bugzilla.novell.com/attachment.cgi?id=126397&action=view) kernel 2.20.6.4 now loads without complaints Hi Nick, after some studying .config I managed to get the new kernel booting without any complaints - see boot.msg. But the machine keeps on crashing happily. Sometimes a non-fatal OOps precedes the final crash. I will attach some crash and oops logs below. Winfried -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #38 from win@huber-und-boehm.de 2007-03-25 23:44 MST ------- Created an attachment (id=126398) --> (https://bugzilla.novell.com/attachment.cgi?id=126398&action=view) spinlock wrong owner oops non-fatal oops, machine continued to work, BUG: spinlock wrong owner on CPU#0, swapper/0 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #39 from win@huber-und-boehm.de 2007-03-25 23:46 MST ------- Created an attachment (id=126399) --> (https://bugzilla.novell.com/attachment.cgi?id=126399&action=view) fatal exception in interrupt <0>Kernel panic - not syncing: Fatal exception in interrupt -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #40 from win@huber-und-boehm.de 2007-03-25 23:49 MST ------- Created an attachment (id=126400) --> (https://bugzilla.novell.com/attachment.cgi?id=126400&action=view) BUG: at kernel/lockdep.c:1410 check_chain_key() preceding final crash This is one with... BUG: at kernel/lockdep.c:1410 check_chain_key() .. preceding the final crash about 50 minutes later. BTW: The messages between are log messages from my firewall (guess you know that). -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #41 from win@huber-und-boehm.de 2007-03-25 23:52 MST ------- Created an attachment (id=126401) --> (https://bugzilla.novell.com/attachment.cgi?id=126401&action=view) non-fatal Oops preceding crash about 20 minutes later [ 1257.258059] hm#2, depth: 3 [3], 0000000000000001 != 00000000de5bd001 [ 1257.277029] BUG: at kernel/lockdep.c:1410 check_chain_key() ... 20 minutes ... zong,,, [ 2540.442826] BUG: unable to handle kernel NULL pointer dereference at virtual address 0000004c -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #43 from ak@novell.com 2007-04-03 02:47 MST ------- Hmm, did the machine ever run stable? Could you perhaps go back to that kernel version for a few days and check if it still runs stable? That's just to verify you don't have a weird hardware problem. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 npiggin@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |NEEDINFO Info Provider| |win@huber-und-boehm.de -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 ------- Comment #44 from win@huber-und-boehm.de 2007-05-15 16:31 MST ------- Hi Andreas, the machine was stable running 9.2. Could not get 10.0 and 10.1 up, 10.0 did not install (hanging on gathering info), and 10.1 suffered from serious network problems. I needed to downgrade to 9.2. Downgrading back to 9.2 would be a problem for me. Luckily my wife is at home all the time and can reboot the machine as needed. So booting over and over is the minor hassle. As this seems to be a problem just for me I tend to believe in obscure hardware problems, too. I replaced piece for piece, memory, graphics card, kicked my SCSI controller out and so on. Did not help. I plan to rebuild the machine, new motherboard, new CPU now. I hope this will solve the problem. I will report. I was not able to do this earlier, no bucks... I was seriously ill and could not earn any money for more than one year. Not an easy situation for a freelancer. I will report as soon as I am able to provide reliable info; I want to see the machine run for a week or so before I shout "hurray!!". The status remains "NEEDINFO" ... Winfried -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 win@huber-und-boehm.de changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |ASSIGNED Info Provider|win@huber-und-boehm.de | ------- Comment #45 from win@huber-und-boehm.de 2007-05-23 18:08 MST ------- Hi Nick, Hi Andreas, I am really glad to report the problems are gone with my new hardware. I rebuilt the machine, new ... MB Asus P5B Deluxe WiFi P965 S775 and CPU Intel Core2 Duo E6600 2.4GHz .. and the machine is as stable as a rock (one week now). Smaller problems remain, can't get the second onboard GB LAN to fly, but this is not a real problem for me, and I guess this will be solved with 10.3 Guess we can't tell if this is was really broken hardware or a unfortunately instable combination of components, MB/Bios Version/CPU/other components. Anyway. I am happy now, and I apologize for the bad headache I have given to you, perhaps chasing a spurious hardware problem. I really enjoy the pleasures of Unix/Linux now, months without a second unplanned down time (OK, don't count the (very rare) power fails, and my new motherboard can be configured as "boot on power back"). Would you please be so kind to assign an appropriate new state to this ticket? I'm not sure wich state I should assign, FIXED, INVALID or WORKSFORME. Thanks again, Winfried -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=233126 npiggin@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |RESOLVED Resolution| |INVALID ------- Comment #46 from npiggin@novell.com 2007-05-23 20:36 MST ------- Hi Winfried, Well I am glad your new machine is stable now... it looks like a nice system! Please don't apologize for reporting your problems, you were really helpful and persistent in testing which is one of the most valuable things we have in the Linux community. I'm just sorry we couldn't work out what the problem was. Anyway, regarding your other problems like onboard LAN, you should report that too when you have time (if 10.3 does not solve it). I'll close this as invalid, as we might assume it is a hardware problem. Thanks, Nick -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
participants (1)
-
bugzilla_noreply@novell.com