[Bug 576909] New: strange OOPSes and crashes after updating to 11.2
http://bugzilla.novell.com/show_bug.cgi?id=576909

http://bugzilla.novell.com/show_bug.cgi?id=576909#c0

Summary: strange OOPSes and crashes after updating to 11.2
Classification: openSUSE
Product: openSUSE 11.2
Version: Final
Platform: Other
OS/Version: openSUSE 11.2
Status: NEW
Severity: Critical
Priority: P5 - None
Component: Kernel
AssignedTo: kernel-maintainers@forge.provo.novell.com
ReportedBy: jsj@jsj.dyndns.org
QAContact: qa@suse.de
Found By: ---
Blocker: ---

User-Agent: Opera/9.80 (X11; Linux x86_64; U; en) Presto/2.2.15 Version/10.10

I have two Dell PowerEdge 850s, both of which ran fine for more than 3 years. I updated from 11.1 to 11.2 using zypper dup, and since then the machine locks up every 3 or 4 hours, although sometimes it stays up for two days. I receive this OOPS:

===
[173192.691025] BUG: unable to handle kernel NULL pointer dereference at 0000000000000002
[173192.692003] IP: [<ffffffff81321de3>] put_tty_queue+0x53/0xb0
[173192.692003] PGD 0
[173192.692003] Oops: 0002 [#1] PREEMPT SMP
[173192.692003] last sysfs file: /sys/devices/system/cpu/cpu1/cache/index1/shared_cpu_map
[173192.692003] CPU 0
[173192.692003] Modules linked in: dell_rbu ftdi_sio usbserial ipmi_devintf ipmi_si ipmi_msghandler edd af_packet microcode fuse loop iTCO_wdt iTCO_vendor_support shpchp sr_mod i3000_edac pcspkr dcdbas tg3 cdrom i2c_i801 pci_hotplug sg edac_core button ext4 jbd2 crc16 linear raid456 raid6_pq async_xor async_memcpy async_tx xor raid1 raid0 dm_snapshot dm_mod fan processor pata_acpi piix ide_core thermal thermal_sys
[173192.692003] Pid: 23281, comm: telnet Not tainted 2.6.31.8-0.1-desktop #1 PowerEdge 850
[173192.692003] RIP: 0010:[<ffffffff81321de3>] [<ffffffff81321de3>] put_tty_queue+0x53/0xb0
[173192.692003] RSP: 0018:ffff880026c27728 EFLAGS: 00010097
[173192.692003] RAX: 0000000000000286 RBX: ffff88003dcabd1c RCX: 0000000000000000
[173192.692003] RDX: 0000000000000002 RSI: 0000000000000286 RDI: ffff88003dcabd1c
[173192.692003] RBP: ffff880026c27758 R08: ffff880026c27de0 R09: 0000000000000000
[173192.692003] R10: 00000000ffffffff R11: 0000000000000001 R12: 0000000000000074
[173192.692003] R13: ffff88003dcab800 R14: ffff880037c3646c R15: 0000000000000002
[173192.692003] FS: 00007fcfebd4b6f0(0000) GS:ffff880001c80000(0000) knlGS:0000000000000000
[173192.692003] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[173192.692003] CR2: 0000000000000002 CR3: 000000003c55b000 CR4: 00000000000006f0
[173192.692003] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[173192.692003] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[173192.692003] Process telnet (pid: 23281, threadinfo ffff880026c26000, task ffff8800218e2740)
[173192.692003] Stack:
[173192.692003] ffff8800218e2778 0000000032f80331 ffff880001c93a00 ffff88003dcab800
[173192.692003] <0> 0000000000000074 0000000000000000 ffff880026c27798 ffffffff813248ac
[173192.692003] <0> 0000000000000000 0000000032f80331 ffff88003dcab800 ffff880037c3656d
[173192.692003] Call Trace:
[173192.692003] [<ffffffff813248ac>] n_tty_receive_char+0x1dc/0x7f0
[173192.692003] [<ffffffff8132509c>] n_tty_receive_buf+0x1dc/0x490
[173192.692003] [<ffffffff813280cb>] flush_to_ldisc+0x19b/0x1d0
[173192.692003] [<ffffffff813281c3>] tty_flush_to_ldisc+0x23/0x40
[173192.692003] [<ffffffff81322a11>] n_tty_poll+0x81/0x1c0
[173192.692003] [<ffffffff8131d822>] tty_poll+0xa2/0xc0
[173192.692003] [<ffffffff81160395>] do_select+0x3e5/0x740
[173192.692003] [<ffffffff81160aaa>] core_sys_select+0x22a/0x3e0
[173192.692003] [<ffffffff81160f11>] sys_select+0x51/0x130
[173192.692003] [<ffffffff8100c682>] system_call_fastpath+0x16/0x1b
[173192.692003] [<00007fcfeb66c6c3>] 0x7fcfeb66c6c3
[173192.692003] Code: 25 28 00 00 00 48 89 45 d8 31 c0 e8 a8 42 23 00 41 81 bd 60 02 00 00 ff 0f 00 00 7f 31 49 63 95 58 02 00 00 49 8b 8d 50 02 00 00 <44> 88 24 11 41 8b 95 58 02 00 00 41 83 85 60 02 00 00 01 83 c2
[173192.692003] RIP [<ffffffff81321de3>] put_tty_queue+0x53/0xb0
[173192.692003] RSP <ffff880026c27728>
[173192.692003] CR2: 0000000000000002
[173193.795695] ---[ end trace ad9dabd8b20d69ef ]---
[173193.796006] note: telnet[23281] exited with preempt_count 1
[173193.873177] BUG: scheduling while atomic: telnet/23281/0x10000002
[173193.912921] Modules linked in: dell_rbu ftdi_sio usbserial ipmi_devintf ipmi_si ipmi_msghandler edd af_packet microcode fuse loop iTCO_wdt iTCO_vendor_support shpchp sr_mod i3000_edac pcspkr dcdbas tg3 cdrom i2c_i801 pci_hotplug sg edac_core button ext4 jbd2 crc16 linear raid456 raid6_pq async_xor async_memcpy async_tx xor raid1 raid0 dm_snapshot dm_mod fan processor pata_acpi piix ide_core thermal thermal_sys
[173194.059193] CPU 0:
[173194.097465] Modules linked in: dell_rbu ftdi_sio usbserial ipmi_devintf ipmi_si ipmi_msghandler edd af_packet microcode fuse loop iTCO_wdt iTCO_vendor_support shpchp sr_mod i3000_edac pcspkr dcdbas tg3 cdrom i2c_i801 pci_hotplug sg edac_core button ext4 jbd2 crc16 linear raid456 raid6_pq async_xor async_memcpy async_tx xor raid1 raid0 dm_snapshot dm_mod fan processor pata_acpi piix ide_core thermal thermal_sys
[173194.254302] Pid: 23281, comm: telnet Tainted: G D 2.6.31.8-0.1-desktop #1 PowerEdge 850
[173194.304201] RIP: 0010:[<ffffffff81321de3>] [<ffffffff81321de3>] put_tty_queue+0x53/0xb0
[173194.353912] RSP: 0018:ffff880026c27728 EFLAGS: 00010097
[173194.400719] RAX: 0000000000000286 RBX: ffff88003dcabd1c RCX: 0000000000000000
[173194.449671] RDX: 0000000000000002 RSI: 0000000000000286 RDI: ffff88003dcabd1c
[173194.498564] RBP: ffff880026c27758 R08: ffff880026c27de0 R09: 0000000000000000
[173194.547518] R10: 00000000ffffffff R11: 0000000000000001 R12: 0000000000000074
[173194.596420] R13: ffff88003dcab800 R14: ffff880037c3646c R15: 0000000000000002
[173194.645243] FS: 0000000000000000(0000) GS:ffff880001c80000(0000) knlGS:0000000000000000
[173194.695348] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[173194.742611] CR2: 0000000000000002 CR3: 0000000001001000 CR4: 00000000000006f0
[173194.791232] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[173194.839569] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[173194.887835] Call Trace:
[173194.930826] [<ffffffff813248ac>] n_tty_receive_char+0x1dc/0x7f0
[173194.977619] [<ffffffff8132509c>] n_tty_receive_buf+0x1dc/0x490
[173195.023312] [<ffffffff813280cb>] flush_to_ldisc+0x19b/0x1d0
[173195.067614] [<ffffffff813281c3>] tty_flush_to_ldisc+0x23/0x40
[173195.111021] [<ffffffff81322a11>] n_tty_poll+0x81/0x1c0
[173195.153233] [<ffffffff8131d822>] tty_poll+0xa2/0xc0
[173195.194734] [<ffffffff81160395>] do_select+0x3e5/0x740
[173195.236191] [<ffffffff81160aaa>] core_sys_select+0x22a/0x3e0
[173195.278023] [<ffffffff81160f11>] sys_select+0x51/0x130
[173195.319248] [<ffffffff8100c682>] system_call_fastpath+0x16/0x1b
[173195.361252] [<00007fcfeb66c6c3>] 0x7fcfeb66c6c3
===
then, after about a minute
===
[173260.055001] BUG: soft lockup - CPU#0 stuck for 61s! [telnet:23281]
[173260.055001] Modules linked in: dell_rbu ftdi_sio usbserial ipmi_devintf ipmi_si ipmi_msghandler edd af_packet microcode fuse loop iTCO_wdt iTCO_vendor_support shpchp sr_mod i3000_edac pcspkr dcdbas tg3 cdrom i2c_i801 pci_hotplug sg edac_core button ext4 jbd2 crc16 linear raid456 raid6_pq async_xor async_memcpy async_tx xor raid1 raid0 dm_snapshot dm_mod fan processor pata_acpi piix ide_core thermal thermal_sys
[173260.055001] CPU 0:
[173260.055001] Modules linked in: dell_rbu ftdi_sio usbserial ipmi_devintf ipmi_si ipmi_msghandler edd af_packet microcode fuse loop iTCO_wdt iTCO_vendor_support shpchp sr_mod i3000_edac pcspkr dcdbas tg3 cdrom i2c_i801 pci_hotplug sg edac_core button ext4 jbd2 crc16 linear raid456 raid6_pq async_xor async_memcpy async_tx xor raid1 raid0 dm_snapshot dm_mod fan processor pata_acpi piix ide_core thermal thermal_sys
[173260.055001] Pid: 23281, comm: telnet Tainted: G D 2.6.31.8-0.1-desktop #1 PowerEdge 850
[173260.055001] RIP: 0010:[<ffffffff8103a67c>] [<ffffffff8103a67c>] __ticket_spin_lock+0x2c/0x50
[173260.055001] RSP: 0018:ffff880026c27398 EFLAGS: 00000297
[173260.055001] RAX: 0000000000000b03 RBX: ffff880026c273a8 RCX: 00000000000002da
[173260.055001] RDX: 0000000000000b02 RSI: 0000000000000286 RDI: ffffffff818b3880
[173260.055001] RBP: ffffffff8100d27e R08: 00000000ffffffff R09: 0000000000000000
[173260.055001] R10: 00000000ffffffff R11: 0000000000000001 R12: ffff8800218e2740
[173260.055001] R13: ffff88003c546040 R14: 0000000000013a00 R15: ffff8800218e2b00
[173260.055001] FS: 00007fcfebd4b6f0(0000) GS:ffff880001c80000(0000) knlGS:0000000000000000
[173260.055001] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[173260.055001] CR2: 00007f0d65226000 CR3: 000000003e8b3000 CR4: 00000000000006f0
[173260.055001] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[173260.055001] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[173260.055001] Call Trace:
[173260.055001] Inexact backtrace:
[173260.055001]
[173260.055001] [<ffffffff815569e0>] ? lock_kernel+0xc0/0x152
[173260.055001] [<ffffffff8131ef34>] ? disassociate_ctty+0x44/0x270
[173260.055001] [<ffffffff8106f42a>] ? do_exit+0x1aa/0x3c0
[173260.055001] [<ffffffff8106ab81>] ? print_oops_end_marker+0x31/0x50
[173260.055001] [<ffffffff81557b83>] ? oops_end+0xd3/0x130
[173260.055001] [<ffffffff810448fb>] ? no_context+0x10b/0x1a0
[173260.055001] [<ffffffff81044b0d>] ? __bad_area_nosemaphore+0x17d/0x230
[173260.055001] [<ffffffff81044be1>] ? bad_area_nosemaphore+0x21/0x40
[173260.055001] [<ffffffff81559daa>] ? do_page_fault+0x3aa/0x470
[173260.055001] [<ffffffff81556bf5>] ? page_fault+0x25/0x30
[173260.055001] [<ffffffff81321de3>] ? put_tty_queue+0x53/0xb0
[173260.055001] [<ffffffff81321dc8>] ? put_tty_queue+0x38/0xb0
[173260.055001] [<ffffffff813248ac>] ? n_tty_receive_char+0x1dc/0x7f0
[173260.055001] [<ffffffff8132509c>] ? n_tty_receive_buf+0x1dc/0x490
[173260.055001] [<ffffffff815564d6>] ? _spin_unlock_irq+0x26/0x90
[173260.055001] [<ffffffff8105703a>] ? finish_task_switch+0x13a/0x170
[173260.055001] [<ffffffff8103a7d6>] ? default_spin_lock_flags+0x26/0x50
[173260.055001] [<ffffffff8103a7d6>] ? default_spin_lock_flags+0x26/0x50
[173260.055001] [<ffffffff813280cb>] ? flush_to_ldisc+0x19b/0x1d0
[173260.055001] [<ffffffff813281c3>] ? tty_flush_to_ldisc+0x23/0x40
[173260.055001] [<ffffffff81322a11>] ? n_tty_poll+0x81/0x1c0
[173260.055001] [<ffffffff8131d822>] ? tty_poll+0xa2/0xc0
[173260.055001] [<ffffffff81160395>] ? do_select+0x3e5/0x740
[173260.055001] [<ffffffff811606f0>] ? __pollwait+0x0/0x110
[173260.055001] [<ffffffff81160800>] ? pollwake+0x0/0x80
[173260.055001] [<ffffffff81160800>] ? pollwake+0x0/0x80
[173260.055001] [<ffffffff81160800>] ? pollwake+0x0/0x80
[173260.055001] [<ffffffff81063cfe>] ? try_to_wake_up+0xce/0x350
[173260.055001] [<ffffffff81063fa0>] ? default_wake_function+0x20/0x40
[173260.055001] [<ffffffff8104fc89>] ? __wake_up_common+0x69/0xb0
[173260.055001] [<ffffffff81054c5d>] ? __wake_up+0x5d/0x90
[173260.055001] [<ffffffff81160aaa>] ? core_sys_select+0x22a/0x3e0
[173260.055001] [<ffffffff813272fc>] ? tty_ldisc_deref+0x1c/0x40
[173260.055001] [<ffffffff8131f678>] ? tty_write+0x248/0x2b0
[173260.055001] [<ffffffff81160f11>] ? sys_select+0x51/0x130
[173260.055001] [<ffffffff8114c82b>] ? sys_write+0x5b/0xa0
[173260.055001] [<ffffffff8100c682>] ? system_call_fastpath+0x16/0x1b
===

and this repeats every 65-67 seconds.

I replaced the RAM and tried the pae and default kernel flavours, initially on i586. I then used the second 850 for a fresh x86_64 install, otherwise a replica of the original machine (this is where the OOPSes above were captured), and got the same OOPSes.

As the primary machine is our syslog server, I tried 2.6.33rc5 from KOTD on it; that kernel has been stable for 6 days now.

Do I need to supply more information?

Reproducible: Always

Steps to Reproduce:
1. Install 11.2 on a PE 850 with the current update kernel.
2. Run and wait.
-- 
Configure bugmail: http://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.
http://bugzilla.novell.com/show_bug.cgi?id=576909#c1
Jeff Mahoney
http://bugzilla.novell.com/show_bug.cgi?id=576909#c2
Greg Kroah-Hartman
http://bugzilla.novell.com/show_bug.cgi?id=576909#c3
Stefan Schmidt
http://bugzilla.novell.com/show_bug.cgi?id=576909#c4
--- Comment #4 from Greg Kroah-Hartman
http://bugzilla.novell.com/show_bug.cgi?id=576909#c5
Stefan Schmidt
http://bugzilla.novell.com/show_bug.cgi?id=576909#c6
Stefan Schmidt
http://bugzilla.novell.com/show_bug.cgi?id=576909#c7
--- Comment #7 from Greg Kroah-Hartman
https://bugzilla.novell.com/show_bug.cgi?id=576909#c8
Greg Kroah-Hartman
https://bugzilla.novell.com/show_bug.cgi?id=576909#c9
--- Comment #9 from Stefan Botter
https://bugzilla.novell.com/show_bug.cgi?id=576909#c10
Greg Kroah-Hartman