Mailinglist Archive: opensuse-bugs (4664 mails)

[Bug 576909] New: strange OOPSes and crashes after updating to 11.2
  • From: bugzilla_noreply@xxxxxxxxxx
  • Date: Thu, 4 Feb 2010 09:12:08 +0000
  • Message-id: <bug-576909-21960@xxxxxxxxxxxxxxxxxxxxxxxx/>
http://bugzilla.novell.com/show_bug.cgi?id=576909

http://bugzilla.novell.com/show_bug.cgi?id=576909#c0


Summary: strange OOPSes and crashes after updating to 11.2
Classification: openSUSE
Product: openSUSE 11.2
Version: Final
Platform: Other
OS/Version: openSUSE 11.2
Status: NEW
Severity: Critical
Priority: P5 - None
Component: Kernel
AssignedTo: kernel-maintainers@xxxxxxxxxxxxxxxxxxxxxx
ReportedBy: jsj@xxxxxxxxxxxxxx
QAContact: qa@xxxxxxx
Found By: ---
Blocker: ---


User-Agent: Opera/9.80 (X11; Linux x86_64; U; en) Presto/2.2.15
Version/10.10

I have two Dell PowerEdge 850s, both of which ran fine for more than 3 years.
I updated from 11.1 to 11.2 using zypper dup. Since then, the machine locks up
every 3 or 4 hours, although sometimes it stays up for two days.

I receive this OOPS:

===
[173192.691025] BUG: unable to handle kernel NULL pointer dereference at
0000000000000002
[173192.692003] IP: [<ffffffff81321de3>] put_tty_queue+0x53/0xb0
[173192.692003] PGD 0
[173192.692003] Oops: 0002 [#1] PREEMPT SMP
[173192.692003] last sysfs file:
/sys/devices/system/cpu/cpu1/cache/index1/shared_cpu_map
[173192.692003] CPU 0
[173192.692003] Modules linked in: dell_rbu ftdi_sio usbserial ipmi_devintf
ipmi_si ipmi_msghandler edd af_packet microcode fuse loop iTCO_wdt
iTCO_vendor_support shpchp sr_mod i3000_edac pcspkr dcdbas tg3 cdrom i2c_i801
pci_hotplug sg edac_core button ext4 jbd2 crc16 linear raid456 raid6_pq
async_xor async_memcpy async_tx xor raid1 raid0 dm_snapshot dm_mod fan
processor pata_acpi piix ide_core thermal thermal_sys
[173192.692003] Pid: 23281, comm: telnet Not tainted 2.6.31.8-0.1-desktop #1
PowerEdge 850
[173192.692003] RIP: 0010:[<ffffffff81321de3>] [<ffffffff81321de3>]
put_tty_queue+0x53/0xb0
[173192.692003] RSP: 0018:ffff880026c27728 EFLAGS: 00010097
[173192.692003] RAX: 0000000000000286 RBX: ffff88003dcabd1c RCX:
0000000000000000
[173192.692003] RDX: 0000000000000002 RSI: 0000000000000286 RDI:
ffff88003dcabd1c
[173192.692003] RBP: ffff880026c27758 R08: ffff880026c27de0 R09:
0000000000000000
[173192.692003] R10: 00000000ffffffff R11: 0000000000000001 R12:
0000000000000074
[173192.692003] R13: ffff88003dcab800 R14: ffff880037c3646c R15:
0000000000000002
[173192.692003] FS: 00007fcfebd4b6f0(0000) GS:ffff880001c80000(0000)
knlGS:0000000000000000
[173192.692003] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[173192.692003] CR2: 0000000000000002 CR3: 000000003c55b000 CR4:
00000000000006f0
[173192.692003] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
0000000000000000
[173192.692003] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7:
0000000000000400
[173192.692003] Process telnet (pid: 23281, threadinfo ffff880026c26000, task
ffff8800218e2740)
[173192.692003] Stack:
[173192.692003] ffff8800218e2778 0000000032f80331 ffff880001c93a00
ffff88003dcab800
[173192.692003] <0> 0000000000000074 0000000000000000 ffff880026c27798
ffffffff813248ac
[173192.692003] <0> 0000000000000000 0000000032f80331 ffff88003dcab800
ffff880037c3656d
[173192.692003] Call Trace:
[173192.692003] [<ffffffff813248ac>] n_tty_receive_char+0x1dc/0x7f0
[173192.692003] [<ffffffff8132509c>] n_tty_receive_buf+0x1dc/0x490
[173192.692003] [<ffffffff813280cb>] flush_to_ldisc+0x19b/0x1d0
[173192.692003] [<ffffffff813281c3>] tty_flush_to_ldisc+0x23/0x40
[173192.692003] [<ffffffff81322a11>] n_tty_poll+0x81/0x1c0
[173192.692003] [<ffffffff8131d822>] tty_poll+0xa2/0xc0
[173192.692003] [<ffffffff81160395>] do_select+0x3e5/0x740
[173192.692003] [<ffffffff81160aaa>] core_sys_select+0x22a/0x3e0
[173192.692003] [<ffffffff81160f11>] sys_select+0x51/0x130
[173192.692003] [<ffffffff8100c682>] system_call_fastpath+0x16/0x1b
[173192.692003] [<00007fcfeb66c6c3>] 0x7fcfeb66c6c3
[173192.692003] Code: 25 28 00 00 00 48 89 45 d8 31 c0 e8 a8 42 23 00 41 81 bd
60 02 00 00 ff 0f 00 00 7f 31 49 63 95 58 02 00 00 49 8b 8d 50 02 00 00 <44> 88
24 11 41 8b 95 58 02 00 00 41 83 85 60 02 00 00 01 83 c2
[173192.692003] RIP [<ffffffff81321de3>] put_tty_queue+0x53/0xb0
[173192.692003] RSP <ffff880026c27728>
[173192.692003] CR2: 0000000000000002
[173193.795695] ---[ end trace ad9dabd8b20d69ef ]---
[173193.796006] note: telnet[23281] exited with preempt_count 1
[173193.873177] BUG: scheduling while atomic: telnet/23281/0x10000002
[173193.912921] Modules linked in: dell_rbu ftdi_sio usbserial ipmi_devintf
ipmi_si ipmi_msghandler edd af_packet microcode fuse loop iTCO_wdt
iTCO_vendor_support shpchp sr_mod i3000_edac pcspkr dcdbas tg3 cdrom i2c_i801
pci_hotplug sg edac_core button ext4 jbd2 crc16 linear raid456 raid6_pq
async_xor async_memcpy async_tx xor raid1 raid0 dm_snapshot dm_mod fan
processor pata_acpi piix ide_core thermal thermal_sys
[173194.059193] CPU 0:
[173194.097465] Modules linked in: dell_rbu ftdi_sio usbserial ipmi_devintf
ipmi_si ipmi_msghandler edd af_packet microcode fuse loop iTCO_wdt
iTCO_vendor_support shpchp sr_mod i3000_edac pcspkr dcdbas tg3 cdrom i2c_i801
pci_hotplug sg edac_core button ext4 jbd2 crc16 linear raid456 raid6_pq
async_xor async_memcpy async_tx xor raid1 raid0 dm_snapshot dm_mod fan
processor pata_acpi piix ide_core thermal thermal_sys
[173194.254302] Pid: 23281, comm: telnet Tainted: G D
2.6.31.8-0.1-desktop #1 PowerEdge 850
[173194.304201] RIP: 0010:[<ffffffff81321de3>] [<ffffffff81321de3>]
put_tty_queue+0x53/0xb0
[173194.353912] RSP: 0018:ffff880026c27728 EFLAGS: 00010097
[173194.400719] RAX: 0000000000000286 RBX: ffff88003dcabd1c RCX:
0000000000000000
[173194.449671] RDX: 0000000000000002 RSI: 0000000000000286 RDI:
ffff88003dcabd1c
[173194.498564] RBP: ffff880026c27758 R08: ffff880026c27de0 R09:
0000000000000000
[173194.547518] R10: 00000000ffffffff R11: 0000000000000001 R12:
0000000000000074
[173194.596420] R13: ffff88003dcab800 R14: ffff880037c3646c R15:
0000000000000002
[173194.645243] FS: 0000000000000000(0000) GS:ffff880001c80000(0000)
knlGS:0000000000000000
[173194.695348] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[173194.742611] CR2: 0000000000000002 CR3: 0000000001001000 CR4:
00000000000006f0
[173194.791232] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
0000000000000000
[173194.839569] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7:
0000000000000400
[173194.887835] Call Trace:
[173194.930826] [<ffffffff813248ac>] n_tty_receive_char+0x1dc/0x7f0
[173194.977619] [<ffffffff8132509c>] n_tty_receive_buf+0x1dc/0x490
[173195.023312] [<ffffffff813280cb>] flush_to_ldisc+0x19b/0x1d0
[173195.067614] [<ffffffff813281c3>] tty_flush_to_ldisc+0x23/0x40
[173195.111021] [<ffffffff81322a11>] n_tty_poll+0x81/0x1c0
[173195.153233] [<ffffffff8131d822>] tty_poll+0xa2/0xc0
[173195.194734] [<ffffffff81160395>] do_select+0x3e5/0x740
[173195.236191] [<ffffffff81160aaa>] core_sys_select+0x22a/0x3e0
[173195.278023] [<ffffffff81160f11>] sys_select+0x51/0x130
[173195.319248] [<ffffffff8100c682>] system_call_fastpath+0x16/0x1b
[173195.361252] [<00007fcfeb66c6c3>] 0x7fcfeb66c6c3
===
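
A quick way to sanity-check the faulting address in the OOPS above is to
recompute it from the register dump. The bytes marked `<44> 88 24 11` in the
Code: line decode to `mov %r12b,(%rcx,%rdx,1)`, a one-byte store to RCX + RDX.
The sketch below is only an illustration: the helper name `parse_registers` is
made up here, and the register lines are copied from the dump with timestamps
stripped and wrapped lines joined.

```python
import re

# Register lines copied from the first OOPS (timestamps stripped, wraps joined).
OOPS_REGS = """\
RAX: 0000000000000286 RBX: ffff88003dcabd1c RCX: 0000000000000000
RDX: 0000000000000002 RSI: 0000000000000286 RDI: ffff88003dcabd1c
R10: 00000000ffffffff R11: 0000000000000001 R12: 0000000000000074
CR2: 0000000000000002 CR3: 000000003c55b000 CR4: 00000000000006f0
"""

def parse_registers(text):
    """Return {name: int} for every 'NAME: <16 hex digits>' pair in the text."""
    pairs = re.findall(r"\b([A-Z][A-Z0-9]{1,2}): ([0-9a-f]{16})\b", text)
    return {name: int(value, 16) for name, value in pairs}

regs = parse_registers(OOPS_REGS)

# Effective address of the store: base RCX plus index RDX (scale 1).
effective_address = regs["RCX"] + regs["RDX"]
# The byte being stored is the low 8 bits of R12.
stored_byte = regs["R12"] & 0xFF

print(hex(effective_address), effective_address == regs["CR2"], chr(stored_byte))
# prints: 0x2 True t
```

The base register is zero and the index is 2, which matches CR2. This suggests
the store went through a NULL buffer pointer with a small offset, presumably
the tty read buffer, although the dump alone does not show why the pointer was
NULL.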

Then, after about a minute:
===
[173260.055001] BUG: soft lockup - CPU#0 stuck for 61s! [telnet:23281]
[173260.055001] Modules linked in: dell_rbu ftdi_sio usbserial ipmi_devintf
ipmi_si ipmi_msghandler edd af_packet microcode fuse loop iTCO_wdt
iTCO_vendor_support shpchp sr_mod i3000_edac pcspkr dcdbas tg3 cdrom i2c_i801
pci_hotplug sg edac_core button ext4 jbd2 crc16 linear raid456 raid6_pq
async_xor async_memcpy async_tx xor raid1 raid0 dm_snapshot dm_mod fan
processor pata_acpi piix ide_core thermal thermal_sys
[173260.055001] CPU 0:
[173260.055001] Modules linked in: dell_rbu ftdi_sio usbserial ipmi_devintf
ipmi_si ipmi_msghandler edd af_packet microcode fuse loop iTCO_wdt
iTCO_vendor_support shpchp sr_mod i3000_edac pcspkr dcdbas tg3 cdrom i2c_i801
pci_hotplug sg edac_core button ext4 jbd2 crc16 linear raid456 raid6_pq
async_xor async_memcpy async_tx xor raid1 raid0 dm_snapshot dm_mod fan
processor pata_acpi piix ide_core thermal thermal_sys
[173260.055001] Pid: 23281, comm: telnet Tainted: G D
2.6.31.8-0.1-desktop #1 PowerEdge 850
[173260.055001] RIP: 0010:[<ffffffff8103a67c>] [<ffffffff8103a67c>]
__ticket_spin_lock+0x2c/0x50
[173260.055001] RSP: 0018:ffff880026c27398 EFLAGS: 00000297
[173260.055001] RAX: 0000000000000b03 RBX: ffff880026c273a8 RCX:
00000000000002da
[173260.055001] RDX: 0000000000000b02 RSI: 0000000000000286 RDI:
ffffffff818b3880
[173260.055001] RBP: ffffffff8100d27e R08: 00000000ffffffff R09:
0000000000000000
[173260.055001] R10: 00000000ffffffff R11: 0000000000000001 R12:
ffff8800218e2740
[173260.055001] R13: ffff88003c546040 R14: 0000000000013a00 R15:
ffff8800218e2b00
[173260.055001] FS: 00007fcfebd4b6f0(0000) GS:ffff880001c80000(0000)
knlGS:0000000000000000
[173260.055001] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[173260.055001] CR2: 00007f0d65226000 CR3: 000000003e8b3000 CR4:
00000000000006f0
[173260.055001] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
0000000000000000
[173260.055001] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7:
0000000000000400
[173260.055001] Call Trace:
[173260.055001] Inexact backtrace:
[173260.055001]
[173260.055001] [<ffffffff815569e0>] ? lock_kernel+0xc0/0x152
[173260.055001] [<ffffffff8131ef34>] ? disassociate_ctty+0x44/0x270
[173260.055001] [<ffffffff8106f42a>] ? do_exit+0x1aa/0x3c0
[173260.055001] [<ffffffff8106ab81>] ? print_oops_end_marker+0x31/0x50
[173260.055001] [<ffffffff81557b83>] ? oops_end+0xd3/0x130
[173260.055001] [<ffffffff810448fb>] ? no_context+0x10b/0x1a0
[173260.055001] [<ffffffff81044b0d>] ? __bad_area_nosemaphore+0x17d/0x230
[173260.055001] [<ffffffff81044be1>] ? bad_area_nosemaphore+0x21/0x40
[173260.055001] [<ffffffff81559daa>] ? do_page_fault+0x3aa/0x470
[173260.055001] [<ffffffff81556bf5>] ? page_fault+0x25/0x30
[173260.055001] [<ffffffff81321de3>] ? put_tty_queue+0x53/0xb0
[173260.055001] [<ffffffff81321dc8>] ? put_tty_queue+0x38/0xb0
[173260.055001] [<ffffffff813248ac>] ? n_tty_receive_char+0x1dc/0x7f0
[173260.055001] [<ffffffff8132509c>] ? n_tty_receive_buf+0x1dc/0x490
[173260.055001] [<ffffffff815564d6>] ? _spin_unlock_irq+0x26/0x90
[173260.055001] [<ffffffff8105703a>] ? finish_task_switch+0x13a/0x170
[173260.055001] [<ffffffff8103a7d6>] ? default_spin_lock_flags+0x26/0x50
[173260.055001] [<ffffffff8103a7d6>] ? default_spin_lock_flags+0x26/0x50
[173260.055001] [<ffffffff813280cb>] ? flush_to_ldisc+0x19b/0x1d0
[173260.055001] [<ffffffff813281c3>] ? tty_flush_to_ldisc+0x23/0x40
[173260.055001] [<ffffffff81322a11>] ? n_tty_poll+0x81/0x1c0
[173260.055001] [<ffffffff8131d822>] ? tty_poll+0xa2/0xc0
[173260.055001] [<ffffffff81160395>] ? do_select+0x3e5/0x740
[173260.055001] [<ffffffff811606f0>] ? __pollwait+0x0/0x110
[173260.055001] [<ffffffff81160800>] ? pollwake+0x0/0x80
[173260.055001] [<ffffffff81160800>] ? pollwake+0x0/0x80
[173260.055001] [<ffffffff81160800>] ? pollwake+0x0/0x80
[173260.055001] [<ffffffff81063cfe>] ? try_to_wake_up+0xce/0x350
[173260.055001] [<ffffffff81063fa0>] ? default_wake_function+0x20/0x40
[173260.055001] [<ffffffff8104fc89>] ? __wake_up_common+0x69/0xb0
[173260.055001] [<ffffffff81054c5d>] ? __wake_up+0x5d/0x90
[173260.055001] [<ffffffff81160aaa>] ? core_sys_select+0x22a/0x3e0
[173260.055001] [<ffffffff813272fc>] ? tty_ldisc_deref+0x1c/0x40
[173260.055001] [<ffffffff8131f678>] ? tty_write+0x248/0x2b0
[173260.055001] [<ffffffff81160f11>] ? sys_select+0x51/0x130
[173260.055001] [<ffffffff8114c82b>] ? sys_write+0x5b/0xa0
[173260.055001] [<ffffffff8100c682>] ? system_call_fastpath+0x16/0x1b
===

This soft lockup message then repeats every 65-67 seconds.
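
To put the timing in context, the three BUG lines in the logs above can be
lined up relative to the first OOPS. This is a minimal sketch, not a general
log parser: the helper name `bug_events` is made up for illustration, and only
the BUG lines are copied in (wrapped continuation lines are omitted).

```python
import re

# BUG: lines copied from the logs in this report.
LOG = """\
[173192.691025] BUG: unable to handle kernel NULL pointer dereference at
[173193.873177] BUG: scheduling while atomic: telnet/23281/0x10000002
[173260.055001] BUG: soft lockup - CPU#0 stuck for 61s! [telnet:23281]
"""

def bug_events(log):
    """Return [(timestamp_in_microseconds, message)] for each 'BUG:' line."""
    pattern = r"^\[\s*(\d+)\.(\d{6})\] (BUG: .*)$"
    return [(int(sec) * 1_000_000 + int(usec), msg)
            for sec, usec, msg in re.findall(pattern, log, re.M)]

events = bug_events(LOG)
first = events[0][0]
for stamp, msg in events:
    # Integer microseconds avoid float rounding when subtracting timestamps.
    print(f"+{(stamp - first) / 1e6:7.3f}s  {msg}")
```

The first soft lockup report arrives about 67 seconds after the initial OOPS,
which is in line with the 65-67 second interval.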

I replaced the RAM and tried the pae and default kernel flavours, initially on
i586.

I then used the second 850 for a fresh x86_64 install that is otherwise a
replica of the original machine. It shows the same OOPSes (the OOPSes above
were captured on this machine).

As the primary machine is our syslog server, I tried 2.6.33rc5 from KOTD on it;
it has been stable for 6 days now.

Do I need to supply more information?


Reproducible: Always

Steps to Reproduce:
1. Install 11.2 on a PE 850 with the current update kernel.
2. Run it and wait.

--
Configure bugmail: http://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.
