[Bug 951956] New: kernel soft-lockup in leap 42.1 RC1 kernel
http://bugzilla.opensuse.org/show_bug.cgi?id=951956 Bug ID: 951956 Summary: kernel soft-lockup in leap 42.1 RC1 kernel Classification: openSUSE Product: openSUSE Distribution Version: Leap 42.1 RC1 1 Hardware: x86-64 OS: openSUSE 42.1 Status: NEW Severity: Critical Priority: P5 - None Component: Kernel Assignee: kernel-maintainers@forge.provo.novell.com Reporter: rudamir@gmail.com QA Contact: qa-bugs@suse.de Found By: --- Blocker: --- User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:41.0) Gecko/20100101 Firefox/41.0 Build Identifier: Kernel is frozen unexpectedly. It happend several times after upgrade from Leap 42.1 Beta to RC1. This is last occurrence, when I was able to get error log kernel:[172780.000001] NMI watchdog: BUG: soft lockup - CPU#0 stuck for 22s! [swapper/0:0] [Sat Oct 24 14:54:46 2015] NMI watchdog: BUG: soft lockup - CPU#0 stuck for 22s! [swapper/0:0] [Sat Oct 24 14:54:46 2015] Modules linked in: bnep bluetooth rfkill fuse vboxpci(O) vboxnetadp(O) vboxnetflt(O) vboxdrv(O) iscsi_ibft iscsi_boot_sysfs af_packet snd_hda_codec_realtek snd_hda_codec_generic snd_hda_codec_hdmi snd_hda_intel dm_mod snd_hda_controller ppdev iTCO_wdt iTCO_vendor_support r8169 snd_hda_codec snd_hda_core snd_hwdep mii gpio_ich acpi_cpufreq snd_pcm 8250_fintek joydev snd_timer parport_pc i2c_i801 snd parport serio_raw lpc_ich mfd_core pcspkr processor soundcore shpchp coretemp button kvm_intel kvm xfs libcrc32c hid_generic usbhid raid0 md_mod ata_generic uas usb_storage ata_piix ehci_pci pata_jmicron uhci_hcd ehci_hcd usbcore usb_common radeon i2c_algo_bit drm_kms_helper ttm drm sg [Sat Oct 24 14:54:46 2015] CPU: 0 PID: 0 Comm: swapper/0 Tainted: G O 4.1.10-1-default #1 [Sat Oct 24 14:54:46 2015] Hardware name: ATComputers OFFICEPRO 1000/P43T-ES3G, BIOS F7 ZA 10/12/2010 [Sat Oct 24 14:54:46 2015] task: ffffffff81e15480 ti: ffffffff81e00000 task.ti: ffffffff81e00000 [Sat Oct 24 14:54:46 2015] RIP: 0010:[<ffffffff8165f2ae>] [<ffffffff8165f2ae>] _raw_spin_unlock_irqrestore+0xe/0x30 [Sat Oct 24 14:54:46 2015] RSP: 0018:ffff88011fc03de0 EFLAGS: 00000282 [Sat Oct 24 14:54:46 2015] RAX: ffffffff82144940 RBX: 0000000000006a1d RCX: 0000000000006a87 [Sat Oct 24 14:54:46 2015] RDX: 000000006a876a87 RSI: 0000000000000282 RDI: 0000000000000282 [Sat Oct 24 14:54:46 2015] RBP: ffffffff82144940 R08: ffffffff818120a0 R09: ffffffff82144940 [Sat Oct 24 14:54:46 2015] R10: 00000000ffffffff R11: 0000000000000005 R12: ffff88011fc03d58 [Sat Oct 24 14:54:46 2015] R13: ffffffff8166099e R14: 00000000ffffffff R15: ffffffff81662719 [Sat Oct 24 14:54:46 2015] FS: 0000000000000000(0000) GS:ffff88011fc00000(0000) knlGS:0000000000000000 [Sat Oct 24 14:54:46 2015] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b [Sat Oct 24 14:54:46 2015] CR2: 00007f715bac4014 CR3: 00000000bd2aa000 CR4: 00000000000406f0 [Sat Oct 24 14:54:46 2015] Stack: [Sat Oct 24 14:54:46 2015] ffffffff810d36bc ffffffff810d3712 0000000000000282 0000000000000286 [Sat Oct 24 14:54:46 2015] ffff88003815ff28 ffff880036ec0040 ffffffff810d371a ffff88003815fe90 [Sat Oct 24 14:54:46 2015] ffffffff815a7fe8 0000000000011a10 0000000000000005 ffff88003815fe90 [Sat Oct 24 14:54:46 2015] Call Trace: [Sat Oct 24 14:54:46 2015] [<ffffffff810d36bc>] try_to_del_timer_sync+0x4c/0x60 [Sat Oct 24 14:54:46 2015] [<ffffffff810d371a>] del_timer_sync+0x4a/0x60 [Sat Oct 24 14:54:46 2015] [<ffffffff815a7fe8>] inet_csk_reqsk_queue_drop+0x78/0x1e0 [Sat Oct 24 14:54:46 2015] [<ffffffff815a83b8>] reqsk_timer_handler+0x268/0x2d0 [Sat Oct 24 14:54:46 2015] [<ffffffff810d1bc0>] call_timer_fn+0x30/0x140 [Sat Oct 24 14:54:46 2015] [<ffffffff810d200b>] run_timer_softirq+0x24b/0x300 [Sat Oct 24 14:54:46 2015] [<ffffffff8106cd10>] __do_softirq+0xe0/0x2b0 [Sat Oct 24 14:54:46 2015] [<ffffffff8106d125>] irq_exit+0x95/0xa0 [Sat Oct 24 14:54:46 2015] [<ffffffff8166271e>] smp_apic_timer_interrupt+0x3e/0x50 [Sat Oct 24 14:54:46 2015] [<ffffffff8166099e>] apic_timer_interrupt+0x6e/0x80 [Sat Oct 24 14:54:46 2015] [<ffffffff8100dc3f>] mwait_idle+0xaf/0x1a0 [Sat Oct 24 14:54:46 2015] [<ffffffff810ab17c>] cpu_startup_entry+0x34c/0x420 [Sat Oct 24 14:54:46 2015] [<ffffffff81f2a083>] start_kernel+0x4a3/0x4ae [Sat Oct 24 14:54:46 2015] [<ffffffff81f296f9>] x86_64_start_kernel+0x149/0x158 [Sat Oct 24 14:54:46 2015] Code: 66 90 66 83 07 01 65 ff 0d 90 b7 9a 7e 74 06 c3 0f 1f 44 00 00 e8 f4 7b d0 ff c3 66 90 66 66 66 66 90 66 83 07 01 48 89 f7 57 9d <66> 66 90 66 90 65 ff 0d 66 b7 9a 7e 74 04 c3 0f 1f 00 e8 cc 7b [Sat Oct 24 14:55:26 2015] NMI watchdog: BUG: soft lockup - CPU#0 stuck for 37s! [swapper/0:0] [Sat Oct 24 14:55:26 2015] Modules linked in: bnep bluetooth rfkill fuse vboxpci(O) vboxnetadp(O) vboxnetflt(O) vboxdrv(O) iscsi_ibft iscsi_boot_sysfs af_packet snd_hda_codec_realtek snd_hda_codec_generic snd_hda_codec_hdmi snd_hda_intel dm_mod snd_hda_controller ppdev iTCO_wdt iTCO_vendor_support r8169 snd_hda_codec snd_hda_core snd_hwdep mii gpio_ich acpi_cpufreq snd_pcm 8250_fintek joydev snd_timer parport_pc i2c_i801 snd parport serio_raw lpc_ich mfd_core pcspkr processor soundcore shpchp coretemp button kvm_intel kvm xfs libcrc32c hid_generic usbhid raid0 md_mod ata_generic uas usb_storage ata_piix ehci_pci pata_jmicron uhci_hcd ehci_hcd usbcore usb_common radeon i2c_algo_bit drm_kms_helper ttm drm sg [Sat Oct 24 14:55:26 2015] CPU: 0 PID: 0 Comm: swapper/0 Tainted: G O L 4.1.10-1-default #1 [Sat Oct 24 14:55:26 2015] Hardware name: ATComputers OFFICEPRO 1000/P43T-ES3G, BIOS F7 ZA 10/12/2010 [Sat Oct 24 14:55:26 2015] task: ffffffff81e15480 ti: ffffffff81e00000 task.ti: ffffffff81e00000 [Sat Oct 24 14:55:26 2015] RIP: 0010:[<ffffffff810d20d1>] [<ffffffff810d20d1>] lock_timer_base.isra.36+0x11/0x60 [Sat Oct 24 14:55:26 2015] RSP: 0018:ffff88011fc03dc0 EFLAGS: 00000296 [Sat Oct 24 14:55:26 2015] RAX: 00000000ffffffff RBX: ffffffff818120a0 RCX: 000000000000f8d5 [Sat Oct 24 14:55:26 2015] RDX: 00000000f8d5f8d5 RSI: ffff88011fc03df0 RDI: ffff88003815ff40 [Sat Oct 24 14:55:26 2015] RBP: ffff880036ec0040 R08: ffffffff818120a0 R09: ffffffff82144940 [Sat Oct 24 14:55:26 2015] R10: 00000000ffffffff R11: 0000000000000005 R12: ffff88011fc03d38 [Sat Oct 24 14:55:26 2015] R13: ffffffff8166099e R14: ffff88003815ff28 R15: ffffffff81662719 [Sat Oct 24 14:55:26 2015] FS: 0000000000000000(0000) GS:ffff88011fc00000(0000) knlGS:0000000000000000 [Sat Oct 24 14:55:26 2015] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b [Sat Oct 24 14:55:26 2015] CR2: 00007f715bac4014 CR3: 00000000bd2aa000 CR4: 00000000000406f0 [Sat Oct 24 14:55:26 2015] Stack: [Sat Oct 24 14:55:26 2015] ffff88003815ff28 ffff880036ec0040 ffff880036ec0400 0000000000000001 [Sat Oct 24 14:55:26 2015] ffffffff810d368c ffffffff810d3712 0000000000000282 0000000000000286 [Sat Oct 24 14:55:26 2015] ffff88003815ff28 ffff880036ec0040 ffffffff810d371a ffff88003815fe90 [Sat Oct 24 14:55:26 2015] Call Trace: [Sat Oct 24 14:55:26 2015] [<ffffffff810d368c>] try_to_del_timer_sync+0x1c/0x60 [Sat Oct 24 14:55:26 2015] [<ffffffff810d371a>] del_timer_sync+0x4a/0x60 [Sat Oct 24 14:55:26 2015] [<ffffffff815a7fe8>] inet_csk_reqsk_queue_drop+0x78/0x1e0 [Sat Oct 24 14:55:26 2015] [<ffffffff815a83b8>] reqsk_timer_handler+0x268/0x2d0 [Sat Oct 24 14:55:26 2015] [<ffffffff810d1bc0>] call_timer_fn+0x30/0x140 [Sat Oct 24 14:55:26 2015] [<ffffffff810d200b>] run_timer_softirq+0x24b/0x300 [Sat Oct 24 14:55:26 2015] [<ffffffff8106cd10>] __do_softirq+0xe0/0x2b0 [Sat Oct 24 14:55:26 2015] [<ffffffff8106d125>] irq_exit+0x95/0xa0 [Sat Oct 24 14:55:26 2015] [<ffffffff8166271e>] smp_apic_timer_interrupt+0x3e/0x50 [Sat Oct 24 14:55:26 2015] [<ffffffff8166099e>] apic_timer_interrupt+0x6e/0x80 [Sat Oct 24 14:55:26 2015] [<ffffffff8100dc3f>] mwait_idle+0xaf/0x1a0 [Sat Oct 24 14:55:26 2015] [<ffffffff810ab17c>] cpu_startup_entry+0x34c/0x420 [Sat Oct 24 14:55:26 2015] [<ffffffff81f2a083>] start_kernel+0x4a3/0x4ae [Sat Oct 24 14:55:26 2015] [<ffffffff81f296f9>] x86_64_start_kernel+0x149/0x158 [Sat Oct 24 14:55:26 2015] Code: 4c 89 e9 4c 89 e7 e8 df 24 01 00 e9 31 fe ff ff 66 2e 0f 1f 84 00 00 00 00 00 66 66 66 66 90 41 55 49 89 f5 41 54 49 89 fc 55 53 <48> 83 ec 08 49 8b 1c 24 48 89 dd 48 83 e5 fc 74 2b 48 89 ef e8 Reproducible: Sometimes Steps to Reproduce: It happened three times during last week, in all cases when desktop was idle. Last case happened on Saturday, few minutes after one day delay (user left computer Fri Oct 23 14:38:13 2015, lockup happened Sat Oct 24 14:54:46 2015). Desktop workload is standard gnome, with firefox, thunderbird, chrome running. Kernel is from rpm kernel-default-4.1.10-1.1.x86_64 . -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=951956
http://bugzilla.opensuse.org/show_bug.cgi?id=951956#c1
Takashi Iwai
http://bugzilla.opensuse.org/show_bug.cgi?id=951956
http://bugzilla.opensuse.org/show_bug.cgi?id=951956#c2
--- Comment #2 from Takashi Iwai
Looks like somewhere in a net stack deadlocking.
Could you check whether the same problem still happens with the kernel package in OBS Kernel:openSUSE-42.1 repo?
Also, if yes, please try kernel-debug package. The lockdep might be able to catch something. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=951956
http://bugzilla.opensuse.org/show_bug.cgi?id=951956#c3
--- Comment #3 from Miroslav Ruda
Looks like somewhere in a net stack deadlocking.
Could you check whether the same problem still happens with the kernel package in OBS Kernel:openSUSE-42.1 repo?
Do you mean kernel-debug-4.1.11-1.1.g99c44ff.x86_64.rpm from http://download.opensuse.org/repositories/Kernel:/openSUSE-42.1/standard/x86... (repo file http://download.opensuse.org/repositories/Kernel:/openSUSE-42.1/standard/Ker... I can install it. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=951956
http://bugzilla.opensuse.org/show_bug.cgi?id=951956#c4
--- Comment #4 from Takashi Iwai
(In reply to Takashi Iwai from comment #1)
Looks like somewhere in a net stack deadlocking.
Could you check whether the same problem still happens with the kernel package in OBS Kernel:openSUSE-42.1 repo?
Do you mean kernel-debug-4.1.11-1.1.g99c44ff.x86_64.rpm from http://download.opensuse.org/repositories/Kernel:/openSUSE-42.1/standard/ x86_64/ (repo file http://download.opensuse.org/repositories/Kernel:/openSUSE-42.1/standard/ Kernel:openSUSE-42.1.repo)?
Yes. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=951956
http://bugzilla.opensuse.org/show_bug.cgi?id=951956#c5
--- Comment #5 from Miroslav Ruda
Do you mean kernel-debug-4.1.11-1.1.g99c44ff.x86_64.rpm ?
Yes.
OK, machine is running kernel 4.1.11-1.g99c44ff-debug. I will sent more info when lockup appears. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=951956
Ludwig Nussel
http://bugzilla.opensuse.org/show_bug.cgi?id=951956
http://bugzilla.opensuse.org/show_bug.cgi?id=951956#c6
--- Comment #6 from Miroslav Ruda
http://bugzilla.opensuse.org/show_bug.cgi?id=951956
http://bugzilla.opensuse.org/show_bug.cgi?id=951956#c7
--- Comment #7 from Takashi Iwai
participants (1)
-
bugzilla_noreply@novell.com