[Bug 893428] New: cpu soft lockup in ext4_es_lru_del
https://bugzilla.novell.com/show_bug.cgi?id=893428

https://bugzilla.novell.com/show_bug.cgi?id=893428#c0

           Summary: cpu soft lockup in ext4_es_lru_del
    Classification: openSUSE
           Product: openSUSE 13.1
           Version: Final
          Platform: x86-64
        OS/Version: openSUSE 13.1
            Status: NEW
          Severity: Critical
          Priority: P5 - None
         Component: Kernel
        AssignedTo: kernel-maintainers@forge.provo.novell.com
        ReportedBy: russellx.j.miller@intel.com
         QAContact: qa-bugs@suse.de
          Found By: ---
           Blocker: ---

User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/36.0.1985.143 Safari/537.36

Saw a bunch of these - enough that it made a 48 core system completely unresponsive to anything but keystrokes on the console. This seems to be the same bug as I found on the LKML:

https://lkml.org/lkml/2014/5/13/440

stack trace as follows:

2014-08-25T08:49:31.356797-07:00 cov kernel: [322676.159200] BUG: soft lockup - CPU#22 stuck for 22s! [cc1plus:33182]
2014-08-25T08:49:31.356798-07:00 cov kernel: [322676.159720] Modules linked in: veth xt_REDIRECT xt_tcpudp binfmt_misc xt_addrtype xt_conntrack iptable_filter ipt_MASQUERADE iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack ip_tables x_tables bridge stp llc dm_thin_pool dm_bio_prison dm_persistent_data dm_bufio libcrc32c loop bonding x86_pkg_temp_thermal coretemp kvm_intel iTCO_wdt kvm joydev crc32_pclmul crc32c_intel iTCO_vendor_support gpio_ich tg3 ghash_clmulni_intel aesni_intel libphy igb ablk_helper cryptd lpc_ich lrw ptp gf128mul ioatdma glue_helper mei_me hid_generic sr_mod dcdbas usb_storage pcspkr usbhid dca cdrom mei mfd_core pps_core aes_x86_64 shpchp wmi acpi_pad acpi_power_meter button sg mperf ipmi_devintf ipmi_si ipmi_msghandler dm_mod autofs4 mgag200 ttm drm_kms_helper ehci_pci drm ehci_hcd i2c_algo_bit sysimgblt sysfillrect usbcore syscopyarea usb_common megaraid_sas processor thermal_sys scsi_dh_hp_sw scsi_dh_emc scsi_dh_alua scsi_dh_rdac scsi_dh
2014-08-25T08:49:31.356798-07:00 cov kernel: [322676.159766] CPU: 22 PID: 33182 Comm: cc1plus Not tainted 3.11.10-17-default #1
2014-08-25T08:49:31.356800-07:00 cov kernel: [322676.159768] Hardware name: Dell Inc. PowerEdge R720/061P35, BIOS 2.2.3 05/20/2014
2014-08-25T08:49:31.356801-07:00 cov kernel: [322676.159769] task: ffff880009a7c180 ti: ffff88000ca8a000 task.ti: ffff88000ca8a000
2014-08-25T08:49:31.356801-07:00 cov kernel: [322676.159770] RIP: 0010:[<ffffffff8155daea>] [<ffffffff8155daea>] _raw_spin_lock+0x1a/0x30
2014-08-25T08:49:31.356802-07:00 cov kernel: [322676.159775] RSP: 0000:ffff88000ca8b970 EFLAGS: 00000283
2014-08-25T08:49:31.356802-07:00 cov kernel: [322676.159776] RAX: 00000000000096fd RBX: ffff880624dcab08 RCX: 0000000000009709
2014-08-25T08:49:31.356802-07:00 cov kernel: [322676.159777] RDX: 0000000000009709 RSI: ffff880037088290 RDI: ffff880801301480
2014-08-25T08:49:31.356803-07:00 cov kernel: [322676.159778] RBP: ffff880801301000 R08: 0000000000000000 R09: 0000000000000000
2014-08-25T08:49:31.356805-07:00 cov kernel: [322676.159779] R10: 0000000000000000 R11: 0000000000000000 R12: ffff88080561cc00
2014-08-25T08:49:31.356805-07:00 cov kernel: [322676.159779] R13: ffff880037088290 R14: 0000000000000000 R15: 0000000000000000
2014-08-25T08:49:31.356805-07:00 cov kernel: [322676.159781] FS: 00002b9b0d4cb200(0000) GS:ffff88082fb60000(0000) knlGS:0000000000000000
2014-08-25T08:49:31.356806-07:00 cov kernel: [322676.159782] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
2014-08-25T08:49:31.356806-07:00 cov kernel: [322676.159783] CR2: 00002b9b1dcef5a0 CR3: 0000000d9c0ad000 CR4: 00000000001407e0
2014-08-25T08:49:31.356807-07:00 cov kernel: [322676.159784] Stack:
2014-08-25T08:49:31.356807-07:00 cov kernel: [322676.159784] ffffffff8122c83c ffff880624dca8a0 ffff880624dca9a8 ffffffff8120e498
2014-08-25T08:49:31.356809-07:00 cov kernel: [322676.159788] ffff880624dca8a0 ffffffff81193da3 ffff88000ca8b9e0 ffff880dbc0ed530
2014-08-25T08:49:31.356810-07:00 cov kernel: [322676.159791] ffff88080561cd08 ffffffff81193ec1 ffff880dbc0ed5a0 ffffffff81194ccc
2014-08-25T08:49:31.356810-07:00 cov kernel: [322676.159794] Call Trace:
2014-08-25T08:49:31.356810-07:00 cov kernel: [322676.159802] [<ffffffff8122c83c>] ext4_es_lru_del+0x1c/0x60
2014-08-25T08:49:31.356811-07:00 cov kernel: [322676.159807] [<ffffffff8120e498>] ext4_clear_inode+0x38/0x80
2014-08-25T08:49:31.356811-07:00 cov kernel: [322676.159811] [<ffffffff81193da3>] evict+0xa3/0x190
2014-08-25T08:49:31.356812-07:00 cov kernel: [322676.159813] [<ffffffff81193ec1>] dispose_list+0x31/0x40
2014-08-25T08:49:31.356814-07:00 cov kernel: [322676.159816] [<ffffffff81194ccc>] prune_icache_sb+0x16c/0x310
2014-08-25T08:49:31.356814-07:00 cov kernel: [322676.159820] [<ffffffff8117ee9b>] prune_super+0x15b/0x190
2014-08-25T08:49:31.356814-07:00 cov kernel: [322676.159825] [<ffffffff81128953>] shrink_slab+0x153/0x2d0
2014-08-25T08:49:31.356815-07:00 cov kernel: [322676.159828] [<ffffffff8112b6ef>] do_try_to_free_pages+0x39f/0x4c0
2014-08-25T08:49:31.356815-07:00 cov kernel: [322676.159831] [<ffffffff8112b8e8>] try_to_free_pages+0xd8/0x160
2014-08-25T08:49:31.356816-07:00 cov kernel: [322676.159836] [<ffffffff81121959>] __alloc_pages_nodemask+0x5c9/0x980
2014-08-25T08:49:31.356818-07:00 cov kernel: [322676.159840] [<ffffffff8115cc85>] alloc_pages_vma+0x95/0x140
2014-08-25T08:49:31.356818-07:00 cov kernel: [322676.159844] [<ffffffff8113d58a>] do_wp_page+0x43a/0x810
2014-08-25T08:49:31.356818-07:00 cov kernel: [322676.159849] [<ffffffff8113e86b>] handle_pte_fault+0x29b/0xa60
2014-08-25T08:49:31.356819-07:00 cov kernel: [322676.159853] [<ffffffff815612f4>] __do_page_fault+0x124/0x4d0
2014-08-25T08:49:31.356819-07:00 cov kernel: [322676.159856] [<ffffffff8155e048>] page_fault+0x28/0x30
2014-08-25T08:49:31.356820-07:00 cov kernel: [322676.159860] [<0000000000d81d60>] 0xd81d5f
2014-08-25T08:49:31.356820-07:00 cov kernel: [322676.159861] Code: ec b8 01 00 00 00 c3 66 2e 0f 1f 84 00 00 00 00 00 b8 00 00 01 00 f0 0f c1 07 89 c1 c1 e9 10 66 39 c1 89 ca 74 0d 0f 1f 00 f3 90 <0f> b7 07 66 39 d0 75 f6 c3 66 66 66 66 2e 0f 1f 84 00 00 00 00

This server runs Coverity, and is constantly creating and destroying docker instances. This seems to be exercising the system in a wholly unusual way - this is not the first kernel fault it exposed. I'll file a bug for the other one too.

Reproducible: Sometimes

Steps to Reproduce:
1. use the server as normal.
2.
3.

Actual Results:
CPU SOFT LOCKUP

Expected Results:
No CPU SOFT LOCKUP

--
Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=893428#c1  Takashi Iwai
https://bugzilla.novell.com/show_bug.cgi?id=893428#c2  Jan Kara
https://bugzilla.novell.com/show_bug.cgi?id=893428#c3  Comment #3 from Russell Miller
https://bugzilla.novell.com/show_bug.cgi?id=893428#c4  Comment #4 from Takashi Iwai
https://bugzilla.novell.com/show_bug.cgi?id=893428#c5  Jan Kara
https://bugzilla.novell.com/show_bug.cgi?id=893428#c6  Comment #6 from Russell Miller
https://bugzilla.novell.com/show_bug.cgi?id=893428#c7  Comment #7 from Russell Miller
https://bugzilla.novell.com/show_bug.cgi?id=893428#c8  Comment #8 from Jan Kara
Swamp Workflow Management
Comment #14 from Swamp Workflow Management
Swamp Workflow Management
Swamp Workflow Management
Swamp Workflow Management
Comment #15 from Swamp Workflow Management
Swamp Workflow Management
Swamp Workflow Management