https://bugzilla.novell.com/show_bug.cgi?id=864430
https://bugzilla.novell.com/show_bug.cgi?id=864430#c17
Achim Mildenberger changed:
What |Removed |Added
----------------------------------------------------------------------------
CC| |admin@fph.physik.uni-karlsr
| |uhe.de
--- Comment #17 from Achim Mildenberger 2014-06-11 10:00:36 UTC ---
I have rather similar looking problem, I hope it is related.
Since updating a pool of 34 machines to openSuSE 13.1 (now running kernel
3.11.10-11-default), I am observing machines which get stuck with
BUG: soft lockup - CPU#3 stuck for 22s! [mozStorage #5:4530]
I couldn't observe any specific pattern when this occurs, it seems more or
less random. Ok, there is alway desktop-usage and firefox (mozStorage)
runnning.
Frequency is less than once per month per client, depending on the usage
of the 34 clients.
In total I observed about 19 of these hanging states.
NFSv4 (4.1) is used for the home-directories, the server running openSuSE 12.3.
If a machine encounters the "soft lockup"-state, it is kind of unresponsively
hanging. The error message repeats about every 30 seconds. ssh access is still
possible, some (simple) commands still work, but e.g. lsof hangs. also reboot
(with sync) hangs.
I'm attaching a typical entry of /var/log/messages.
---------------------------------------------------------------------------
2014-06-10T15:58:50.075919+02:00 fphct10 kernel: [367916.088007] BUG: soft
lockup - CPU#1 stuck for 22s! [mozStorage #4:12030]
2014-06-10T15:58:50.075943+02:00 fphct10 kernel: [367916.088007] Modules linked
in: nls_iso8859_1 nls_cp437 vfat fat usb_storage nfsv3 nfs_acl fuse bnep
bluetooth rfkill rpcsec_gss_krb5 auth_rpcgss oid_registry nfsv4 nfs fscache
lockd sunrpc sch5636 sch56xx_common snd_hda_codec_hdmi snd_hda_codec_conexant
iTCO_wdt iTCO_vendor_support snd_hda_intel snd_hda_codec snd_hwdep snd_pcm
ppdev snd_timer snd joydev x86_pkg_temp_thermal intel_powerclamp coretemp
kvm_intel kvm crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel
ablk_helper cryptd lrw gf128mul glue_helper aes_x86_64 serio_raw pcspkr
i2c_i801 sr_mod cdrom lpc_ich mfd_core soundcore e1000e snd_page_alloc ptp
pps_core mei_me mei mperf parport_pc parport sg dm_mod autofs4 hid_generic
usbhid ehci_pci ehci_hcd i915 xhci_hcd drm_kms_helper usbcore usb_common drm
i2c_algo_bit fan thermal video button processor thermal_sys scsi_dh_rdac
scsi_dh_emc scsi_dh_alua scsi_dh_hp_sw scsi_dh reiserfs
2014-06-10T15:58:50.075944+02:00 fphct10 kernel: [367916.088007] CPU: 1 PID:
12030 Comm: mozStorage #4 Not tainted 3.11.10-11-default #1
2014-06-10T15:58:50.075946+02:00 fphct10 kernel: [367916.088007] Hardware name:
FUJITSU ESPRIMO P910/D3162-A1, BIOS V4.6.5.3 R1.20.0 for D3162-A1x 07/17/2013
2014-06-10T15:58:50.075947+02:00 fphct10 kernel: [367916.088007] task:
ffff88005fbf4440 ti: ffff88005fbf6000 task.ti: ffff88005fbf6000
2014-06-10T15:58:50.075949+02:00 fphct10 kernel: [367916.088007] RIP:
0010:[<ffffffff810703cd>] [<ffffffff810703cd>] prepare_to_wait+0x4d/0x80
2014-06-10T15:58:50.075950+02:00 fphct10 kernel: [367916.088007] RSP:
0018:ffff88005fbf7dc8 EFLAGS: 00000246
2014-06-10T15:58:50.075951+02:00 fphct10 kernel: [367916.088007] RAX:
0000000000000246 RBX: 0000000000000000 RCX: ffff88005fbf4440
2014-06-10T15:58:50.075952+02:00 fphct10 kernel: [367916.088007] RDX:
0000000000000082 RSI: 0000000000000246 RDI: 0000000000000246
2014-06-10T15:58:50.075953+02:00 fphct10 kernel: [367916.088007] RBP:
ffff88011e5c3e98 R08: ffff88005fbf6000 R09: 00014e980dd19b8e
2014-06-10T15:58:50.075954+02:00 fphct10 kernel: [367916.088007] R10:
0000000000000000 R11: 0000000000000006 R12: ffff88005fbf7d70
2014-06-10T15:58:50.075955+02:00 fphct10 kernel: [367916.088007] R13:
ffff88011e5c3e98 R14: 0000000000000000 R15: ffffffff81563e38
2014-06-10T15:58:50.075956+02:00 fphct10 kernel: [367916.088007] FS:
00007fa34f1ff700(0000) GS:ffff88011e280000(0000) knlGS:0000000000000000
2014-06-10T15:58:50.075957+02:00 fphct10 kernel: [367916.088007] CS: 0010 DS:
0000 ES: 0000 CR0: 0000000080050033
2014-06-10T15:58:50.075958+02:00 fphct10 kernel: [367916.088007] CR2:
00000000008d19f0 CR3: 0000000099006000 CR4: 00000000001407e0
2014-06-10T15:58:50.075959+02:00 fphct10 kernel: [367916.088007] Stack:
2014-06-10T15:58:50.075960+02:00 fphct10 kernel: [367916.088007]
0000000000000082 ffff8800ca48a9b0 ffff88005fbf7e50 ffff88011e5c3e98
2014-06-10T15:58:50.075962+02:00 fphct10 kernel: [367916.088007]
ffffffffa0566b92 ffff8800ca48a9b0 0000000000000000 0000000000000000
2014-06-10T15:58:50.075963+02:00 fphct10 kernel: [367916.088007]
ffff88005fbf4440 ffffffff81070660 ffff88011e5c3ea0 ffff88011e5c3ea0
2014-06-10T15:58:50.075963+02:00 fphct10 kernel: [367916.088007] Call Trace:
2014-06-10T15:58:50.075964+02:00 fphct10 kernel: [367916.088007]
[<ffffffffa0566b92>] nfs_iocounter_wait+0xa2/0xd0 [nfs]
2014-06-10T15:58:50.075965+02:00 fphct10 kernel: [367916.088007]
[<ffffffffa055d2c9>] do_unlk+0x49/0xd0 [nfs]
2014-06-10T15:58:50.075966+02:00 fphct10 kernel: [367916.088007]
[<ffffffff811c5276>] fcntl_setlk+0x126/0x2c0
2014-06-10T15:58:50.075967+02:00 fphct10 kernel: [367916.088007]
[<ffffffff8118bb41>] SyS_fcntl+0x261/0x510
2014-06-10T15:58:50.075968+02:00 fphct10 kernel: [367916.088007]
[<ffffffff815655ad>] system_call_fastpath+0x1a/0x1f
2014-06-10T15:58:50.075969+02:00 fphct10 kernel: [367916.088007]
[<00007fa3915edc22>] 0x7fa3915edc21
2014-06-10T15:58:50.075970+02:00 fphct10 kernel: [367916.088007] Code: 39 d1 74
3a 4c 89 24 24 48 8b 14 24 65 48 8b 0c 25 40 b9 00 00 48 87 11 48 89 ef 48 89
c6 48 89 14 24 48 8b 14 24 e8 83 d3 4e 00 <48> 83 c4 08 5b 5d 41 5c c3 66 2e 0f
1f 84 00 00 00 00 00 48 8b
--
Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.