[Bug 820028] New: kernel BUG at svc.c:559

https://bugzilla.novell.com/show_bug.cgi?id=820028 https://bugzilla.novell.com/show_bug.cgi?id=820028#c0 Summary: kernel BUG at svc.c:559 Classification: openSUSE Product: openSUSE 12.3 Version: Final Platform: x86-64 OS/Version: Other Status: NEW Severity: Normal Priority: P5 - None Component: Kernel AssignedTo: kernel-maintainers@forge.provo.novell.com ReportedBy: stephan.barth@suse.com QAContact: qa-bugs@suse.de Found By: --- Blocker: --- 2013-05-15T09:04:13.094662+02:00 linux-zied rsyslogd: -- MARK -- 2013-05-15T09:21:42.338027+02:00 linux-zied kernel: [1879445.000180] ------------[ cut here ]------------ 2013-05-15T09:21:42.338039+02:00 linux-zied kernel: [1879445.000665] kernel BUG at /home/abuild/rpmbuild/BUILD/kernel-desktop-3.7.10/linux-3.7/net/sunrpc/svc.c:559! This morning I typed in the password for unlocking the screen saver, tried to open a new terminal and then this happened: 2013-05-15T09:21:42.338039+02:00 linux-zied kernel: [1879445.001130] invalid opcode: 0000 [#1] PREEMPT SMP 2013-05-15T09:21:42.338040+02:00 linux-zied kernel: [1879445.001616] Modules linked in: snd_usb_audio uvcvideo videobuf2_core videodev snd_usbmidi_lib videobuf2_vmalloc videobuf2_memops snd_rawmidi iptable_filter ip_tables x_tables ppdev parport_pc lp parport snd_seq_dummy snd_hrtimer joydev snd_mixer_oss nfsv3 nfs_acl nfsv4 auth_rpcgss nfs fscache lockd sunrpc bnep bluetooth rfkill fuse af_packet nls_iso8859_1 nls_cp437 vfat fat snd_hda_codec_hdmi snd_hda_codec_realtek iTCO_wdt iTCO_vendor_support acpi_cpufreq mperf coretemp crc32c_intel ghash_clmulni_intel aesni_intel ablk_helper cryptd lrw aes_x86_64 xts gf128mul kvm_intel kvm dcdbas snd_hda_intel snd_hda_codec snd_hwdep snd_pcm snd_seq snd_timer snd_seq_device snd microcode pcspkr sb_edac edac_core i2c_i801 sr_mod cdrom lpc_ich mfd_core e1000e sg mei soundcore snd_page_alloc pciehp pci_hotplug autofs4 dm_mod linear nouveau ttm drm_kms_helper drm i2c_algo_bit mxm_wmi video wmi xhci_hcd button processor thermal_sys scsi_dh_alua scsi_dh_rdac scsi_dh_hp_sw scsi_dh_emc scsi_dh megaraid_sas 2013-05-15T09:21:42.338041+02:00 linux-zied kernel: [1879445.006307] CPU 2 2013-05-15T09:21:42.338041+02:00 linux-zied kernel: [1879445.006322] Pid: 23068, comm: mount.nfs Not tainted 3.7.10-1.1-desktop #1 Dell Inc. Precision T3600/0PTTT9 2013-05-15T09:21:42.338042+02:00 linux-zied kernel: [1879445.007976] RIP: 0010:[<ffffffffa056a50c>] [<ffffffffa056a50c>] svc_destroy+0x11c/0x120 [sunrpc] 2013-05-15T09:21:42.338043+02:00 linux-zied kernel: [1879445.008828] RSP: 0018:ffff8804348c1a08 EFLAGS: 00010283 2013-05-15T09:21:42.338055+02:00 linux-zied kernel: [1879445.009660] RAX: ffff8804353e2038 RBX: ffff8804353e2000 RCX: ffffffff81c452a8 2013-05-15T09:21:42.338056+02:00 linux-zied kernel: [1879445.010510] RDX: 0000000000000001 RSI: 0000000000000282 RDI: 0000000000000282 2013-05-15T09:21:42.338056+02:00 linux-zied kernel: [1879445.011369] RBP: 0000000000000000 R08: 0000000000000000 R09: ffff88044fbe9f80 2013-05-15T09:21:42.338057+02:00 linux-zied kernel: [1879445.012218] R10: 00000000000000f8 R11: 0000000000000001 R12: ffffffff81a74280 2013-05-15T09:21:42.338057+02:00 linux-zied kernel: [1879445.013076] R13: ffff880434a25800 R14: ffff8804348c1d70 R15: ffff8804348c1d70 2013-05-15T09:21:42.338057+02:00 linux-zied kernel: [1879445.013914] FS: 00007f3c488cd800(0000) GS:ffff88044f240000(0000) knlGS:0000000000000000 2013-05-15T09:21:42.338058+02:00 linux-zied kernel: [1879445.014768] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 2013-05-15T09:21:42.338059+02:00 linux-zied kernel: [1879445.016063] CR2: 00007f8368a89000 CR3: 00000004369b6000 CR4: 00000000000407e0 2013-05-15T09:21:42.338059+02:00 linux-zied kernel: [1879445.016927] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 2013-05-15T09:21:42.338059+02:00 linux-zied kernel: [1879445.017793] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 2013-05-15T09:21:42.338060+02:00 linux-zied kernel: [1879445.018660] Process mount.nfs (pid: 23068, threadinfo ffff8804348c0000, task ffff880438da8440) 2013-05-15T09:21:42.338060+02:00 linux-zied kernel: [1879445.019503] Stack: 2013-05-15T09:21:42.338061+02:00 linux-zied kernel: [1879445.020370] 0000000000000000 ffffffffa0623eca ffff880439404800 00000000ffffffff 2013-05-15T09:21:42.338061+02:00 linux-zied kernel: [1879445.021222] ffffffffffffffff ffffffffa062738c ffff880439404800 ffffffffa062757d 2013-05-15T09:21:42.338061+02:00 linux-zied kernel: [1879445.022098] 0000000000000000 ffffffffa056830f ffff880439404800 ffff880439404800 2013-05-15T09:21:42.338062+02:00 linux-zied kernel: [1879445.022980] Call Trace: 2013-05-15T09:21:42.338062+02:00 linux-zied kernel: [1879445.023927] [<ffffffffa0623eca>] nfs_callback_down+0x6a/0xe0 [nfsv4] 2013-05-15T09:21:42.338063+02:00 linux-zied kernel: [1879445.024809] [<ffffffffa062738c>] nfs4_free_client+0x4c/0xb0 [nfsv4] 2013-05-15T09:21:42.338063+02:00 linux-zied kernel: [1879445.025696] [<ffffffffa062757d>] nfs4_init_client+0x18d/0x2c0 [nfsv4] 2013-05-15T09:21:42.338064+02:00 linux-zied kernel: [1879445.026589] [<ffffffffa06270e5>] nfs4_set_client.isra.5+0x85/0x100 [nfsv4] 2013-05-15T09:21:42.338064+02:00 linux-zied kernel: [1879445.027464] [<ffffffffa0627fba>] nfs4_create_server+0xda/0x390 [nfsv4] 2013-05-15T09:21:42.338064+02:00 linux-zied kernel: [1879445.028354] [<ffffffffa0620c91>] nfs4_remote_mount+0x31/0x80 [nfsv4] 2013-05-15T09:21:42.338065+02:00 linux-zied kernel: [1879445.029243] [<ffffffff81172d35>] mount_fs+0x45/0x1d0 2013-05-15T09:21:42.338065+02:00 linux-zied kernel: [1879445.030085] [<ffffffff8118c613>] vfs_kern_mount+0x73/0x120 2013-05-15T09:21:42.338066+02:00 linux-zied kernel: [1879445.030966] [<ffffffffa0620c10>] nfs_do_root_mount+0x90/0xe0 [nfsv4] 2013-05-15T09:21:42.338066+02:00 linux-zied kernel: [1879445.031888] [<ffffffffa0620f8a>] nfs4_try_mount+0x3a/0xc0 [nfsv4] 2013-05-15T09:21:42.338066+02:00 linux-zied kernel: [1879445.032786] [<ffffffffa05e3465>] nfs_fs_mount+0x255/0xcd0 [nfs] 2013-05-15T09:21:42.338067+02:00 linux-zied kernel: [1879445.033645] [<ffffffff81172d35>] mount_fs+0x45/0x1d0 2013-05-15T09:21:42.338067+02:00 linux-zied kernel: [1879445.034504] [<ffffffff8118c613>] vfs_kern_mount+0x73/0x120 2013-05-15T09:21:42.338068+02:00 linux-zied kernel: [1879445.035391] [<ffffffff8118cf93>] do_kern_mount+0x53/0x110 2013-05-15T09:21:42.338068+02:00 linux-zied kernel: [1879445.036284] [<ffffffff8118eb34>] do_mount+0x334/0x900 2013-05-15T09:21:42.338069+02:00 linux-zied kernel: [1879445.037171] [<ffffffff8118f198>] sys_mount+0x98/0xf0 2013-05-15T09:21:42.338069+02:00 linux-zied kernel: [1879445.038075] [<ffffffff8159ebad>] system_call_fastpath+0x1a/0x1f 2013-05-15T09:21:42.338069+02:00 linux-zied kernel: [1879445.038948] [<00007f3c47fbcf4a>] 0x7f3c47fbcf49 2013-05-15T09:21:42.338070+02:00 linux-zied kernel: [1879445.039824] Code: 58 a0 31 c0 e8 e1 24 02 e1 e9 2b ff ff ff 48 8b 07 8b 57 14 48 c7 c7 92 01 58 a0 48 8b 70 20 31 c0 e8 c4 24 02 e1 e9 f5 fe ff ff <0f> 0b 0f 0b 48 83 ec 18 48 89 5c 24 08 48 89 fb 48 89 6c 24 10 2013-05-15T09:21:42.338071+02:00 linux-zied kernel: [1879445.041757] RIP [<ffffffffa056a50c>] svc_destroy+0x11c/0x120 [sunrpc] 2013-05-15T09:21:42.338071+02:00 linux-zied kernel: [1879445.043195] RSP <ffff8804348c1a08> 2013-05-15T09:21:42.338071+02:00 linux-zied kernel: [1879445.061504] ---[ end trace a42ffc2f62a45091 ]--- 2013-05-15T09:21:45.357766+02:00 linux-zied automount[3143]: rmdir_path: lstat of /suse/sdb failed -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.

https://bugzilla.novell.com/show_bug.cgi?id=820028 https://bugzilla.novell.com/show_bug.cgi?id=820028#c1 Jiri Slaby <jslaby@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |NEEDINFO CC| |jslaby@suse.com InfoProvider| |stephan.barth@suse.com --- Comment #1 from Jiri Slaby <jslaby@suse.com> 2013-06-28 06:45:28 UTC --- Isn't it fixed with commit 37add3a70dcbf215dd35935a9957b398b1381703 Author: Jiri Slaby <jslaby@suse.cz> Date: Sun Mar 24 11:40:51 2013 +0100 svcrpc: fix rpc server shutdown races (bnc#802812). ? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.

https://bugzilla.novell.com/show_bug.cgi?id=820028 https://bugzilla.novell.com/show_bug.cgi?id=820028#c2 Stephan Barth <stephan.barth@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |NEW InfoProvider|stephan.barth@suse.com | --- Comment #2 from Stephan Barth <stephan.barth@suse.com> 2013-06-28 08:54:21 CEST --- I thought so too and had updated the kernel, but it crashed twice again with similar behavior. Yesterday it crashed again and I updated to the most recent kernel. I can check. If you want you can close the bug. I can reopen it if necessary. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.

https://bugzilla.novell.com/show_bug.cgi?id=820028 https://bugzilla.novell.com/show_bug.cgi?id=820028#c3 Jiri Slaby <jslaby@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |NEEDINFO InfoProvider| |nfbrown@suse.com --- Comment #3 from Jiri Slaby <jslaby@suse.com> 2013-06-28 06:55:57 UTC --- Neil, any ideas, please? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.

https://bugzilla.novell.com/show_bug.cgi?id=820028 https://bugzilla.novell.com/show_bug.cgi?id=820028#c4 Neil Brown <nfbrown@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |ASSIGNED CC| |nfbrown@suse.com InfoProvider|nfbrown@suse.com | AssignedTo|kernel-maintainers@forge.pr |nfbrown@suse.com |ovo.novell.com | --- Comment #4 from Neil Brown <nfbrown@suse.com> 2013-06-28 21:03:36 UTC --- Hi Stephan, do you have a complete Oops trace from a kernel which includes the patch Jiri mentioned. I know you said "similar behaviour", but sometimes small details can be different and very important. I'll have a look next week and see what I can find. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.

https://bugzilla.novell.com/show_bug.cgi?id=820028 https://bugzilla.novell.com/show_bug.cgi?id=820028#c5 --- Comment #5 from Neil Brown <nfbrown@suse.com> 2013-07-01 00:36:09 UTC --- Definitely a different bug to the one fixed by the commit in comment 1. nfs4_init_client is hitting an error and when it calls nfs_put_client something gets confused and crashes. No upstream patches look related. I'll keep hunting. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.

https://bugzilla.novell.com/show_bug.cgi?id=820028 https://bugzilla.novell.com/show_bug.cgi?id=820028#c6 Neil Brown <nfbrown@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |RESOLVED Resolution| |FIXED --- Comment #6 from Neil Brown <nfbrown@suse.com> 2013-07-01 01:16:52 UTC --- On the other hand ... maybe it is the same bug. When nfs_put_client is called it calls nfs_callback_down. It calls nfs_callback_down_net() which should call svc_shutdown_net which closes all sockets. It then calls svc_exit_thread() which finds some sockets still open (hence the bug). There are no permanent sockets (TCP listeners and UDP), only temporary sockets (TCP connections), so it certainly seems that svc_shutdown_net has run. The commit mentioned fixed a race were svc_shutdown_net might miss some sockets. So I think this trace does match that commit. If you can get a BUG trace with a newer kernel that has that commit I'd certainly like to see it. For now: closing as suggested in comment #2. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@novell.com