[Bug 704788] New: kernel BUG at nfs4state.c:391 after you update
https://bugzilla.novell.com/show_bug.cgi?id=704788 https://bugzilla.novell.com/show_bug.cgi?id=704788#c0 Summary: kernel BUG at nfs4state.c:391 after you update Classification: openSUSE Product: openSUSE 11.4 Version: Final Platform: Other OS/Version: openSUSE 11.4 Status: NEW Severity: Critical Priority: P5 - None Component: Kernel AssignedTo: kernel-maintainers@forge.provo.novell.com ReportedBy: sweet_f_a@gmx.de QAContact: qa@suse.de Found By: --- Blocker: --- User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:5.0) Gecko/20100101 Firefox/5.0 Hi, today some seconds after maintainance updates were dropped in on nfs server and some clients I've got the trace below on server. The following simultaneous events on server(glaukos) and client(skylla) seems to be the cause of this: skylla:~ # rpm -qa --last |head |grep nfs nfs-kernel-server-1.2.3-11.16.1 Sat 09 Jul 2011 05:00:14 AM CEST nfs-client-1.2.3-11.16.1 Sat 09 Jul 2011 05:00:12 AM CEST glaukos(server):~ # rpm -qa --last |head |grep nfs nfs-kernel-server-1.2.3-11.16.1 Sat 09 Jul 2011 05:00:13 AM CEST nfs-client-1.2.3-11.16.1 Sat 09 Jul 2011 05:00:09 AM CEST My important question now: Is this just an unlucky thing because of that stressful mix of update/restarting clients and servers while umounting/mounting exports plus a lot NFS I/O? Or are these nfs updates buggy? Here the server log which starts with some stress by client, then you update on server and finally the trace and hanging clients: Jul 9 05:00:01 glaukos run-crons[10218]: logrotate: OK Jul 9 05:00:03 glaukos rpc.mountd[1881]: authenticated mount request from skylla.ga.local:970 for /exports/var/lib/software/opensuse/distribution/11.4/repo/non-oss (/exports/var/lib/software) Jul 9 05:00:03 glaukos rpc.mountd[1881]: authenticated unmount request from skylla.ga.local:861 for /exports/var/lib/software/opensuse/distribution/11.4/repo/non-oss (/exports/var/lib/software) Jul 9 05:00:03 glaukos rpc.mountd[1881]: authenticated mount request from skylla.ga.local:759 for /exports/var/lib/software/opensuse/distribution/11.4/repo/oss (/exports/var/lib/software) Jul 9 05:00:03 glaukos rpc.mountd[1881]: authenticated unmount request from skylla.ga.local:867 for /exports/var/lib/software/opensuse/distribution/11.4/repo/oss (/exports/var/lib/software) Jul 9 05:00:03 glaukos rpc.mountd[1881]: authenticated mount request from skylla.ga.local:966 for /exports/var/lib/software/opensuse/update/11.4 (/exports/var/lib/software) Jul 9 05:00:03 glaukos rpc.mountd[1881]: authenticated unmount request from skylla.ga.local:873 for /exports/var/lib/software/opensuse/update/11.4 (/exports/var/lib/software) Jul 9 05:00:03 glaukos rpc.mountd[1881]: authenticated mount request from skylla.ga.local:682 for /exports/var/lib/software/opensuse/update/11.4 (/exports/var/lib/software) Jul 9 05:00:03 glaukos rpc.mountd[1881]: authenticated unmount request from skylla.ga.local:879 for /exports/var/lib/software/opensuse/update/11.4 (/exports/var/lib/software) Jul 9 05:00:03 glaukos rpc.mountd[1881]: authenticated mount request from skylla.ga.local:1021 for /exports/var/lib/software/opensuse/update/11.4 (/exports/var/lib/software) Jul 9 05:00:04 glaukos rpc.mountd[1881]: authenticated unmount request from skylla.ga.local:922 for /exports/var/lib/software/opensuse/update/11.4 (/exports/var/lib/software) Jul 9 05:00:09 glaukos useradd[10369]: account already exists - account=statd, by=0 Jul 9 05:00:11 glaukos rpc.mountd[1881]: authenticated mount request from skylla.ga.local:725 for /exports/var/lib/software/opensuse/update/11.4 (/exports/var/lib/software) Jul 9 05:00:11 glaukos rpc.statd[1884]: Caught signal 15, un-registering and exiting Jul 9 05:00:11 glaukos rpc.statd[10408]: Version 1.2.3 starting Jul 9 05:00:11 glaukos rpc.statd[10408]: Flags: TI-RPC Jul 9 05:00:13 glaukos kernel: [3225039.515240] nfsd: last server has exited, flushing export cache Jul 9 05:00:13 glaukos rpc.statd[10408]: Caught signal 15, un-registering and exiting Jul 9 05:00:13 glaukos rpc.mountd[1881]: Caught signal 15, un-registering and exiting. Jul 9 05:00:13 glaukos rpc.mountd[10485]: Version 1.2.3 starting Jul 9 05:00:13 glaukos rpc.statd[10497]: Version 1.2.3 starting Jul 9 05:00:13 glaukos rpc.statd[10497]: Flags: TI-RPC Jul 9 05:00:14 glaukos kernel: [3225040.193389] NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Jul 9 05:00:14 glaukos kernel: [3225040.193490] NFSD: starting 90-second grace period Jul 9 05:00:14 glaukos sm-notify[10528]: Version 1.2.3 starting Jul 9 05:00:14 glaukos sm-notify[10528]: Backgrounding to notify hosts... Jul 9 05:00:18 glaukos run-crons[10218]: opensuse.org-online_update: OK Jul 9 05:00:18 glaukos run-crons[10218]: suse-clean_catman: OK Jul 9 05:00:18 glaukos run-crons[10218]: suse-do_mandb: OK Jul 9 05:00:19 glaukos su: (to nobody) root on none Jul 9 05:00:24 su: last message repeated 2 times Jul 9 05:00:24 glaukos run-crons[10218]: suse-updatedb: OK Jul 9 05:00:25 glaukos run-crons[10218]: suse.de-backup-rc.config: OK Jul 9 05:00:26 glaukos run-crons[10218]: suse.de-backup-rpmdb: OK Jul 9 05:00:26 glaukos run-crons[10218]: suse.de-check-battery: OK Jul 9 05:00:26 glaukos run-crons[10218]: suse.de-clean-tmp: OK Jul 9 05:00:26 glaukos run-crons[10218]: suse.de-cron-local: OK Jul 9 05:00:39 glaukos kernel: [3225065.434416] ------------[ cut here ]------------ Jul 9 05:00:39 glaukos kernel: [3225065.434429] kernel BUG at /usr/src/packages/BUILD/kernel-desktop-2.6.37.6/linux-2.6.37/fs/nfsd/nfs4state.c:391! Jul 9 05:00:39 glaukos kernel: [3225065.434439] invalid opcode: 0000 [#1] PREEMPT SMP Jul 9 05:00:39 glaukos kernel: [3225065.434448] last sysfs file: /sys/devices/system/cpu/cpu3/cache/index2/shared_cpu_map Jul 9 05:00:39 glaukos kernel: [3225065.434456] CPU 0 Jul 9 05:00:39 glaukos kernel: [3225065.434459] Modules linked in: microcode raid456 async_raid6_recov async_pq raid6_pq async_xor xor async_memcpy async_tx md5 nfsd lockd nfs_acl auth_rpcgss sunrpc w83793 hwmon_vid cor etemp edd cpufreq_conservative cpufreq_userspace cpufreq_powersave acpi_cpufreq mperf xfs exportfs radeon ttm drm_kms_helper drm i3200_edac e1000e ses enclosure iTCO_wdt edac_core i2c_algo_bit i2c_i801 iTCO_vendor_suppor t shpchp sg pci_hotplug sr_mod cdrom pcspkr ghes serio_raw hed video container button ext4 jbd2 crc16 linear dm_snapshot dm_mod fan processor thermal thermal_sys aacraid Jul 9 05:00:39 glaukos kernel: [3225065.434545] Jul 9 05:00:39 glaukos kernel: [3225065.434551] Pid: 10526, comm: nfsd Not tainted 2.6.37.6-0.5-desktop #1 Supermicro X7SB4/E/X7SB4/E Jul 9 05:00:39 glaukos kernel: [3225065.434564] RIP: 0010:[<ffffffffa04ee3b2>] [<ffffffffa04ee3b2>] nfs4_access_to_omode+0x12/0x40 [nfsd] Jul 9 05:00:39 glaukos kernel: [3225065.434588] RSP: 0018:ffff88003c755b98 EFLAGS: 00010297 Jul 9 05:00:39 glaukos kernel: [3225065.434595] RAX: 0000000000000004 RBX: ffff8800490aebf0 RCX: ffff88003c755b90 Jul 9 05:00:39 glaukos kernel: [3225065.434602] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000 Jul 9 05:00:39 glaukos kernel: [3225065.434610] RBP: ffff8800071b20d8 R08: dead000000100100 R09: dead000000200200 Jul 9 05:00:39 glaukos kernel: [3225065.434617] R10: dead000000100100 R11: dead000000200200 R12: ffff8800071b2110 Jul 9 05:00:39 glaukos kernel: [3225065.434625] R13: ffff8800071b20d8 R14: ffff88001eb0aa58 R15: 0000000000000000 Jul 9 05:00:39 glaukos kernel: [3225065.434633] FS: 0000000000000000(0000) GS:ffff8800cfc00000(0000) knlGS:0000000000000000 Jul 9 05:00:39 glaukos kernel: [3225065.434642] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b Jul 9 05:00:39 glaukos kernel: [3225065.434649] CR2: 00007f0173626000 CR3: 0000000225588000 CR4: 00000000000006f0 Jul 9 05:00:39 glaukos kernel: [3225065.434657] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 Jul 9 05:00:39 glaukos kernel: [3225065.434664] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 Jul 9 05:00:39 glaukos kernel: [3225065.434672] Process nfsd (pid: 10526, threadinfo ffff88003c754000, task ffff8802241b2400) Jul 9 05:00:39 glaukos kernel: [3225065.434679] Stack: Jul 9 05:00:39 glaukos kernel: [3225065.434684] ffffffffa04ef2e0 ffff88001eb0aa58 00000000071b20d8 ffff8800a3ea2dd0 Jul 9 05:00:39 glaukos kernel: [3225065.434696] ffff8800490aebf0 ffff8800071b20d8 ffffffffa04ef421 ffff8800071b20d8 Jul 9 05:00:39 glaukos kernel: [3225065.434707] 000000001d270000 ffff8802249fd040 ffffffffa04ef4f9 ffff8802257ce1a0 Jul 9 05:00:39 glaukos kernel: [3225065.434718] Call Trace: Jul 9 05:00:39 glaukos kernel: [3225065.434786] [<ffffffffa04ef2e0>] free_generic_stateid+0x20/0xb0 [nfsd] Jul 9 05:00:39 glaukos kernel: [3225065.434850] [<ffffffffa04ef421>] unhash_lockowner+0xb1/0x180 [nfsd] Jul 9 05:00:39 glaukos kernel: [3225065.434914] [<ffffffffa04ef4f9>] release_lockowner+0x9/0x20 [nfsd] Jul 9 05:00:39 glaukos kernel: [3225065.434977] [<ffffffffa04f4618>] nfsd4_lock+0x258/0x5e0 [nfsd] Jul 9 05:00:39 glaukos kernel: [3225065.435045] [<ffffffffa04e3ca1>] nfsd4_proc_compound+0x3f1/0x4d0 [nfsd] Jul 9 05:00:39 glaukos kernel: [3225065.435095] [<ffffffffa04d198d>] nfsd_dispatch+0xfd/0x240 [nfsd] Jul 9 05:00:39 glaukos kernel: [3225065.435122] [<ffffffffa0487184>] svc_process_common+0x344/0x680 [sunrpc] Jul 9 05:00:39 glaukos kernel: [3225065.435169] [<ffffffffa04875ce>] svc_process+0x10e/0x150 [sunrpc] Jul 9 05:00:39 glaukos kernel: [3225065.435210] [<ffffffffa04d10e2>] nfsd+0xb2/0x150 [nfsd] Jul 9 05:00:39 glaukos kernel: [3225065.435224] [<ffffffff81079ad6>] kthread+0x96/0xa0 Jul 9 05:00:39 glaukos kernel: [3225065.435238] [<ffffffff81003d74>] kernel_thread_helper+0x4/0x10 Jul 9 05:00:39 glaukos kernel: [3225065.435247] Code: 03 e1 eb 97 83 eb 01 75 d9 31 c0 eb a2 66 66 66 2e 0f 1f 84 00 00 00 00 00 83 e7 03 83 ff 02 74 28 83 ff 03 74 13 83 ff 01 74 06 <0f> 0b 0f 1f 40 00 31 c0 c3 0f 1f 44 00 00 b8 02 00 00 00 c3 66 Jul 9 05:00:39 glaukos kernel: [3225065.435302] RIP [<ffffffffa04ee3b2>] nfs4_access_to_omode+0x12/0x40 [nfsd] Jul 9 05:00:39 glaukos kernel: [3225065.435320] RSP <ffff88003c755b98> Jul 9 05:00:39 glaukos kernel: [3225065.534070] ---[ end trace a6e93c30dd913031 ]--- Jul 9 05:16:14 glaukos sm-notify[10529]: Unable to notify chantico.ga.local, giving up Jul 9 05:16:14 glaukos sm-notify[10529]: Unable to notify otto.ga.local, giving up Jul 9 05:16:14 glaukos sm-notify[10529]: Unable to notify quant.ga.local, giving up Jul 9 05:16:14 glaukos sm-notify[10529]: Unable to notify clyde.ga.local, giving up Reproducible: Always Steps to Reproduce: 1. 2. 3. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=704788
https://bugzilla.novell.com/show_bug.cgi?id=704788#c1
--- Comment #1 from Ruediger Meier
https://bugzilla.novell.com/show_bug.cgi?id=704788
https://bugzilla.novell.com/show_bug.cgi?id=704788#c2
Randall Smith
https://bugzilla.novell.com/show_bug.cgi?id=704788
https://bugzilla.novell.com/show_bug.cgi?id=704788#c3
--- Comment #3 from Ruediger Meier
https://bugzilla.novell.com/show_bug.cgi?id=704788
https://bugzilla.novell.com/show_bug.cgi?id=704788#c4
--- Comment #4 from Ruediger Meier
https://bugzilla.novell.com/show_bug.cgi?id=704788
https://bugzilla.novell.com/show_bug.cgi?id=704788#c4
--- Comment #4 from Ruediger Meier
https://bugzilla.novell.com/show_bug.cgi?id=704788
https://bugzilla.novell.com/show_bug.cgi?id=704788#c5
--- Comment #5 from Ruediger Meier
https://bugzilla.novell.com/show_bug.cgi?id=704788
https://bugzilla.novell.com/show_bug.cgi?id=704788#c
Leonardo Chiquitto
https://bugzilla.novell.com/show_bug.cgi?id=704788
https://bugzilla.novell.com/show_bug.cgi?id=704788#c6
Neil Brown
https://bugzilla.novell.com/show_bug.cgi?id=704788
https://bugzilla.novell.com/show_bug.cgi?id=704788#c7
Neil Brown
https://bugzilla.novell.com/show_bug.cgi?id=704788
https://bugzilla.novell.com/show_bug.cgi?id=704788#c8
Randall Smith
https://bugzilla.novell.com/show_bug.cgi?id=704788
https://bugzilla.novell.com/show_bug.cgi?id=704788#c9
Marcus Meissner
https://bugzilla.novell.com/show_bug.cgi?id=704788
https://bugzilla.novell.com/show_bug.cgi?id=704788#c10
--- Comment #10 from Randall Smith
https://bugzilla.novell.com/show_bug.cgi?id=704788
https://bugzilla.novell.com/show_bug.cgi?id=704788#c11
Neil Brown
https://bugzilla.novell.com/show_bug.cgi?id=704788
https://bugzilla.novell.com/show_bug.cgi?id=704788#c12
--- Comment #12 from Bernhard Wiedemann
https://bugzilla.novell.com/show_bug.cgi?id=704788
https://bugzilla.novell.com/show_bug.cgi?id=704788#c13
Swamp Workflow Management
participants (1)
-
bugzilla_noreply@novell.com