https://bugzilla.novell.com/show_bug.cgi?id=644880 https://bugzilla.novell.com/show_bug.cgi?id=644880#c0 Summary: rpciod takes constant 10-20% cpu, X hangs (in NFSv4 environment) Classification: openSUSE Product: openSUSE 11.3 Version: Final Platform: x86-64 OS/Version: openSUSE 11.3 Status: NEW Severity: Normal Priority: P5 - None Component: Kernel AssignedTo: kernel-maintainers@forge.provo.novell.com ReportedBy: joschibrauchle@gmx.de QAContact: qa@suse.de Found By: --- Blocker: --- User-Agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US) AppleWebKit/534.3 (KHTML, like Gecko) Chrome/6.0.472.63 Safari/534.3 I have the following problem with Opensuse 11.3, x86_64 in an NFSv4 environment: Mostly in the morning when I return to the box (it's running idle all night), the X server hangs and 'top' shows 'rpciod/0' taking between 10-20% cpu constantly. A 'reboot' fails as stopping the 'automount' service fails due to my home (NFSv4 mount) being busy. Sometimes the X server freezes with the same symptoms even randomly during work, but mostly it seems to happen when the session is idle at night. I'm not very experienced in debugging a problem like this, so I definitely need instructions on how to get more information. Here is the information I have at hand: ----------------------- 'uname -a' on the Opensuse 11.3 box: Linux <hostname> 2.6.34.7-0.3-desktop #1 SMP PREEMPT 2010-09-20 15:27:38 +0200 x86_64 x86_64 x86_64 GNU/Linux ----------------------- 'mount' returns the following line on my home: 192.168.109.3:/home/staff/<username> on /home/<username> type nfs4 (rw,rsize=32768,wsize=32768,sec=krb5,sloppy,addr=192.168.109.3,clientaddr=192.168.109.72) ----------------------- I get a LOT of these messages in '/var/log/messages': Sep 28 18:30:58 <hostname> kernel: [117887.140931] NFS: v4 server returned a bad sequence-id error on an unconfirmed sequence ffff88007babd828! ----------------------- The NFS server is running SLES 10SP3 in a two-node cluster configuration with a shared IP (192.168.109.3), serving NFSv3 and NFSv4 (with Kerberos/GSS Security). /etc/exports on the server contains: ----------------------- # NFSv4 entries (with Kerberos and GSS Security): /export gss/krb5(rw,fsid=0,no_all_squash,async,no_subtree_check) We have about 40 OpenSuse 11.1 clients running getting their homes from this server, they are all running fine without problems for several months. It seems to be a problem in the kernel of 11.3. As I said, just let me know what more debug info is needed and how to obtain it. Also I have not found a way to reproduce or trigger the problem manually. Thanks! Reproducible: Couldn't Reproduce -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.