http://bugzilla.novell.com/show_bug.cgi?id=557760 http://bugzilla.novell.com/show_bug.cgi?id=557760#c0 Summary: rpciod taking 100% of cpu, box *almost* unusable Classification: openSUSE Product: openSUSE 11.2 Version: Final Platform: x86-64 OS/Version: Other Status: NEW Severity: Major Priority: P5 - None Component: Kernel AssignedTo: kernel-maintainers@forge.provo.novell.com ReportedBy: jnelson-suse@jamponi.net QAContact: qa@suse.de Found By: --- Blocker: --- User-Agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.1.4) Gecko/20091016 SUSE/3.5.4-1.1.2 Firefox/3.5.4 While reading/writing an .iso over NFS (gig-e connection), rpciod/0 started consuming 100% of the CPU, and it stayed that way. The mount is NFSv4. echo 't' > /proc/sysrq-trigger and looking for the process: Nov 23 08:58:53 frank kernel: [ 3169.336539] rpciod/0 R running task 0 1471 2 0x00000008 Nov 23 08:58:53 frank kernel: [ 3169.336539] 0000000000000000 000000006521588b ffff88002c45c610 ffffffffa029b730 Nov 23 08:58:53 frank kernel: [ 3169.336539] ffffffffa0297be0 ffffc90010a43448 ffff88002c53be20 ffffffffa0297c0d Nov 23 08:58:53 frank kernel: [ 3169.336539] 0000000000000010 000000006521588b ffff88002c53be80 ffffffff81088631 Nov 23 08:58:53 frank kernel: [ 3169.336539] Call Trace: Nov 23 08:58:53 frank kernel: [ 3169.336539] Inexact backtrace: Nov 23 08:58:53 frank kernel: [ 3169.336539] Nov 23 08:58:53 frank kernel: [ 3169.336539] [<ffffffffa029b730>] ? rpc_async_schedule+0x0/0x40 [sunrpc] Nov 23 08:58:53 frank kernel: [ 3169.336539] [<ffffffffa029b753>] ? rpc_async_schedule+0x23/0x40 [sunrpc] Nov 23 08:58:53 frank kernel: [ 3169.336539] [<ffffffff81088631>] ? run_workqueue+0xc1/0x1f0 Nov 23 08:58:53 frank kernel: [ 3169.336539] [<ffffffff81088814>] ? worker_thread+0xb4/0x140 Nov 23 08:58:54 frank kernel: [ 3169.336539] [<ffffffff8108f390>] ? autoremove_wake_function+0x0/0x60 Nov 23 08:58:54 frank kernel: [ 3169.336539] [<ffffffff81088760>] ? worker_thread+0x0/0x140 Nov 23 08:58:54 frank kernel: [ 3169.336539] [<ffffffff8108ec26>] ? kthread+0xb6/0xc0 Nov 23 08:58:54 frank kernel: [ 3169.336539] [<ffffffff8100d70a>] ? child_rip+0xa/0x20 Nov 23 08:58:54 frank kernel: [ 3169.336539] [<ffffffff8108eb70>] ? kthread+0x0/0xc0 Nov 23 08:58:54 frank kernel: [ 3169.336539] [<ffffffff8100d700>] ? child_rip+0x0/0x20 Other processes are stuck in 'D' for more than 120s. This appears to be a regression. Easy to reproduce. Client and server are both openSUSE 11.2, and up-to-date as of now. I enabled all of the rpc debug flags for rpc and that was a mistake. I see a lot of this: Nov 23 09:12:27 frank kernel: [ 3855.711413] 16588 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.711446] 16589 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.711482] 16590 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.711517] 16591 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.711550] 16592 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.711582] 16593 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.711619] 16594 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.711653] 16595 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.711686] 16596 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.711722] 16597 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.711756] 16598 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.711788] 16599 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.711821] 16600 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.711853] 16601 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.711886] 16602 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.711918] 16603 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.711951] 16604 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.711983] 16605 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.712016] 16606 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.712048] 16607 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.712079] 16608 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.712115] 16609 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog Nov 23 09:12:27 frank kernel: [ 3855.712148] 16610 0281 -11 ffff88002cc1d800 (null) 0 ffffffffa035ce10 nfsv4 RENEW a:call_reserveresult q:xprt_backlog but a bunch of other stuff too. Reproducible: Always Steps to Reproduce: 1. 2. 3. -- Configure bugmail: http://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.