[Bug 743101] New: kworker thread jumps to 100% when laptop is redocked
https://bugzilla.novell.com/show_bug.cgi?id=743101 https://bugzilla.novell.com/show_bug.cgi?id=743101#c0 Summary: kworker thread jumps to 100% when laptop is redocked Classification: openSUSE Product: openSUSE 12.1 Version: Final Platform: x86-64 OS/Version: Other Status: NEW Severity: Normal Priority: P5 - None Component: Kernel AssignedTo: kernel-maintainers@forge.provo.novell.com ReportedBy: aspiers@suse.com QAContact: qa@suse.de Found By: Development Blocker: --- I am running fully patched openSUSE 12.1 x86_64. Every time I put my Dell Latitude D630 back in the docking station, the kworker kernel thread jumps to 100% CPU (i.e. one of the two cores) and stays there until reboot. /var/log/messages fills up with these errors: Jan 24 12:09:37 southern kernel: [254699.227808] ACPI Error: Method parse/execution failed [\SMI_] (Node ffff88007b772a88), AE_AML_INFINITE_LOOP (20110623/psparse-536) Jan 24 12:09:37 southern kernel: [254699.227821] ACPI Error: Method parse/execution failed [\DCKE] (Node ffff88007b778a60), AE_AML_INFINITE_LOOP (20110623/psparse-536) Jan 24 12:09:37 southern kernel: [254699.227828] ACPI Error: Method parse/execution failed [\NEVT] (Node ffff88007b772da8), AE_AML_INFINITE_LOOP (20110623/psparse-536) Jan 24 12:09:37 southern kernel: [254699.227835] ACPI Error: Method parse/execution failed [\_GPE._L1C] (Node ffff88007b778920), AE_AML_INFINITE_LOOP (20110623/psparse-536) Jan 24 12:09:37 southern kernel: [254699.227846] ACPI Exception: AE_AML_INFINITE_LOOP, while evaluating GPE method [_L1C] (20110623/evgpe-560) This is similar to bug 689848 and bug 714455, but it must be different because it is only triggered by a redock, and also I am running the 3.1.0-1.2-desktop kernel, which is quite a lot newer. Happy to provide further debug if required - problem is 100% reproducible. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c1
Guy Zelck
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c2
--- Comment #2 from Greg Kroah-Hartman
I'm waiting for almost a year now just for opensuse to package a more recent kernel (>= 2.6.38) than the 2.6.37.x of the 11.4 distribution. The bug in 714455 is solved.So, good luck 8-(. Can you vote for my bug, maybe that will draw their attention.
That's not going to happen, if you need a newer kernel, please use 12.1, or the Kernel:stable repo. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c3
Steve Revilak
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c
Jeff Mahoney
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c4
Joey Lee
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c5
--- Comment #5 from Adam Spiers
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c6
Steve Revilak
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c7
--- Comment #7 from Steve Revilak
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c8
--- Comment #8 from Steve Revilak
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c9
--- Comment #9 from Steve Revilak
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c10
--- Comment #10 from Michael Chang
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c
Joey Lee
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c11
--- Comment #11 from Steve Revilak
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c12
Joey Lee
Michael,
No, I didn't need a dock to reproduce this condition (high CPU consumption by kworker processes). I agree -- it is possible that I am seeing similar symptoms from a different root cause.
I've just installed the perf package, and I will get you a set of ``perf top'' output.
There have kernel oops from r8192se_pci in your dmesg, please add it to blacklist or remove it then test again. And, Your machine is using 2.6.37.6-0.11-desktop, does it openSUSE 12.1? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c13
--- Comment #13 from Steve Revilak
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c14
--- Comment #14 from Steve Revilak
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c15
--- Comment #15 from Michael Chang
Output from "perf top"
i915 seems to be the culprit, get_clock spent cpu much more cycles than others. Take a bit look at the kernel source, get_clock seems to be related with intel gmbus, which is used to display detection/hotplug function based on i2c protocol. Can't tell too much what's going wrong due to lack of knowledge. 1. If you could provide more info related with your connected display would be great .. for eg, did you plug an external monitor when booting ? 2. Try kotd .. form "git log drivers/gpu/drm/i915/intel_i2c.c" seems to have many fixes related with gmbus recently. http://en.opensuse.org/openSUSE:Kernel_of_the_day 3. Possible to blacklist i915 and use runlevel 3 to see if it happen or not? (not sure systemd's step to do this, suppose same with init .. I'll post the step for your reference once I know howto do it on 12.1) Thanks. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c16
--- Comment #16 from Michael Chang
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c17
--- Comment #17 from Steve Revilak
find /lib/modules/3.1.9-1.4-desktop/ -name rtl8192\*.ko /lib/modules/3.1.9-1.4-desktop/kernel/drivers/net/wireless/rtlwifi/rtl8192c/rtl8192c-common.ko /lib/modules/3.1.9-1.4-desktop/kernel/drivers/net/wireless/rtlwifi/rtl8192se/rtl8192se.ko /lib/modules/3.1.9-1.4-desktop/kernel/drivers/net/wireless/rtlwifi/rtl8192cu/rtl8192cu.ko /lib/modules/3.1.9-1.4-desktop/kernel/drivers/net/wireless/rtlwifi/rtl8192ce/rtl8192ce.ko /lib/modules/3.1.9-1.4-desktop/kernel/drivers/net/wireless/rtlwifi/rtl8192de/rtl8192de.ko
find /lib/modules/3.1.9-1.4-desktop/ -name rtl8192\*.ko | xargs -n1 rpm -qf kernel-desktop-3.1.9-1.4.1.i586 kernel-desktop-3.1.9-1.4.1.i586 kernel-desktop-3.1.9-1.4.1.i586 kernel-desktop-3.1.9-1.4.1.i586 kernel-desktop-3.1.9-1.4.1.i586
boot.msg still shows the same kernel oops. Next, I added "blacklist r8192se_pci" to /etc/modprobe.d/99-local.conf and rebooted. Upon reboot, my laptop's wireless no longer worked (expected), but it was only a few minutes before kworker started consuming most of one CPU core. Here is "perf top" taken while kworker was monopolizing the CPU. PerfTop: 45 irqs/sec kernel:82.2% exact: 0.0% [1000Hz cycles], (all, 2 CPUs) ------------------------------------------------------------------------------------------- samples pcnt function DSO _______ _____ ______________________ ______________ 597.00 69.5% get_clock [i915] 73.00 8.5% delay_tsc [kernel] 58.00 6.8% read_hpet [kernel] 12.00 1.4% mark_object emacs 9.00 1.0% _raw_spin_lock_irqsave [kernel] 8.00 0.9% acpi_os_read_port [kernel] 7.00 0.8% i8042_interrupt [kernel] 6.00 0.7% strlen libc-2.14.1.so 5.00 0.6% critical_factorization libc-2.14.1.so This occurred in runlevel 5. I'm NOT using any external hardware with my laptop (monitor, keyboard, mouse, etc). Next, I tried booting into runlevel 3, still with rtl9192se_pci blacklisted. cd /etc/systemd/system sudo mv default.target default.target.OLD sudo ln -s /lib/systemd/system/runlevel3.target default.target sudo reboot I used my laptop for several hours in this configuration (I needed to do some writing ... but emacs is happy to work in runlevel 3). The kworker process never monopolized the CPU. Normally, that would have happened at least once. Unfortunately, turning of wireless and X11 does limit the utility of my laptop. Next, I'll try kernel of the day. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c18
Steve Revilak
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c19
Rafael Wysocki
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c21
Steve Revilak
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c22
Rafael Wysocki
https://bugzilla.novell.com/show_bug.cgi?id=743101
https://bugzilla.novell.com/show_bug.cgi?id=743101#c23
--- Comment #23 from Steve Revilak
participants (1)
-
bugzilla_noreply@novell.com