[Bug 247566] New: Missing (important) thermal zone on HP tc4200
https://bugzilla.novell.com/show_bug.cgi?id=247566 Summary: Missing (important) thermal zone on HP tc4200 Product: openSUSE 10.2 Version: Final Platform: i686 OS/Version: Other Status: NEW Severity: Major Priority: P5 - None Component: Kernel AssignedTo: kernel-maintainers@forge.provo.novell.com ReportedBy: jbr.1@gmx.net QAContact: qa@suse.de My HP tc4200 had no ACPI problems with SuSE 10.1. But now after installing (from scratch) openSUSE 10.2 I got temperature problems. Obviously a thermal zone is missing and the system overheats sooner or later. I've already installed: ftp://ftp.suse.com/pub/projects/kernel/kotd/10.2-i386/SL102_BRANCH/kernel-default-2.6.18.5-SL102_BRANCH_20070220193246.i586.rpm I will attach dmesg and acpidump. The interesting part of dmesg: ACPI: read EC, IB not empty ACPI: read EC, OB not full ACPI Exception (evregion-0424): AE_TIME, Returned by Handler for [EmbeddedControl] [20060707] ACPI Exception (dswexec-0458): AE_TIME, While resolving operands for [OpcodeName unavailable] [20060707] ACPI Error (psparse-0537): Method parse/execution failed [\_SB_.C002.C003.C004.C005] (Node df7b34f0), AE_TIME ACPI Error (psparse-0537): Method parse/execution failed [\_PR_.C000._CST] (Node df7bc310), AE_TIME ACPI: CPU0 (power states: C1[C1] C2[C2] C3[C3]) ACPI: Processor [C000] (supports 8 throttling states) Time: acpi_pm clocksource has been installed. ACPI: write EC, IB not empty ACPI: write EC, IB not empty ACPI: write EC, IB not empty ACPI Exception (evregion-0424): AE_TIME, Returned by Handler for [EmbeddedControl] [20060707] ACPI Error (psparse-0537): Method parse/execution failed [\_SB_.C002.C003.C004.C151] (Node df7b3658), AE_TIME ACPI Error (psparse-0537): Method parse/execution failed [\_TZ_.C25E] (Node df7afaa4), AE_TIME ACPI Error (psparse-0537): Method parse/execution failed [\_TZ_.TZ1_._TMP] (Node df7afef0), AE_TIME ACPI: Thermal Zone [TZ2] (36 C) ACPI: Thermal Zone [TZ3] (24 C) ACPI: Thermal Zone [TZ4] (50 C) Obviously "TZ1" is missing: # ls /proc/acpi/thermal_zone/ TZ2 TZ3 TZ4 Things look much better using ftp://ftp.suse.com/pub/projects/kernel/kotd/10.2-i386/SL102_BRANCH/kernel-vanilla-2.6.18.5-SL102_BRANCH_20070220193246.i586.rpm ACPI: read EC, IB not empty ACPI: read EC, OB not full ACPI Exception (evregion-0424): AE_TIME, Returned by Handler for [EmbeddedControl] [20060707] ACPI Exception (dswexec-0458): AE_TIME, While resolving operands for [OpcodeName unavailable] [20060707] ACPI Error (psparse-0537): Method parse/execution failed [\_SB_.C002.C003.C004.C005] (Node c14814f0), AE_TIME ACPI Error (psparse-0537): Method parse/execution failed [\_PR_.C000._CST] (Node c1478310), AE_TIME ACPI: CPU0 (power states: C1[C1] C2[C2] C3[C3]) ACPI: Processor [C000] (supports 8 throttling states) Time: acpi_pm clocksource has been installed. ACPI: write EC, IB not empty ACPI: write EC, IB not empty ACPI: Thermal Zone [TZ1] (45 C) ACPI: Thermal Zone [TZ2] (36 C) ACPI: Thermal Zone [TZ3] (24 C) ACPI: Thermal Zone [TZ4] (50 C) Still some error messages, but the thermal zone is there and seems to be working correctly. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=247566 ------- Comment #1 from jbr.1@gmx.net 2007-02-21 14:07 MST ------- Created an attachment (id=120394) --> (https://bugzilla.novell.com/attachment.cgi?id=120394&action=view) dmesg -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=247566 ------- Comment #2 from jbr.1@gmx.net 2007-02-21 14:08 MST ------- Created an attachment (id=120395) --> (https://bugzilla.novell.com/attachment.cgi?id=120395&action=view) acpidump -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=247566 ------- Comment #3 from jbr.1@gmx.net 2007-02-22 00:23 MST ------- Two additional comments: Suspend2RAM is not working using kernel-vanilla-2.6.18.5-SL102_BRANCH_20070220193246.i586.rpm :-( The thermal zone TZ1 is ok, when booting kernel-default-2.6.18.5-SL102_BRANCH_20070220193246.i586.rpm while the ac adapter is NOT plugged. Plugging it in after booting is complete, everything seems to work fine. The error messages (when adding the thermal zone) do not appear like described above when booting the vanilla kernel. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=247566 trenn@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |ASSIGNED ------- Comment #5 from trenn@novell.com 2007-02-22 06:00 MST ------- does ec_intr=0 work? Can you also try the latest head kernel (not in 10.2-arch but only arch/HEAD directory of the kernels of the day). Maybe this already got fixed in 2.6.19 or 2.6.20 (the latter is what you should find there). -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=247566 ------- Comment #6 from jbr.1@gmx.net 2007-02-22 07:44 MST ------- (In reply to comment #5)
does ec_intr=0 work?
No, I'm afraid, exactly the same errors.
Can you also try the latest head kernel (not in 10.2-arch but only arch/HEAD directory of the kernels of the day). Maybe this already got fixed in 2.6.19 or 2.6.20 (the latter is what you should find there).
I installed kernel-debug-2.6.20-20070221170021.i586.rpm But the kernel hangs during boot after: input: PS/2 Generic Mouse as /class/input/input3 <ctrl>-<alt>-<del> is still working... ;-) Nevertheless that doesn't look good at all... Do you have some more ideas? I will try testing a few other things an my own as soon as I have more time... -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=247566 ------- Comment #7 from jbr.1@gmx.net 2007-02-22 08:36 MST ------- Found enough time to boot Knoppix 5.1.1 ... It's using 2.6.19 and nearly all ACPI error messages have vanished. The thermal zones are obviously working correctly. Suspend2RAM does not work. Just the same as with kernel-vanilla-2.6.18.5-SL102_BRANCH_20070220193246.i586.rpm -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=247566 trenn@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |NEEDINFO Info Provider| |jbr.1@gmx.net ------- Comment #8 from trenn@novell.com 2007-02-28 02:34 MST ------- Can you try again latest kotd. This could be a side effect of the "execute notify handlers in own thread" patch that got removed: ftp://ftp.suse.com/pub/projects/kernel/kotd/10.2-i386/SL102_BRANCH/kernel-default-2.6.18.X-...rpm -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=247566 jbr.1@gmx.net changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |RESOLVED Info Provider|jbr.1@gmx.net | Resolution| |FIXED ------- Comment #9 from jbr.1@gmx.net 2007-02-28 03:52 MST ------- Installed and booted: kernel-default-2.6.18.8-SL102_BRANCH_20070227134733.i586.rpm Everything is working fine! Thermal zone is working, related error messages disappeared, and Suspend2RAM is working. Thanks very much! -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=247566 jbr.1@gmx.net changed: What |Removed |Added ---------------------------------------------------------------------------- Status|RESOLVED |REOPENED Resolution|FIXED | ------- Comment #10 from jbr.1@gmx.net 2007-03-13 04:47 MST ------- Sorry, I have to reopen this bug... I still have the same problems. I'm not really sure whether the problems reappeared or they never were really fixed by the new kernel. That's really strange. I'm quite sure that the problems disappeared after using the new kernel, but now everything is the same as before. I also realised some more quirks that most likely also existed before. E.g. when booting up ac adapter not connected, ACPI obviously "believes" ac is connected. After Suspend2RAM and waking up again, this changes and ACPI reports the ac being unplugged. Is it possible that due to parsing errors the whole ACPI is somehow "unstable"? The parsing errors seem to change depending on the state the machine is during booting, ac connected<->not connected, battery empty<->full. Any more ideas about what I can try to fix the problem? Thanks in advance for any help! -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=247566 trenn@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|REOPENED |ASSIGNED ------- Comment #11 from trenn@novell.com 2007-03-13 05:17 MST ------- This very much sounds like the psmouse needs to be cleaned on shutdown/suspend problem: http://bugzilla.kernel.org/show_bug.cgi?id=7689 Very interesting for you should be comment #45 and #46. That means: rebooting a fixed kernel will result in a working booted system rebooting a broken kernel will result in a broken booted system It doesn't matter whehther the newly booted kernel is fine or not. This quickly leads to confusing test results as e.g. yours above. Could it be that this is your problem? Try to get rid of old kernels or update. Latest 10.2 update kernel should have the fix. Not sure whether there came out a 10.1 update kernel yet with this fix. That means if you boot from a fixed 10.2 into a broken 10.1 kernel everything may look fine, even this kernel will break things. If you boot from 10.1 into a fixed 10.2 kernel things will be broken again. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=247566 trenn@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |NEEDINFO Info Provider| |jbr.1@gmx.net -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=247566 jbr.1@gmx.net changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |ASSIGNED Info Provider|jbr.1@gmx.net | ------- Comment #12 from jbr.1@gmx.net 2007-03-13 09:56 MST ------- Booting the newest update kernel more than once doesn't fix the problem for me :-( I've rebooted the notebook quite often and once the thermal zones were working correctly! It was after I booted Windows XP... But after the next reboot, the thermal zones were corrupted again. I tried to get them working using various reboot combinations with Windows XP, but I wasn't able to reproduce the successfully initialised thermal zones once again. Perhaps it depends on the state of the thermal zones during boot. One of the thermal zones (TZ4) represents obviously not a temperature, but the status of the case fan. This fan is on to 50°C (that's weird...), if the ac adapter is plugged in. If it's unplugged it goes down to 0°C. That would perhaps explain why initialising the thermal zones is working if ac is unplugged during boot... -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=247566 trenn@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |NEEDINFO Info Provider| |jbr.1@gmx.net ------- Comment #13 from trenn@novell.com 2007-03-14 07:45 MST -------
I've rebooted the notebook quite often and once the thermal zones were working correctly! It was after I booted Windows XP... But after the next reboot, the thermal zones were corrupted again. That very much sounds like the problem.
I tried to get them working using various reboot combinations with Windows XP, but I wasn't able to reproduce the successfully initialised thermal zones once again. Hmm, this does not.
Can you give me the most recent changelog entries of the kernel you are using. I'd like to check whether you really have a fixed kernel. rpm -q --changelog kernel-default-VERSION |head (you may want to check for the exact version via uname -r and/or rpm -qa |grep kernel). I do not think your problem has to do with the 0C thermal zone. This one (TZ4) is ok, right? It's the first not showing up? Also the error message points to a confused EC. Can you also check if there is a new BIOS for this machine! -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=247566 jbr.1@gmx.net changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |ASSIGNED Info Provider|jbr.1@gmx.net | ------- Comment #14 from jbr.1@gmx.net 2007-03-14 15:43 MST -------
Can you give me the most recent changelog entries of the kernel you are using. I'd like to check whether you really have a fixed kernel. rpm -q --changelog kernel-default-VERSION |head
I've included some more lines because it looks like the patch you're looking for is than listed: ================================================================= # rpm -q --changelog kernel-default-2.6.18.8-0.1 |head * Fr Mär 02 2007 - lmb@suse.de - patches.suse/revert-netfilter-source-routing: Revert parts of the netfilter patch from 2.6.18.5 due to kABI changes. * Do Mär 01 2007 - agruen@suse.de - scripts/kabi-checks: update so that this will also work with additional columns in the symvers files. Bump tolerate_kabi_changes to 31; we don't want the build to fail. * Mi Feb 28 2007 - trenn@suse.de hermod:~ # rpm -q --changelog kernel-default-2.6.18.8-0.1 |head -20 * Fr Mär 02 2007 - lmb@suse.de - patches.suse/revert-netfilter-source-routing: Revert parts of the netfilter patch from 2.6.18.5 due to kABI changes. * Do Mär 01 2007 - agruen@suse.de - scripts/kabi-checks: update so that this will also work with additional columns in the symvers files. Bump tolerate_kabi_changes to 31; we don't want the build to fail. * Mi Feb 28 2007 - trenn@suse.de - patches.fixes/serio_unregister_on_shutdown.patch: Delete. - patches.fixes/c-states_use_IPI.patch: Fix not waking up from C2 for Banias and Dothan Pentium Ms (216205). - patches.fixes/psmouse-fiddle-with-reset.patch: psmouse - properly reset mouse on shutdown/suspend (179702,202389,226069). Take the final patch -> remove quick hack serio_unregister_on_shutdown =================================================================
I do not think your problem has to do with the 0C thermal zone. This one (TZ4) is ok, right? It's the first not showing up?
Yes, that's right! But there are two points that make me think there's a connection: - If the AC adapter is _not_ plugged in during boot while loading of ACPI modules, the thermal zones are correct. And the AC adapter is obviously "controling" TZ4. I think: If the adapter is plugged in, TZ4 changes and TZ1 cannot be initialised. - There is a BIOS option that turns the fan connected to TZ4 always on only if the AC adapter is plugged in. If this options is disabled, Windows and a Linux with correctly initialised thermal zones don't turn on the fan if AC is plugged in. But Linux with corrupted thermal zones, ignores that the option is disabled and the fan is always turned on if AC is plugged in.
Also the error message points to a confused EC. Can you also check if there is a new BIOS for this machine!
I have the newest BIOS installed. Perhaps I should contact HP. There's an interesting entry in the "Revision History": - Fixes installation issue that occurs with SUSE Linux Looks like they care about Linux, although it's officially not supported... -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=247566 ------- Comment #15 from jbr.1@gmx.net 2007-05-03 06:37 MST ------- I was curious and installed 2.6.21.1 from the kernel.org source tarball. And everything seems to work fine! The thermal zones are correct (no parse errors). They were always working with vanilla kernels, but now Suspend2RAM is also working! I tried to install the following KOTD from HEAD: kernel-default-2.6.21-20070430084129.i586.rpm kernel-vanilla-2.6.21-20070430084129.i586.rpm But for both I got: error: Failed dependencies: perl-Bootloader >= 0.4.14 is needed by ... apparmor-profiles <= 2.0.1 conflicts with ... apparmor-parser <= 2.0.1 conflicts with ... Is it possible to install a 2.6.21 KOTD without updating half of my system RPMs to unstable versions? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
Is it possible to install a 2.6.21 KOTD without updating half of my system RPMs to unstable versions? Yes. I'd uninstall the apparmor things if you don't need them urgently. Then install with --no-deps (perl-Bootloader is updating /boot/grub/menu.lst, so you possibly have to do or correct this by hand -> just add a new entry to
https://bugzilla.novell.com/show_bug.cgi?id=247566 ------- Comment #16 from trenn@novell.com 2007-05-03 07:19 MST ------- the newly installed kernel). And you have to reinstall kmp (Kernel Module Packages) you make use of. These are all kind of drivers that are not in the mainline kernel, e.g. some WLan and/or graphics card drivers. rpm -qa |grep -i kmp should give you an overview of the packages you need to reinstall (if you use them). Sorry, I am quite busy currently and as this works for mainline, priority for this bug even drops some more... -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=247566#c17
Jochen Reinwand
participants (1)
-
bugzilla_noreply@novell.com