[Bug 387207] New: Processor is overheated
https://bugzilla.novell.com/show_bug.cgi?id=387207 Summary: Processor is overheated Product: openSUSE 11.0 Version: Factory Platform: i686 OS/Version: openSUSE 11.0 Status: NEW Severity: Major Priority: P5 - None Component: Basesystem AssignedTo: bnc-team-screening@forge.provo.novell.com ReportedBy: j.reitsma@hccnet.nl QAContact: qa@suse.de Found By: Beta-Customer The processor of my Acer 1710 laptop is much hotter running 11.0b3 then when running 10.3 (or, for that matter, 10.0 trough 10.2). Temp when running only one desktop, and no applications beside that is up to 75 degrees, where some 40 - 45 was normal. Changing to runlevel 1 doesn't lower the temperature apparently, so I file it under the Basesystem component. cat /proc/cpuinfo gives processor : 0
vendor_id : GenuineIntel cpu family : 15 model : 2 model name : Intel(R) Pentium(R) 4 CPU 2.80GHz stepping : 9 cpu MHz : 2793.029 cache size : 512 KB fdiv_bug : no hlt_bug : no f00f_bug : no coma_bug : no fpu : yes fpu_exception : yes cpuid level : 2 wp : yes flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe up pebs bts cid xtpr bogomips : 5591.59 clflush size : 64
This cpu doesn't support CPU frequency scaling, so there's cause nor solution to find. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User sboyce@blueyonder.co.uk added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c1 Sid Boyce <sboyce@blueyonder.co.uk> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |sboyce@blueyonder.co.uk --- Comment #1 from Sid Boyce <sboyce@blueyonder.co.uk> 2008-05-06 19:49:41 MST --- My Acer 1501LCe powers off if I boot 11.0 Alpha onwards from CD/DVD, is OK with 10.3. It will generally get as far as software selection, then power down. Also if I try building a kernel with "powersave -f" set. Building a kernel, I have to set "powersave -l". The one difference I notice straight off is that 11.0 immediately puts the fan into high speed, I guess because that's because the DVD kernel sets it to Performance. On another laptop which doesn't suffer the same problem (travelMate 7520), though it looks to be on the high side while currently building a new kernel. With "powersave -A" and it's operating at max CPU speed. Kernel build completed successfully and it has dropped back to 800Mhz. processor : 0 vendor_id : AuthenticAMD cpu family : 15 model : 104 model name : AMD Turion(tm) 64 X2 Mobile Technology TL-58 stepping : 2 cpu MHz : 1900.000 cache size : 512 KB physical id : 0 siblings : 2 core id : 0 cpu cores : 2 apicid : 0 initial apicid : 0 fpu : yes fpu_exception : yes cpuid level : 1 wp : yes flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt rdtscp lm 3dnowext 3dnow rep_good pni cx16 lahf_lm cmp_legacy svm extapic cr8_legacy 3dnowprefetch bogomips : 3855.76 TLB size : 1024 4K pages clflush size : 64 cache_alignment : 64 address sizes : 40 bits physical, 48 bits virtual power management: ts fid vid ttp tm stc 100mhzsteps processor : 1 vendor_id : AuthenticAMD cpu family : 15 model : 104 model name : AMD Turion(tm) 64 X2 Mobile Technology TL-58 stepping : 2 cpu MHz : 1900.000 cache size : 512 KB physical id : 0 siblings : 2 core id : 1 cpu cores : 2 apicid : 1 initial apicid : 1 fpu : yes fpu_exception : yes cpuid level : 1 wp : yes flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt rdtscp lm 3dnowext 3dnow rep_good pni cx16 lahf_lm cmp_legacy svm extapic cr8_legacy 3dnowprefetch bogomips : 3855.76 TLB size : 1024 4K pages clflush size : 64 cache_alignment : 64 address sizes : 40 bits physical, 48 bits virtual power management: ts fid vid ttp tm stc 100mhzsteps # sensors k8temp-pci-00c3 Adapter: PCI adapter Core0 Temp: +56.0°C Core0 Temp: +60.0°C Core1 Temp: +56.0°C Core1 Temp: +57.0°C Temperature readings not changing throughout the build. After the kernel build, still:- # sensors k8temp-pci-00c3 Adapter: PCI adapter Core0 Temp: +45.0°C Core0 Temp: +47.0°C Core1 Temp: +43.0°C Core1 Temp: +43.0°C Desktop with 64x2 6000+ (1000Mhz idle) # sensors k8temp-pci-00c3 Adapter: PCI adapter Core0 Temp: +22.0°C Core1 Temp: +30.0°C -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 Andreas Jaeger <aj@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- AssignedTo|bnc-team-screening@forge.provo.novell.com |trenn@novell.com -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User trenn@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c2 Thomas Renninger <trenn@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |rui.zhang@intel.com Status|NEW |ASSIGNED --- Comment #2 from Thomas Renninger <trenn@novell.com> 2008-05-07 04:34:28 MST --- Two totally different machines..., hope this will not give a mess and two unrelated problems...: Sid: Do you also have temp exported via: /proc/acpi/thermal_zone/*/* It may happen that sensor tools and ACPI are accessing the thermal sensor at the same time and confusing it, but this should not happen at installation. Can do: lsmod |grep thermal and try to unload everything that has to do with thermal (not sure, but there now might be a thermal_sys or similar named driver). Sid: better open another bug repost the info and assign to me or things will get too confusing. For the Intel P4, Rui could have a look at, this could be related to latest thermal changes. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User sboyce@blueyonder.co.uk added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c3 --- Comment #3 from Sid Boyce <sboyce@blueyonder.co.uk> 2008-05-07 06:26:02 MST --- Opened new bug #387702 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User j.reitsma@hccnet.nl added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c4 --- Comment #4 from Jogchum Reitsma <j.reitsma@hccnet.nl> 2008-05-07 10:29:06 MST --- Hi Thomas, lsmod | grep thermal thermal 39452 0 processor 68400 2 thermal Should I remove these two? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User trenn@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c5 --- Comment #5 from Thomas Renninger <trenn@novell.com> 2008-05-08 06:23:42 MST --- Yes, pls. To be honest this is still digging a bit in the dark. There are currently a lot bug reports that machines are running hotter than they did in previous kernel versions. As thermal management changed (and throttling maybe it's this) removing these could help and we have a first pointer where to look at detailed. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User j.reitsma@hccnet.nl added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c6 --- Comment #6 from Jogchum Reitsma <j.reitsma@hccnet.nl> 2008-05-08 08:13:18 MST --- I can unload the module thermal: lsmod | grep processor processor 68400 2 thermal linux:/home/jogchum # rmmod thermal linux:/home/jogchum # rmmod processor ERROR: Module processor is in use linux:/home/jogchum # The proc is 70 degr. C now, I'll leave the system up to see what temp does. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User j.reitsma@hccnet.nl added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c7 --- Comment #7 from Jogchum Reitsma <j.reitsma@hccnet.nl> 2008-05-08 10:41:24 MST --- I've left the system for a few hours now. gkrellm doesn't show the temp since I unloaded he thermal module. I loaded it again, and started gkrellm immediately after that: it shows a temperature of 83 degr C. So the thermal module seems to have no influence. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User rui.zhang@intel.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c8 --- Comment #8 from Rui Zhang <rui.zhang@intel.com> 2008-05-08 19:43:45 MST --- what's the kernel version of Suse 11.0b3? Maybe coretemp is reading the wrong value? Please refer to this thread http://marc.info/?l=linux-acpi&m=120856995800552&w=2 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User j.reitsma@hccnet.nl added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c9 --- Comment #9 from Jogchum Reitsma <j.reitsma@hccnet.nl> 2008-05-09 00:27:07 MST --- Kernel version is uname -a Linux linux 2.6.25-26-pae #1 SMP 2008-04-30 07:56:05 +0200 i686 i686 i386 GNU/Linux Apart from the results of the temp. *reading*, one can also hear the fan speed going up to it max, and besides that the temperature of the body of the laptop is at one place so hot that one can barely keep a hand on it. So I am somewhat in doubt if wrong readings are the case here, though the kernel is indeed in the 2.6.25 range. The tread you give states also the possibility that the old readings are wrong, and the new one correct... Thanks for thinking with us. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User sboyce@blueyonder.co.uk added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c10 --- Comment #10 from Sid Boyce <sboyce@blueyonder.co.uk> 2008-05-09 00:34:30 MST --- I shall have to see if I can find out what kernel was used from the first 11.0 Alpha as that was when the problem first occurred on the 1501LCe. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User sboyce@blueyonder.co.uk added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c11 --- Comment #11 from Sid Boyce <sboyce@blueyonder.co.uk> 2008-05-09 00:39:02 MST --- If I boot the 1501LCe from a 11.0 CD/DVD, immediately the fan speed goes to max. It will get as far as partitioning or software selection during an install, then power off. Using a 10.3 CD/DVD, it completes the install. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User rui.zhang@intel.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c12 --- Comment #12 from Rui Zhang <rui.zhang@intel.com> 2008-05-09 01:31:57 MST --- Please attach the dmesg output. Please attach the kernel config file Please attach the acpidump output using the latest pmtools here: http://www.kernel.org/pub/linux/kernel/people/lenb/acpi/utils/ -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User j.reitsma@hccnet.nl added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c13 --- Comment #13 from Jogchum Reitsma <j.reitsma@hccnet.nl> 2008-05-09 04:11:34 MST --- Created an attachment (id=213856) --> (https://bugzilla.novell.com/attachment.cgi?id=213856) acpidump.txt The pmtools that come with opensuse 11.0 beta 2 *is* the latest to be found at the link gives, so I took this (20071116-16). -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User j.reitsma@hccnet.nl added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c14 --- Comment #14 from Jogchum Reitsma <j.reitsma@hccnet.nl> 2008-05-09 04:12:11 MST --- Created an attachment (id=213857) --> (https://bugzilla.novell.com/attachment.cgi?id=213857) Output of dmesg -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User j.reitsma@hccnet.nl added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c15 --- Comment #15 from Jogchum Reitsma <j.reitsma@hccnet.nl> 2008-05-09 04:20:43 MST --- I've added dmesg and acpidump output. Where do I find the kernel config file? (I didn't compile this kernel myself). It's not in /usr/src/linux-2.6.25-26-obj -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User j.reitsma@hccnet.nl added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c16 --- Comment #16 from Jogchum Reitsma <j.reitsma@hccnet.nl> 2008-05-16 09:49:03 MST --- Upgraded to 11.0 beta 3, and the problem has gone! -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User trenn@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c17 Thomas Renninger <trenn@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |RESOLVED Resolution| |INVALID --- Comment #17 from Thomas Renninger <trenn@novell.com> 2008-05-19 04:02:59 MST --- ?!? This is strange..., I am not aware of any changes for 2.6.25 in this area. You may want to watch this for a while and reopen if you see the machine running at higher temperature again. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User j.reitsma@hccnet.nl added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c18 --- Comment #18 from Jogchum Reitsma <j.reitsma@hccnet.nl> 2008-05-19 11:11:51 MST --- Yes, I had warned my wife that I didn't expect any change in this, since I didn't see a resolving comment on this bug. But the change in temp reported and fanspeed is really significant, (temp dropping from some 75 deg Celsius to some 40 with only the desktop running). This improvement is persistant so far. I'll test RC of course, and will reopen indeed if the problem re-occurs. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User trenn@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c19 Thomas Renninger <trenn@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |dmueller@novell.com --- Comment #19 from Thomas Renninger <trenn@novell.com> 2008-05-20 10:07:41 MST --- This probably was because of bug #390729. A 3D feature was switched off, which I expect let the GPU run really hot (not proved, just guessing). -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User funtasyspace@yahoo.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c20 Jörg Hermsdorf <funtasyspace@yahoo.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |funtasyspace@yahoo.com --- Comment #20 from Jörg Hermsdorf <funtasyspace@yahoo.com> 2008-05-22 11:28:19 MST --- I experience the same on my Lenovo ThinkPad T60p and openSUSE 11.0 Beta3. My notebook gets really hot. Is this already fixed in post-Beta3 FACTORY? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User trenn@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c21 Thomas Renninger <trenn@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |gp@novell.com, aj@suse.de --- Comment #21 from Thomas Renninger <trenn@novell.com> 2008-05-23 05:59:53 MST --- I wonder whether this still happens when you enable compiz and 3D desktop things. Dirk, can you explain what to switch on, what has been switched off by default recently, pls. It's not yet verified yet whether the improvements Jogchum sees are coming from 3D desktop or xorg issues. This would be very interesting to know. Adding aj and Gerald, they also had/have hot ThinkPads. Aj could stop critical shutdowns by cleaning the fans, but the root cause probably was increased temperature with 11.0. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User funtasyspace@yahoo.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c22 --- Comment #22 from Jörg Hermsdorf <funtasyspace@yahoo.com> 2008-05-24 03:05:58 MDT --- I just want to add, that I am using the radeonhd driver on my ThinkPad and therefore I have no compiz or other 3D stuff enabled. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User trenn@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c23 --- Comment #23 from Thomas Renninger <trenn@novell.com> 2008-05-24 15:42:35 MDT --- I also have a radeonhd driver and I opened a bug because desktop switching was rather slow and the solution should be compiz disabling (still need to test this, I have new packages, but configuration probably was kept). It would make a lot sense, especially if 3D acceleration does not work as expected, that such functionality makes the GPU run very hot. See bug #390729 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User tom-osbug@tautologysolutions.ca added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c24 Tom Hui <tom-osbug@tautologysolutions.ca> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |tom-osbug@tautologysolutions.ca --- Comment #24 from Tom Hui <tom-osbug@tautologysolutions.ca> 2008-05-28 12:51:28 MDT --- I'm also experiencing very high cpu temperature with my HP ZD7000 laptop (Intel Pentium 4 3.2 GHz with hyper-threading, Nvidia GeForce FX Go5600). The laptop was upgraded from 10.3 to 11.0 beta 3. I normally boot to runlevel 3 (non-graphical). If I just let the laptop sit there for about 5 minutes, the fans starts to kick into overdrive and you can feel that the air coming out is very hot. When using 10.2 the CPU would be running at around 37 degrees C, with 11.0 beta 3 it is doing about 59 degrees C when idle. Checking CPU usage with 'top' shows that the machine is 99% idle. One observation is that the heat problem goes away if I boot with "acpi=off" or "acpi=ht" however, you don't get any power management. so, I'm not so sure that this is resolved. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User trenn@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c25 Thomas Renninger <trenn@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|RESOLVED |REOPENED Resolution|INVALID | Summary|Processor is overheated |11.0 shows much higher power consumpition than | |10.3 --- Comment #25 from Thomas Renninger <trenn@novell.com> 2008-05-29 06:10:02 MDT --- No it is probably not solved. Dirk also realized higher fan activity on his machine. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User trenn@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c26 Thomas Renninger <trenn@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |pavel@novell.com --- Comment #26 from Thomas Renninger <trenn@novell.com> 2008-05-29 06:12:28 MDT --- I had a short look at Dirk's machine, but I could not see anything obvious. frequency is lowered, C-states are used (according to powertop). I expect it could be the new cpuidle or the new thermal management implementations, but this is guessing. I expect kernel debugging is necessary to come further, I couldn't find anything in userspace. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User pavel@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c28 --- Comment #28 from Pavel Machek <pavel@novell.com> 2008-05-29 12:01:33 MDT --- Can you try to measure power consumption, not temperature (as temperature also depends on cooling strategy etc?) in opensuse10.3 and 11.0? Measure it in init=/bin/bash mode; either you can attach a wattmeter, or you can use powertop... but that needs lots of averaging as the acpi battery meter is probably not too accurate. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User trenn@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c29 Thomas Renninger <trenn@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |fdg@novell.com --- Comment #29 from Thomas Renninger <trenn@novell.com> 2008-05-30 04:00:34 MDT --- In the architecture team (ask Frank Doege) or mobile team there is a power meter, easy to attach. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User j.reitsma@hccnet.nl added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c30 --- Comment #30 from Jogchum Reitsma <j.reitsma@hccnet.nl> 2008-05-30 13:14:25 MDT --- Some further remarks: After installing 11.0 b 3: - although the reported temp and fanspeed lowered significantly, when doing some machine intensive work (lots of demanding sites open in FF, doing some thousand package updates trough "zypper ref; zypper dup" causes the machine to halt. Rebooting immediately doesn't even give the BIOS flash screen. Rebooting after some half an hour lets the machine start up normally, so I think it's overheating too. But after getting kernel 2.6.25.4-8-pae #1 SMP 2008-05-26 15:23:05 +0200 i686 i686 i386 GNU/Linux temp with a few FF win's open stay as low as 44 deg Celsius; (btw, room temp is even higher then with comment #16) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User funtasyspace@yahoo.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c31 --- Comment #31 from Jörg Hermsdorf <funtasyspace@yahoo.com> 2008-06-03 08:48:38 MDT --- My Lenovo ThinkPad T60p still heats up to 92°C after 5 Minutes of KDE 4.0 running. If I switch to runlevel 3, I think it cools down, but I don't know how to read out the CPU temperature on the console. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User jnelson-suse@jamponi.net added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c32 Jon Nelson <jnelson-suse@jamponi.net> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |jnelson-suse@jamponi.net --- Comment #32 from Jon Nelson <jnelson-suse@jamponi.net> 2008-07-11 08:50:09 MDT --- I have a Thinkpad T61p. With 10.3 I'd hover around 36C most of the time. Since installing 11.0, however, with no other changes, I'm seeing 49C. That's a substantial difference! As part as Acer laptops go, go to the acer europe website and get the latest BIOS. Acer has recently released a number of BIOS updates that made a difference for my friend's 7720. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User pavel@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c33 --- Comment #33 from Pavel Machek <pavel@novell.com> 2008-07-14 00:04:51 MDT --- Temperature subsystem had some significant changes between 10.3 and 11. If you want to claim that power consumption risen, please measure power consumption, not temperature. Crying "me too" does not really help. Measure power consumption under 10.3 and 11.0 and compare. With /proc/acpi/battery/*/* it should be rather easy, and it is probably enough to install both kernels side-by-side. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User tom-osbug@tautologysolutions.ca added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c34 --- Comment #34 from Tom Hui <tom-osbug@tautologysolutions.ca> 2008-07-16 20:24:08 MDT --- I don't think that any of the original reporters/commenters on this bug had ever claimed that is was a power consumption problem. Everyone was experiencing extremely high CPU temp or overheat and fans running continuously at full speed. I believe it was a Novell person that change the summary to reference power consumption. Here is the /proc/acpi/battery/state information from 10.3 and 11.0. I booted the system to runlevel 1 and let it sit idle for about 5 minutes before taking the readings. openSUSE 10.3 cat /proc/acpi/thermal_zone/THRM/temperature temperature: 34 C cat /proc/acpi/battery/BAT1/state present: yes capacity state: ok charging state: discharging present rate: 2336 mA remaining capacity: 1888 mAh present voltage: 15968 mV Power = 2.336A * 15.968V = 37.301 Watts openSUSE 11.0 cat /proc/acpi/thermal_zone/THRM/temperature temperature: 58 C cat /proc/acpi/battery/BAT1/state present: yes capacity state: ok charging state: discharging present rate: 6080 mA remaining capacity: 1760 mAh present voltage: 14624 mV Power = 6.080A * 14.624V = 88.914 Watts I don't know how accurate the information from /proc/acpi/battery/BAT1/state is but if I'm interpreting the information correctly this does show a significant difference in power consumption. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User trenn@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c35 Thomas Renninger <trenn@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|REOPENED |ASSIGNED --- Comment #35 from Thomas Renninger <trenn@novell.com> 2008-07-17 02:45:51 MDT --- Tom Hui: thanks a lot for the detailed research.
From the huge difference, I expect we have a CPU problem here. Possibly graphics cards, but I very much expect the CPU.
The first report states a Pentium 4 in the description, so it cannot be cpufreq. It looks like the processor sleep states (C-states) are not entered correctly. The Pentium 4 probably has no deeper sleep states (only C1), it could be that something is still spinning instead of entering the idle function, like if idle=poll is passed? Can you do: echo 1 > /proc/sys/kernel/sysrq Then push the SysRq button (near Scroll/Lock button) and also "P" Do this when the machine is idle. You should then see a backtrace at the end of: dmesg |less It should look like: ----------------- RIP: 0010:[<ffffffff80221970>] [<ffffffff80221970>] native_safe_halt+0x6/0x8 .. Call Trace: Inexact backtrace: [<ffffffff8020b115>] ? default_idle+0x43/0x78 [<ffffffff8020b0d2>] ? default_idle+0x0/0x78 [<ffffffff8020b08a>] ? cpu_idle+0x92/0xda [<ffffffff804447fc>] ? start_secondary+0x408/0x417 ----------------- Important is the RIP line and the backtrace (like above). If you are not in the default_idle function, try again some more times when no process is running. Can you attach the stripped output like above. The C-states statistics are exported to userspace via: /proc/acpi/processor/*/power Please also run powertop and tell us if you see any noticable message/warning. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User funtasyspace@yahoo.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c36 --- Comment #36 from Jörg Hermsdorf <funtasyspace@yahoo.com> 2008-07-17 04:07:41 MDT ---
Everyone was experiencing extremely high CPU temp or overheat and fans running > continuously at full speed. Yes, the fan of my ThinkPad T60p became a victim of this bug :( I had to send it in to get it repaired.
-- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User tom-osbug@tautologysolutions.ca added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c37 --- Comment #37 from Tom Hui <tom-osbug@tautologysolutions.ca> 2008-07-18 01:08:12 MDT --- Yes, I'm running on an HP zd7000 with a Pentium 4 HT. Ok, here is the C-state and Magic SysRq+P info from 10.3 and 11.0. I think you are right about the C-state not being entered correctly. openSUSE 10.3 ALT-SYSRQ-P EIP is at native_safe_halt+0x2/0x3 ... [<f9083dd6>] acpi_safe_halt+0x1c/0x29 [processor] [<f9083fd4>] acpi_processor_idle+0x184/0x400 [processor] [<f9083e50>] acpi_processor_idle+0x0/0x400 [processor] [<f9083e50>] acpi_processor_idle+0x0/0x400 [processor] [<c01033af>] cpu_idle+0xaa/0xcb [<c038e9a4>] start_kernel+0x352/0x35a [<c038e17e>] unknown_bootoption+0x0/0x216 cat /proc/acpi/processor/CPU0/power active state: C1 max_cstate: C8 bus master activity: 00200001 maximum allowed latency: 8000 usec states: *C1: type[C1] promotion[C3] demotion[--] latency[000] usage[00052482] duration[00000000000000000000] C2: <not supported> C3: type[C3] promotion[--] demotion[C1] latency[085] usage[00000000] duration[00000000000000000000] cat /proc/acpi/processor/CPU1/power active state: C0 max_cstate: C8 bus master activity: 00000000 maximum allowed latency: 8000 usec states: openSUSE 11.0 ALT-SYSRQ-P EIP is at native_safe_halt+0x5/0x7 ... [<f8838233>] acpi_idle_enter_c1+0xf2/0x16e [processor] [<c026f9e2>] cpuidle_idle_call+0x62/0x92 [<c0104a39>] cpu_idle+0xa0/0xc0 [<c02d5859>] rest_init+0x49/0x4b [<c0485874>] start_kernel+0x325/0x32d cat /proc/acpi/processor/CPU0/power active state: C0 max_cstate: C8 bus master activity: 00000000 maximum allowed latency: 2000000000 usec states: C1: type[C1] promotion[--] demotion[--] latency[000] usage[00022722] duration[00000000000000000000] C2: <not supported> C3: type[C3] promotion[--] demotion[--] latency[085] usage[00000000] duration[00000000000000000000] cat /proc/acpi/processor/CPU1/power active state: C0 max_cstate: C8 bus master activity: 00000000 maximum allowed latency: 2000000000 usec states: I ran "powertop" but did not see any noticable messages or warning. The only thing that seems unusual is that every so often ACPI generated a large number of interrupts. Wakeups-from-idle per second : 18.9 interval: 10.0s no ACPI power usage estimate available Top causes for wakeups: 81.8% (131.4) <interrupt> : acpi 5.6% ( 9.0) xfsaild : schedule_timeout (process_timeout) 4.7% ( 7.5) xfsbufd : schedule_timeout (process_timeout) 3.1% ( 5.0) <kernel core> : fbcon_add_cursor_timer (cursor_timer_handler) 2.6% ( 4.1) <kernel module> : usb_hcd_poll_rh_status (rh_timer_func) 1.2% ( 2.0) <kernel core> : clocksource_register (clocksource_watchdog) 0.6% ( 1.0) <kernel core> : queue_delayed_work_on (delayed_work_timer_fn) 0.1% ( 0.2) init : schedule_timeout (process_timeout) 0.1% ( 0.1) <kernel IPI> : function call interrupts 0.1% ( 0.1) <kernel core> : init_nonfatal_mce_checker (delayed_work_timer_fn) 0.1% ( 0.1) xfssyncd : schedule_timeout (process_timeout) 0.1% ( 0.1) <kernel core> : ip_rt_init (delayed_work_timer_fn) 0.1% ( 0.1) <kernel core> : neigh_table_init_no_netlink (neigh_periodic_timer) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User trenn@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c38 Thomas Renninger <trenn@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |astarikovskiy@novell.com, | |venkatesh.pallipadi@intel.com --- Comment #38 from Thomas Renninger <trenn@novell.com> 2008-07-18 05:15:23 MDT ---
The only thing that seems unusual is that every so often ACPI generated a large number of interrupts. Yes, this is definetly wronng. Do you see kacpid consume a lot cpu time in top? This could be related to bug #401740. Hmm, but there it only happens after suspend. ACPI daemone running in a loop can also be something else. do (when this happens): echo 0x1F >/sys/module/acpi/parameters/debug_level;sleep 1;echo 0x3 /sys/module/acpi/parameters/debug_level (Be careful, not C&Ped, path might not be 100% correct). and send dmesg output
-- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User tom-osbug@tautologysolutions.ca added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c39 --- Comment #39 from Tom Hui <tom-osbug@tautologysolutions.ca> 2008-07-18 21:39:14 MDT --- Created an attachment (id=228892) --> (https://bugzilla.novell.com/attachment.cgi?id=228892) dmesg output with acpi debug_level set to 0x1f I've been doing all my tests from runlevel 1. So I kacpid should not be running. "top" shows the system is around 99.8% idle. I have attached the "dmesg" output that was requested. I did the "echo 0x1F
/sys/module/acpi/parameters/debug_level" and ran "powertop". Then just waited for the ACPI interrupts to show a spike in the Top causes for wakeups section. When that happened, I quit "powertop" and captured the "dmesg" output.
I don't know much about ACPI so the output is pretty meaningless to me. Hopefully it will be helpful to you guys. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User trenn@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c40 --- Comment #40 from Thomas Renninger <trenn@novell.com> 2008-07-19 02:04:51 MDT --- Pls attach acpidump.
Then just waited for the ACPI interrupts to show a spike How often does that happen? Do the interrupts also calm down for a while, do you have to wait for something that triggers the irq storm? You can also use: watch -n1 cat /proc/interrupts The ACPI interrupts should increase rapidly? Can you give an estimation how much there are per second by e.g. cat /proc/interrupts;sleep 10; cat /proc/interrupts
Does it help if you unload or better not load the battery module? Best copy away battery.ko temporarily: mv /lib/modules/`uname -r`/kernel/drivers/acpi/battery.ko /lib/modules reboot. Do not forget to move the battery.ko driver back when you finished tests(especially if it is not the problem or you will miss battery status): mv /lib/modules/battery.ko /lib/modules/`uname -r`/kernel/drivers/acpi/battery.ko I wonder if this is really related to battery or whether we still do not see what kind of ACPI interrupts/GPEs are processed. If the problem persists, maybe you can again increase acpi debug level, but this time set to 0x21f instead 0x1f (with battery still unloaded). Better copy away the relvant parts from /var/log/messages now, then we have the time how often things happen. You may want to do: logger XXXXXXXXXX before the commands you increase acpi_debug level. Then you can search /var/log/messages for it and know when you started debugging. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User jnelson-suse@jamponi.net added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c41 --- Comment #41 from Jon Nelson <jnelson-suse@jamponi.net> 2008-07-19 08:20:29 MDT --- I generated some output with a command like this: cat /proc/acpi/battery/BAT0/* /proc/acpi/thermal_zone/THM*/* /proc/acpi/processor/*/* /proc/acpi/ibm/* | tee 2.6.25.9 for each of 2.6.22.18 and 2.6.25.9. Both in single-user mode as the 2.6.22.18 kernel and openSUSE 11.0 don't play nice together. This is on a Thinkpad T61p. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User jnelson-suse@jamponi.net added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c42 --- Comment #42 from Jon Nelson <jnelson-suse@jamponi.net> 2008-07-19 08:21:08 MDT --- Created an attachment (id=228912) --> (https://bugzilla.novell.com/attachment.cgi?id=228912) 2.6.22.18 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User jnelson-suse@jamponi.net added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c43 --- Comment #43 from Jon Nelson <jnelson-suse@jamponi.net> 2008-07-19 08:21:23 MDT --- Created an attachment (id=228913) --> (https://bugzilla.novell.com/attachment.cgi?id=228913) 2.6.25.9 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User jnelson-suse@jamponi.net added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c44 --- Comment #44 from Jon Nelson <jnelson-suse@jamponi.net> 2008-07-19 09:16:51 MDT --- Created an attachment (id=228916) --> (https://bugzilla.novell.com/attachment.cgi?id=228916) 2.6.25.11-SL110_BRANCH_20080717102407-default -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User trenn@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c45 --- Comment #45 from Thomas Renninger <trenn@novell.com> 2008-07-19 17:45:45 MDT --- Ok we have several causes for power regressions: - Some Dells after suspend to disk/ram: [Bug 410612] kacpid goes wild after s2disk which might come out as a duplicate of: [Bug 401740] kacpi* eat a lot of cpu after s2disk and can hopefully be fixed soon - [Bug 377538] Regress in power management: Which should be totally unrelated. There seem to be USB and keyboard interrupts going wild. - Some(one?) mysterious ones..., e.g. Jon has working C3 state, AFAIK Dirk too. At least I had had a look at that when I touched the machine, but I couldn't see anything obvious. - [Bug 404245] Acer laptop powers off due to overheating Thermal polling missing by default -> therefore overheating. Might still also have another reason. Jon, I expect yours is special, sorry. But Hui's might be related to the very first two bugreports. In this case it is not related to the battery. Hmm, but it's not a Dell, maybe still something else. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User trenn@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c46 --- Comment #46 from Thomas Renninger <trenn@novell.com> 2008-07-19 18:10:26 MDT --- That we do not override polling frequency provided by BIOS anymore could help some (also see bug 404245). You may want to try: for x in /proc/acpi/thermal_zone/*/polling_frequency;do echo 5 > $x done -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=387207 User trenn@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=387207#c47 Thomas Renninger <trenn@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |RESOLVED Resolution| |WORKSFORME --- Comment #47 from Thomas Renninger <trenn@novell.com> 2009-01-30 08:13:38 MST --- This bug is too long and confusing (there were several unrelated issues concerning this topic). I am closing this one now. If you should still have problems with 11.1, please open a new bug, copy or add additional most relevant descriptions and assign it to me. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@novell.com