http://bugzilla.novell.com/show_bug.cgi?id=533556
User trenn@novell.com added comment
http://bugzilla.novell.com/show_bug.cgi?id=533556#c37
Thomas Renninger changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|NEW |ASSIGNED
--- Comment #37 from Thomas Renninger 2009-10-19 12:40:41 MDT ---
I tried with 11.1 (SLE11) and I also could see the above threshold messages,
but not this kind of mce storm.
I expect the high temperature is real.
You can monitor this nicely by:
a) Put the cores under load and start several of these processes:
cat /dev/zero >/dev/null &
b) Monitor temperature:
- Install sensors package, run:
sensors-detect (always confirm),
- Start lm-sensors service:
rclm-sensors start
- and monitor temperature by:
watch -n1 sensors
The temperature should now raise slowly until the critical temperature is
reached. You should see the same with older SUSE versions.
The bug seem to be that the hysteresis changed, the mce and speed
limitation/throttling and unthrottling seem to toggle quickly all the time,
resulting in a hard machine freeze (no display, machine can still be pinged,
but ssh login does not work anymore, keyboard is dead). I added the machine to
a serial console over night, hopefully I can catch something.
--
Configure bugmail: http://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.