[opensuse] Considering 11.4 again but first ..........
I update to 11.4 back in march but had to take all my systems back to 11.3, 11.4 is not usable, for me, due to 2 key problems I ran into. First one is in the following bug posts https://bugzilla.novell.com/show_bug.cgi?id=665720 and https://bugzilla.novell.com/show_bug.cgi?id=680297. There appears to be no fix yet, but curious if anyone knows, how stable the work around using the RUN_PARALLEL="no" is. Second issue: amd cpu based desktop locking up at random with no error messages. The only symptom is there are a lot of [900.000017] [Hardware Error]: No human readable MCE decoding support on this cpu type errors in syslog and the numerical number is different each time. I used the nomce option which does stop the error messages, but the amd cpu based desktop is still unstable, with random lock ups. I like to upgrade 11.4, but for now, 11.4 is not usable, so I stay on 11.3 Thanks, Tony -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
Tony wrote:
Second issue: amd cpu based desktop locking up at random with no error messages. The only symptom is there are a lot of [900.000017] [Hardware Error]: No human readable MCE decoding support on this cpu type errors in syslog and the numerical number is different each time. I used the nomce option which does stop the error messages, but the amd cpu based desktop is still unstable, with random lock ups.
MCE = Machine Check Error. Read: your CPU is probably failing. -- Per Jessen, Zürich (17.9°C) -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
On 04/06/2011 12:51 PM, Per Jessen wrote:
Tony wrote:
Second issue: amd cpu based desktop locking up at random with no error messages. The only symptom is there are a lot of [900.000017] [Hardware Error]: No human readable MCE decoding support on this cpu type errors in syslog and the numerical number is different each time. I used the nomce option which does stop the error messages, but the amd cpu based desktop is still unstable, with random lock ups. MCE = Machine Check Error. Read: your CPU is probably failing.
hmm perhaps but runs just fine on 11.3 or any other os, I've used on this box prior to 11.4 update, so thinking that's not it. This happens only with 11.4, so far. Thanks -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
On 04/06/2011 05:49 PM, Tony wrote:
On 04/06/2011 12:51 PM, Per Jessen wrote:
Tony wrote:
Second issue: amd cpu based desktop locking up at random with no error messages. The only symptom is there are a lot of [900.000017] [Hardware Error]: No human readable MCE decoding support on this cpu type errors in syslog and the numerical number is different each time. I used the nomce option which does stop the error messages, but the amd cpu based desktop is still unstable, with random lock ups. MCE = Machine Check Error. Read: your CPU is probably failing.
hmm perhaps but runs just fine on 11.3 or any other os, I've used on this box prior to 11.4 update, so thinking that's not it. This happens only with 11.4, so far.
Thanks
MCE's can be tricky things to troubleshoot. 'Generally' (overwhelming majority of the time) a MCE is hardware, a hardware/firmware message that is logged by software. However "something has to throw the exception". I ran into a really strange problem with the nvidia driver on openSuSE causing (or at least involved in) an MCE: http://lists.opensuse.org/opensuse/2008-04/msg00392.html I still run the same nvidia card, so I never found out what the actual issue was. It was on a Tyan Tomcat 8KE board with and Opeteron 180 proc running x86_64. Best thing you can do is try swapping hardware and seeing if the mce goes away. As far as 11.4 goes, I'm running it on x86_64 and I have been very, very pleased with the 11.4 release. It is by far one of the best since 11.0. I don't know 11.3, but the default kernel with 11.3 sucked for me. I still have 11.3 running on a box with a 2.6.35 kernel and I have no complaint. As for 11.4 with the 2.6.37 kernel -- it has been rock solid on a very finicky Toshiba laptop with Atheros wireless and ATI graphics. (I did have an Xorg issue that is currently being fixed: https://bugzilla.novell.com/show_bug.cgi?id=678264) Whatever the issue, it looks like your box is tripping on something that may be one of the features that has been moved into the kernel since 2.6.34 (11.3 default) There have been a *lot* of changes between 2.6.34 and 2.6.37. It may just be one of those issue "where all the stars line up on your box" and you have hit a bug in the code somewhere. Because, honestly, all things considered, 11.4 has been the best release openSuSE has had in some time. (no flames, just my humble learned opinion :) -- David C. Rankin, J.D.,P.E. -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
Tony wrote:
On 04/06/2011 12:51 PM, Per Jessen wrote:
Tony wrote:
Second issue: amd cpu based desktop locking up at random with no error messages. The only symptom is there are a lot of [900.000017] [Hardware Error]: No human readable MCE decoding support on this cpu type errors in syslog and the numerical number is different each time. I used the nomce option which does stop the error messages, but the amd cpu based desktop is still unstable, with random lock ups.
MCE = Machine Check Error. Read: your CPU is probably failing.
hmm perhaps but runs just fine on 11.3 or any other os, I've used on this box prior to 11.4 update, so thinking that's not it. This happens only with 11.4, so far.
Maybe it wasn't being reported correctly in 11.3. I think the two indications are quite clear: random lock ups + machine checks. However, it is possible that the messages stem from a corrected error, in which case they shouldn't be logged at all. Maybe try installing mcelog, that might produce more useful messages. -- Per Jessen, Zürich (10.8°C) -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
-----Original Message----- From: Tony [mailto:tonys@technologist.com] Sent: Wednesday, April 06, 2011 10:46 AM To: opensuse@opensuse.org Subject: [opensuse] Considering 11.4 again but first ..........
I update to 11.4 back in march but had to take all my systems back to 11.3, 11.4 is not usable, for me, due to 2 key problems I ran into.
First one is in the following bug posts https://bugzilla.novell.com/show_bug.cgi?id=665720 and https://bugzilla.novell.com/show_bug.cgi?id=680297. There appears to be no fix yet, but curious if anyone knows, how stable the work around using the RUN_PARALLEL="no" is.
Second issue: amd cpu based desktop locking up at random with no error messages. The only symptom is there are a lot of [900.000017] [Hardware Error]: No human readable MCE decoding support on this cpu type errors in syslog and the numerical number is different each time. I used the nomce option which does stop the error messages, but the amd cpu based desktop is still unstable, with random lock ups.
I like to upgrade 11.4, but for now, 11.4 is not usable, so I stay on 11.3
Thanks,
Tony -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
Which version did you try? I was having similar issues getting 11.4 x86 to work on an Atom 230, but switching over to x64 cured my problems. Ken -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
On 04/06/2011 07:14 PM, Ken Gramm wrote:
-----Original Message----- From: Tony [mailto:tonys@technologist.com] Sent: Wednesday, April 06, 2011 10:46 AM To: opensuse@opensuse.org Subject: [opensuse] Considering 11.4 again but first ..........
I update to 11.4 back in march but had to take all my systems back to 11.3, 11.4 is not usable, for me, due to 2 key problems I ran into.
First one is in the following bug posts https://bugzilla.novell.com/show_bug.cgi?id=665720 and https://bugzilla.novell.com/show_bug.cgi?id=680297. There appears to be no fix yet, but curious if anyone knows, how stable the work around using the RUN_PARALLEL="no" is.
Second issue: amd cpu based desktop locking up at random with no error messages. The only symptom is there are a lot of [900.000017] [Hardware Error]: No human readable MCE decoding support on this cpu type errors in syslog and the numerical number is different each time. I used the nomce option which does stop the error messages, but the amd cpu based desktop is still unstable, with random lock ups.
I like to upgrade 11.4, but for now, 11.4 is not usable, so I stay on 11.3
Thanks,
Tony -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
Which version did you try? I was having similar issues getting 11.4 x86 to work on an Atom 230, but switching over to x64 cured my problems.
Ken Hmm, 32 bit, on both systems, I might try 64bit see what happens.
Tony -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
On 4/6/2011 10:46 AM, Tony wrote:
I update to 11.4 back in march but had to take all my systems back to 11.3, 11.4 is not usable, for me, due to 2 key problems I ran into.
First one is in the following bug posts https://bugzilla.novell.com/show_bug.cgi?id=665720 and https://bugzilla.novell.com/show_bug.cgi?id=680297. There appears to be no fix yet, but curious if anyone knows, how stable the work around using the RUN_PARALLEL="no" is.
Second issue: amd cpu based desktop locking up at random with no error messages. The only symptom is there are a lot of [900.000017] [Hardware Error]: No human readable MCE decoding support on this cpu type errors in syslog and the numerical number is different each time. I used the nomce option which does stop the error messages, but the amd cpu based desktop is still unstable, with random lock ups.
I like to upgrade 11.4, but for now, 11.4 is not usable, so I stay on 11.3
Thanks,
Tony
I encountered this problem on my 32 bit install (11.4) on an athlon based box. Booting with the "nomce" option eliminated or suppressed the problem. You can make this more permanent by turning MCE reporting off in Yast-System Settings. Turned out my problem was simply a bad CD in my CD drive. I removed it and the problem has not returned. See previous thread here http://lists.opensuse.org/opensuse/2011-03/msg00398.html Jim F -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
participants (5)
-
David C. Rankin
-
Jim Flanasgan
-
Ken Gramm
-
Per Jessen
-
Tony