[Bug 788195] New: quad socket romley (E5-4600 series CPUs) systems will not reboot/power off correctly - system hangs
https://bugzilla.novell.com/show_bug.cgi?id=788195 https://bugzilla.novell.com/show_bug.cgi?id=788195#c0 Summary: quad socket romley (E5-4600 series CPUs) systems will not reboot/power off correctly - system hangs Classification: openSUSE Product: openSUSE 12.2 Version: Final Platform: x86-64 OS/Version: openSUSE 12.2 Status: NEW Severity: Critical Priority: P5 - None Component: Basesystem AssignedTo: bnc-team-screening@forge.provo.novell.com ReportedBy: rick@microway.com QAContact: qa-bugs@suse.de Found By: --- Blocker: --- User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:16.0) Gecko/20100101 Firefox/16.0 I have tried openSuse 12.2 on 3 different quad socket Romley platforms - 1 from Asus, another from Supermicro, and another from Quanta. All 3 systems have problems when rebooting or powering off with openSuse 12.2. The typical behavior I am seeing is a hang at the POST. If you reboot the system, it will hang with a blank screen at the POST. If you power the system off, when you power it back on it will either hang at the POST, or hang under Linux during the boot before you can type anything. If you perform a regular power off, then unplug the AC power, then plug it back in and turn it back on it works fine. Once it has hung at the POST, or under Linux during the boot, holding in the power button to turn it back off, and then turning it back on works fine. The Quanta system generates an entry in the IPMI log every time this happens saying "IERR Asserted". Based on some reading I've done this indicates an internal error in the CPU. Is openSuse 12.2 somehow leaving flags set incorrectly on the CPU for shutdown? CentOS 6.3 and Gentoo w/ a 3.5.3 kernel both work fine on this hardware. Reproducible: Always Steps to Reproduce: 1. reboot a quad socket Romley system, or power it off and then back on 2. It will hang at the POST 3. Actual Results: System hangs at the POST, or sometimes hangs early on in the Linux boot process. Most of the time it's a POST hang. Expected Results: Correct powering off / on / rebooting. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=788195 https://bugzilla.novell.com/show_bug.cgi?id=788195#c Jiaying ren <jren@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |jren@novell.com AssignedTo|bnc-team-screening@forge.pr |jsrain@suse.com |ovo.novell.com | -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=788195 https://bugzilla.novell.com/show_bug.cgi?id=788195#c Jiri Srain <jsrain@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Component|Basesystem |Kernel AssignedTo|jsrain@suse.com |kernel-maintainers@forge.pr | |ovo.novell.com -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=788195 https://bugzilla.novell.com/show_bug.cgi?id=788195#c1 --- Comment #1 from Richard Warner <rick@microway.com> 2012-11-20 02:42:42 UTC --- Hi, Are there any updates on this? Is there any other information I can provide to help? Should I try getting BIOS engineers from any of the 3 companies involved in this? Thanks -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=788195 https://bugzilla.novell.com/show_bug.cgi?id=788195#c Richard Warner <rick@microway.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Priority|P5 - None |P1 - Urgent -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=788195 https://bugzilla.novell.com/show_bug.cgi?id=788195#c2 --- Comment #2 from Richard Warner <rick@microway.com> 2012-12-19 21:56:03 UTC --- Any updates on this? It has been over a month since I posted this critical bug report and no one has responded at all. Does anyone even care that this doesn't work? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=788195 https://bugzilla.novell.com/show_bug.cgi?id=788195#c3 --- Comment #3 from Richard Warner <rick@microway.com> 2012-12-20 19:57:19 UTC --- This problem has now occurred on a single socket Xeon board (supermicro X9SRE) with an E5-2650 processor as well. I have identified the cause of this problem. It is the mei module. If I disable the mei module (added 'blacklist mei' and 'install mei /bin/echo "skipping mei module"' to 50-blacklist.conf) the problem goes away. This has been tested on the X9SRE, an Intel quad romley system (lizard head pass), and a Quanta quad romley system (S400-X44E) successfully. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=788195 https://bugzilla.novell.com/show_bug.cgi?id=788195#c Jeff Mahoney <jeffm@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- AssignedTo|kernel-maintainers@forge.pr |bpetkov@suse.com |ovo.novell.com | -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=788195 https://bugzilla.novell.com/show_bug.cgi?id=788195#c4 Borislav Petkov <bpetkov@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |bpetkov@suse.com --- Comment #4 from Borislav Petkov <bpetkov@suse.com> 2013-07-15 21:34:54 UTC --- We have a similar issue (bnc#822927) with the mei_me module and waiting for an upstream fix. Btw, is 3.10 (http://kernel.opensuse.org/packages/master) still broken for you without blacklisting the module? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=788195 https://bugzilla.novell.com/show_bug.cgi?id=788195#c Borislav Petkov <bpetkov@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |NEEDINFO InfoProvider| |rick@microway.com -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=788195 https://bugzilla.novell.com/show_bug.cgi?id=788195#c5 Borislav Petkov <bpetkov@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |CLOSED InfoProvider|rick@microway.com | Resolution| |NORESPONSE --- Comment #5 from Borislav Petkov <bpetkov@suse.com> 2013-09-16 16:10:49 UTC --- two months old... -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@novell.com