[Bug 663675] New: gcc ICE crashes when using KVM
https://bugzilla.novell.com/show_bug.cgi?id=663675

https://bugzilla.novell.com/show_bug.cgi?id=663675#c0

           Summary: gcc ICE crashes when using KVM
    Classification: openSUSE
           Product: openSUSE 11.4
           Version: Factory
          Platform: Other
        OS/Version: Other
            Status: NEW
          Severity: Normal
          Priority: P5 - None
         Component: Kernel
        AssignedTo: kernel-maintainers@forge.provo.novell.com
        ReportedBy: adrian@novell.com
         QAContact: qa@suse.de
                CC: mls@novell.com, rguenther@novell.com
          Found By: ---
           Blocker: ---

When using openSUSE 11.4 as the base for our OBS build hosts with KVM, we
observe many unreproducible gcc ICE crashes (which do not occur on XEN or
native builds).

They are triggered by an abort in libgmp, when doing

  __gmpn_cpuid (vendor_string, 0);
  vendor_string[12] = 0;
  ..
  else if (strcmp (vendor_string, "AuthenticAMD") == 0)
    {
      switch (family)
        {
        case 5:
        case 6:
          abort ();     <--- got called
          break;
  ..

Our assumption is that the KVM kernel is returning random data under high
load, which leads to these crashes after some time.

Can we enable more kernel debug output somehow?

--
Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.
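The vendor check quoted in the report can be sketched in isolation, which may help when comparing what a guest sees against what the host reports. This is only an illustration under stated assumptions: vendor_from_regs() and hits_quoted_abort() are hypothetical names, not libgmp functions, and the register values are passed in as plain integers so that the logic can be exercised without executing CPUID. CPUID leaf 0 returns the vendor string in EBX, EDX, ECX order.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: build the 12-character vendor string from the
   CPUID leaf 0 register values, which hold it in EBX, EDX, ECX order,
   least significant byte first. */
static void vendor_from_regs(uint32_t ebx, uint32_t edx, uint32_t ecx,
                             char out[13])
{
    const uint32_t regs[3] = { ebx, edx, ecx };

    for (int r = 0; r < 3; r++)
        for (int b = 0; b < 4; b++)
            out[4 * r + b] = (char)((regs[r] >> (8 * b)) & 0xff);
    out[12] = '\0';
}

/* Hypothetical helper: returns 1 for the vendor/family combination that
   reaches abort() in the snippet quoted in the report, 0 otherwise. */
static int hits_quoted_abort(const char *vendor, int family)
{
    return strcmp(vendor, "AuthenticAMD") == 0
           && (family == 5 || family == 6);
}
```

Feeding the registers logged from a failing guest into these helpers would show whether the crash path needs a corrupted vendor string, a corrupted family value, or both.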
https://bugzilla.novell.com/show_bug.cgi?id=663675#c1
--- Comment #1 from Richard Guenther
https://bugzilla.novell.com/show_bug.cgi?id=663675#c2
Adrian Schröter
https://bugzilla.novell.com/show_bug.cgi?id=663675#c3
Alexander Graf
https://bugzilla.novell.com/show_bug.cgi?id=663675#c4
Adrian Schröter
https://bugzilla.novell.com/show_bug.cgi?id=663675#c5
--- Comment #5 from Adrian Schröter
https://bugzilla.novell.com/show_bug.cgi?id=663675#c
Jeff Mahoney
https://bugzilla.novell.com/show_bug.cgi?id=663675#c6
Richard Guenther
https://bugzilla.novell.com/show_bug.cgi?id=663675#c7
--- Comment #7 from Marius Tomaschewski
> Fam. 5/6 are 32bit CPUs. I suspect the hypervisor is confused by a mix of
> 32bit and 64bit guests and does invalid CPUID caching?
I think we have a combination of two bugs here. We should split the two bug
reports again and fix libgmp so that it neither calls abort() nor makes
unreliable assumptions based on the CPU family.

I've checked the CPU family on some hosts, and I don't think the "cpu family"
field in /proc/cpuinfo is reliable enough to justify calling abort():

64bit CPUs:
  AMD Athlon(tm) 64 X2 Dual Core Processor 4600+    cpu family : 15
  AMD Athlon(tm) Dual Core Processor 5050e          cpu family : 15
  AMD Turion(tm) 64 Mobile Technology MT-34         cpu family : 15
  Intel(R) Core(TM)2 Duo CPU E7200 @ 2.53GHz        cpu family : 6
  Intel(R) Core(TM)2 Duo CPU E8400 @ 3.00GHz        cpu family : 6
  Intel(R) Core(TM) i7 CPU 965 @ 3.20GHz            cpu family : 6
  Intel(R) Xeon(R) CPU X5376 @ 2.80GHz              cpu family : 6

32bit CPUs:
  Intel(R) Xeon(TM) CPU 2.40GHz                     cpu family : 15
  Pentium III (Coppermine)                          cpu family : 6

So as you can see, family 6 is not necessarily a 32bit CPU, and family 15 is
not necessarily a 64bit CPU either.

I guess this has to do with the detection in arch/x86/kernel/cpu/common.c,
cpu_detect() (and other places in this file), where it uses cpuid() and makes
some calculations ...
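For reference, the kind of calculation meant here can be sketched as follows. It follows the documented layout of EAX from CPUID leaf 1 (base family in bits 8-11, extended family in bits 20-27, base model in bits 4-7, extended model in bits 16-19). The function names are illustrative only and this is not a copy of the kernel code; the raw values in the usage note are example signatures chosen to exercise each branch.

```c
#include <stdint.h>

/* Hypothetical helper: decode the CPU family from the EAX value returned
   by CPUID leaf 1. The extended family field is only added when the base
   family is 0xf. */
static unsigned cpu_family(uint32_t eax)
{
    unsigned family = (eax >> 8) & 0xf;

    if (family == 0xf)
        family += (eax >> 20) & 0xff;
    return family;
}

/* Hypothetical helper: decode the CPU model. The extended model field
   supplies the upper four bits for family 6 and above. */
static unsigned cpu_model(uint32_t eax)
{
    unsigned model = (eax >> 4) & 0xf;

    if (cpu_family(eax) >= 6)
        model += ((eax >> 16) & 0xf) << 4;
    return model;
}
```

For example, cpu_family(0x0001067A) yields 6 and cpu_family(0x00040FB2) yields 15. Neither result says anything about 64-bit capability, which CPUID reports separately through the long mode bit (leaf 0x80000001, EDX bit 29), consistent with the table above.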
https://bugzilla.novell.com/show_bug.cgi?id=663675#c8
Marius Tomaschewski
https://bugzilla.novell.com/show_bug.cgi?id=663675#c9
Richard Guenther
https://bugzilla.novell.com/show_bug.cgi?id=663675#c10
Petr Baudis
https://bugzilla.novell.com/show_bug.cgi?id=663675#c11
Jeff Mahoney