[opensuse-virtual] Latest Xen 4.1.2* updates on 11.4 -- fails to boot
Hi, A careless update today on my part on an opensuse 11.4 server, following XEN from repo baseurl=http://download.opensuse.org/repositories/Virtualization/openSUSE_11.4 has killed a working system that's been running for quite awhile on 4.1.2. My bad. I'm trying to figure out what's going on on my end -- can't seem to get the info I need from logs, yet. Anyway, are there any specific things to look for in this lates update? Changes in config, etc required? The system I *had* been running had 'xl' use disabled (remember reading something about that on this list awhile ago.) Something there maybe? At this point, just looking for help while I try to dig myself out this *&^%* hole I dug for myself :-( Cheers, Garrett -- To unsubscribe, e-mail: opensuse-virtual+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-virtual+owner@opensuse.org
I set up terminal access over a serial port, and turned on loglve=all and guest_loglvl=all. At terminal, I see the system halt on boot with a "Oops: 0002 [#1] SMP", ... (XEN) [2011-10-20 20:39:30] microcode: collect_cpu_info: patch_id=0x1000086 (XEN) [2011-10-20 20:39:30] microcode: size 3860, total_size 960, offset 92 (XEN) [2011-10-20 20:39:30] microcode: CPU0 updated from revision 0x1000086 to 0x1000083 (XEN) [2011-10-20 20:39:30] microcode: collect_cpu_info: patch_id=0x1000086 (XEN) [2011-10-20 20:39:30] microcode: size 3860, total_size 960, offset 92 (XEN) [2011-10-20 20:39:30] microcode: CPU1 updated from revision 0x1000086 to 0x1000083 (XEN) [2011-10-20 20:39:30] microcode: collect_cpu_info: patch_id=0x1000086 (XEN) [2011-10-20 20:39:30] microcode: size 3860, total_size 960, offset 92 (XEN) [2011-10-20 20:39:30] microcode: CPU2 updated from revision 0x1000086 to 0x1000083 (XEN) [2011-10-20 20:39:30] microcode: collect_cpu_info: patch_id=0x1000086 (XEN) [2011-10-20 20:39:30] microcode: size 3860, total_size 960, offset 92 (XEN) [2011-10-20 20:39:30] microcode: CPU3 updated from revision 0x1000086 to 0x1000083 [ 72.890814] microcode: Microcode Update Driver: v2.00-xen <tigran@aivazian.fsnet.co.uk>, Peter Oruba [ 72.891424] BUG: unable to handle kernel NULL pointer dereference at 00000000000000e8 [ 72.891449] IP: [<ffffffff80005989>] __switch_to+0x2e9/0x640 [ 72.891476] PGD 37601067 PUD 35b44067 PMD 0 [ 72.891493] Oops: 0002 [#1] SMP [ 72.891504] last sysfs file: /sys/devices/platform/microcode/firmware/microcode/loading [ 72.891523] CPU 1 [ 72.891529] Modules linked in: microcode pciback xenbus_be n_hdlc slhc nfsd nfs lockd fscache nfs_acl auth_rpcgss sunrpc snd_hda_codec_via snd_hda_intel snd_hda_codec snd_hwdep sky2 snd_pcm snd_timer snd i2c_piix4 sr_mod soundcore sh2 [ 72.891745] [ 72.891754] Pid: 1868, comm: firmware Not tainted 2.6.37.6-0.7-xen #1 System manufacturer System Product Name/M3A78-CM [ 72.891779] RIP: e030:[<ffffffff80005989>] [<ffffffff80005989>] __switch_to+0x2e9/0x640 [ 72.891798] RSP: e02b:ffff8800383f5af8 EFLAGS: 00010002 [ 72.891808] RAX: ffff880002196a00 RBX: ffff8800022c47c0 RCX: ffff88003efd6000 [ 72.891819] RDX: ffff8800382e6600 RSI: ffff8800022c47c0 RDI: ffff8800382e6600 [ 72.891830] RBP: ffff8800382e6600 R08: ffffffffff76abc0 R09: ffff8800022c47f8 [ 72.891842] R10: 0000000000007ff0 R11: 0000000000000001 R12: 0000000000000000 [ 72.891853] R13: 0000000000000001 R14: ffff8800395e9bc0 R15: 0000000000000000 [ 72.891873] FS: 00007fe9e2d2f7a0(0000) GS:ffff88003ef9b000(0000) knlGS:0000000000000000 [ 72.891885] CS: e033 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 72.891895] CR2: 00000000000000e8 CR3: 00000000026d3000 CR4: 0000000000000660 [ 72.891907] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [ 72.891919] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 [ 72.891930] Process firmware (pid: 1868, threadinfo ffff880038bea000, task ffff8800382e6600) [ 72.891941] Stack: [ 72.891947] 0000000000009dc0 ffff88003ef90e40 ffff88003efc7078 0000000000000000 [ 72.891968] 0000000000000000 0000000000000000 0000000000000000 0000000000000000 [ 72.891985] 0000000000000001 0000000000000001 0000000000000001 0000000100000000 [ 72.892002] Call Trace: Something to do with "microcode"? I didn't have this problem a few days ago, prior to the latest upgrade on this hardware. So maybe close? Garrett -- To unsubscribe, e-mail: opensuse-virtual+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-virtual+owner@opensuse.org
On 20.10.11 at 22:47, <tmp.875876@123mail.org> wrote: I set up terminal access over a serial port, and turned on loglve=all and guest_loglvl=all.
At terminal, I see the system halt on boot with a "Oops: 0002 [#1] SMP",
... (XEN) [2011-10-20 20:39:30] microcode: collect_cpu_info: patch_id=0x1000086 (XEN) [2011-10-20 20:39:30] microcode: size 3860, total_size 960, offset 92 (XEN) [2011-10-20 20:39:30] microcode: CPU0 updated from revision 0x1000086 to 0x1000083 (XEN) [2011-10-20 20:39:30] microcode: collect_cpu_info: patch_id=0x1000086 (XEN) [2011-10-20 20:39:30] microcode: size 3860, total_size 960, offset 92 (XEN) [2011-10-20 20:39:30] microcode: CPU1 updated from revision 0x1000086 to 0x1000083 (XEN) [2011-10-20 20:39:30] microcode: collect_cpu_info: patch_id=0x1000086 (XEN) [2011-10-20 20:39:30] microcode: size 3860, total_size 960, offset 92 (XEN) [2011-10-20 20:39:30] microcode: CPU2 updated from revision 0x1000086 to 0x1000083 (XEN) [2011-10-20 20:39:30] microcode: collect_cpu_info: patch_id=0x1000086 (XEN) [2011-10-20 20:39:30] microcode: size 3860, total_size 960, offset 92 (XEN) [2011-10-20 20:39:30] microcode: CPU3 updated from revision 0x1000086 to 0x1000083 [ 72.890814] microcode: Microcode Update Driver: v2.00-xen <tigran@aivazian.fsnet.co.uk>, Peter Oruba [ 72.891424] BUG: unable to handle kernel NULL pointer dereference at 00000000000000e8 [ 72.891449] IP: [<ffffffff80005989>] __switch_to+0x2e9/0x640 [ 72.891476] PGD 37601067 PUD 35b44067 PMD 0 [ 72.891493] Oops: 0002 [#1] SMP [ 72.891504] last sysfs file: /sys/devices/platform/microcode/firmware/microcode/loading [ 72.891523] CPU 1 [ 72.891529] Modules linked in: microcode pciback xenbus_be n_hdlc slhc nfsd nfs lockd fscache nfs_acl auth_rpcgss sunrpc snd_hda_codec_via snd_hda_intel snd_hda_codec snd_hwdep sky2 snd_pcm snd_timer snd i2c_piix4 sr_mod soundcore sh2 [ 72.891745] [ 72.891754] Pid: 1868, comm: firmware Not tainted 2.6.37.6-0.7-xen #1 System manufacturer System Product Name/M3A78-CM [ 72.891779] RIP: e030:[<ffffffff80005989>] [<ffffffff80005989>] __switch_to+0x2e9/0x640 [ 72.891798] RSP: e02b:ffff8800383f5af8 EFLAGS: 00010002 [ 72.891808] RAX: ffff880002196a00 RBX: ffff8800022c47c0 RCX: ffff88003efd6000 [ 72.891819] RDX: ffff8800382e6600 RSI: ffff8800022c47c0 RDI: ffff8800382e6600 [ 72.891830] RBP: ffff8800382e6600 R08: ffffffffff76abc0 R09: ffff8800022c47f8 [ 72.891842] R10: 0000000000007ff0 R11: 0000000000000001 R12: 0000000000000000 [ 72.891853] R13: 0000000000000001 R14: ffff8800395e9bc0 R15: 0000000000000000 [ 72.891873] FS: 00007fe9e2d2f7a0(0000) GS:ffff88003ef9b000(0000) knlGS:0000000000000000 [ 72.891885] CS: e033 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 72.891895] CR2: 00000000000000e8 CR3: 00000000026d3000 CR4: 0000000000000660 [ 72.891907] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [ 72.891919] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 [ 72.891930] Process firmware (pid: 1868, threadinfo ffff880038bea000, task ffff8800382e6600) [ 72.891941] Stack: [ 72.891947] 0000000000009dc0 ffff88003ef90e40 ffff88003efc7078 0000000000000000 [ 72.891968] 0000000000000000 0000000000000000 0000000000000000 0000000000000000 [ 72.891985] 0000000000000001 0000000000000001 0000000000000001 0000000100000000 [ 72.892002] Call Trace:
Something to do with "microcode"? I didn't have this problem a few days ago, prior to the latest upgrade on this hardware. So maybe close?
Yes, upstream c/s 23871:503ee256fecf (which got backported both by us and upstream to 4.0.x and 4.1.x) introduced this. You need to back-rev either microcode_ctl or xen for the time being. Jan -- To unsubscribe, e-mail: opensuse-virtual+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-virtual+owner@opensuse.org
Jan, On Friday, October 21, 2011 9:20 AM, "Jan Beulich" <JBeulich@suse.com> wrote:
Something to do with "microcode"? I didn't have this problem a few days ago, prior to the latest upgrade on this hardware. So maybe close?
Yes, upstream c/s 23871:503ee256fecf (which got backported both by us and upstream to 4.0.x and 4.1.x) introduced this. You need to back-rev either microcode_ctl or xen for the time being.
We're too hooked on the general stability of the Xen version we use from you folks. Really don't want to drop back to another version, if I understand what you mean. Not even sure where I'd GET an older version. Sounds like I can monkey with microcode_ctl version instead? What version of that safe or recommended? Or should I just rm it or turn off the microcode svc? Thanks a lot. Garrett -- To unsubscribe, e-mail: opensuse-virtual+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-virtual+owner@opensuse.org
On 21.10.11 at 15:39, <tmp.875876@123mail.org> wrote: Sounds like I can monkey with microcode_ctl version instead? What version of that safe or recommended?
Don't know.
Or should I just rm it or turn off the microcode svc?
Yes, "insserv -r" on it was reported to help. Jan -- To unsubscribe, e-mail: opensuse-virtual+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-virtual+owner@opensuse.org
Jan On Friday, October 21, 2011 2:47 PM, "Jan Beulich" <JBeulich@suse.com> wrote:
On 21.10.11 at 15:39, <tmp.875876@123mail.org> wrote: Sounds like I can monkey with microcode_ctl version instead? What version of that safe or recommended?
Don't know.
Or should I just rm it or turn off the microcode svc?
Yes, "insserv -r" on it was reported to help.
I rm'd the microcode_ctl for now. The system -- host & guests -- is back up and running. Doesn't seem like there's any problems. TBH I don't what good microcode_Ctl does for me or if I need it on both the hosts and guests at all. For now I guess I'll wait for the Xen bug you referred to get fixed. Cheers, Garrett -- To unsubscribe, e-mail: opensuse-virtual+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-virtual+owner@opensuse.org
confirming On Friday, October 21, 2011 9:29 PM, tmp.875876@123mail.org wrote:
For now I guess I'll wait for the Xen bug you referred to get fixed.
on uname -a Linux test 2.6.37.6-0.7-xen #1 SMP 2011-07-21 02:17:24 +0200 x86_64 x86_64 x86_64 GNU/Linux updating Dom0 to latest xen-* and re-installing microcode_ctl ( *SHOULD* microcode_ctl ALSO be (re)installed in Xen Guests? I don't know if these microcode updates are relevant only to the Host OS, or also to each of the virtual guests?) rpm -qa | egrep -i "^xen|microcode" xen-devel-4.1.2_05-157.1.x86_64 xen-tools-4.1.2_05-157.1.x86_64 xen-doc-pdf-4.1.2_05-157.1.x86_64 xen-4.1.2_05-157.1.x86_64 xen-libs-4.1.2_05-157.1.x86_64 xen-doc-html-4.1.2_05-157.1.x86_64 microcode_ctl-1.17-120.1.x86_64 rpm -q --changelog xen-libs-4.1.2_05-157.1.x86_64 | head -n 15 * Wed Oct 26 2011 carnold@novell.com - bnc#725169 - xen-4.0.2_21511_03-0.5.3: bootup hangs 23993-x86-microcode-amd-fix-23871.patch * Wed Oct 26 2011 carnold@novell.com - Update to Xen 4.1.2 FCS c/s 23174 * Mon Oct 24 2011 jfehlig@suse.com - bnc#720054 - Fix syntax error introduced during recent adjustment of Xen's tap udev rule. Updated udev-rules.patch * Thu Oct 20 2011 ohering@suse.de - fate#310510 - fix xenpaging Merge paging related fixes from xen-unstable: Dom0 reboot is OK again, checking dmesg | grep micro [ 36.527298] microcode: Microcode Update Driver: v2.00-xen <tigran@aivazian.fsnet.co.uk>, Peter Oruba xm dmesg | grep micro (XEN) [2011-10-29 17:10:29] microcode: collect_cpu_info: patch_id=0x1000086 (XEN) [2011-10-29 17:10:29] microcode: size 3860, total_size 960, offset 92 (XEN) [2011-10-29 17:10:29] microcode: size 3860, total_size 960, offset 1060 (XEN) [2011-10-29 17:10:29] microcode: size 3860, total_size 1824, offset 2028 (XEN) [2011-10-29 17:10:29] microcode: collect_cpu_info: patch_id=0x1000086 (XEN) [2011-10-29 17:10:29] microcode: size 3860, total_size 960, offset 92 (XEN) [2011-10-29 17:10:29] microcode: size 3860, total_size 960, offset 1060 (XEN) [2011-10-29 17:10:29] microcode: size 3860, total_size 1824, offset 2028 (XEN) [2011-10-29 17:10:29] microcode: collect_cpu_info: patch_id=0x1000086 (XEN) [2011-10-29 17:10:29] microcode: size 3860, total_size 960, offset 92 (XEN) [2011-10-29 17:10:29] microcode: size 3860, total_size 960, offset 1060 (XEN) [2011-10-29 17:10:29] microcode: size 3860, total_size 1824, offset 2028 (XEN) [2011-10-29 17:10:29] microcode: collect_cpu_info: patch_id=0x1000086 (XEN) [2011-10-29 17:10:29] microcode: size 3860, total_size 960, offset 92 (XEN) [2011-10-29 17:10:29] microcode: size 3860, total_size 960, offset 1060 (XEN) [2011-10-29 17:10:29] microcode: size 3860, total_size 1824, offset 2028 Thanks. Garrett -- To unsubscribe, e-mail: opensuse-virtual+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-virtual+owner@opensuse.org
participants (2)
-
Jan Beulich
-
tmp.875876@123mail.org