Mailinglist Archive: opensuse-bugs (2746 mails)

< Previous Next >
[Bug 825510] "shutdown -r now" fails @ Xen 4.3.0 Dom0 -- systems halts, but does not restart

https://bugzilla.novell.com/show_bug.cgi?id=825510

https://bugzilla.novell.com/show_bug.cgi?id=825510#c5


A R <ar16@xxxxxxxxxxxx> changed:

What |Removed |Added
----------------------------------------------------------------------------
Status|NEEDINFO |NEW
InfoProvider|ar16@xxxxxxxxxxxx |

--- Comment #5 from A R <ar16@xxxxxxxxxxxx> 2013-06-27 12:59:23 UTC ---
OK, stepwise ...

For this env,

uname -a
Linux xen03.loc 3.7.10-1.16-xen #1 SMP Fri May 31 20:21:23 UTC 2013
(97c14ba) x86_64 x86_64 x86_64 GNU/Linux
lsb_release -a
LSB Version: n/a
Distributor ID: openSUSE project
Description: openSUSE 12.3 (x86_64)
Release: 12.3
Codename: Dartmouth
rpm -qa | grep -i xen
kernel-xen-3.7.10-1.16.1.x86_64
xen-devel-4.3.0_05-255.1.x86_64
kernel-xen-devel-3.7.10-1.16.1.x86_64
xen-libs-4.3.0_05-255.1.x86_64
xen-4.3.0_05-255.1.x86_64
xen-tools-4.3.0_05-255.1.x86_64

booting with the following Grub config (used for "ages"):

title Xen
root (hd0,0)
kernel /xen.gz vga=gfx-1280x1024x16 conring_size=64
systemd.log_level=debug systemd.log_target=syslog-or-kmsg loglvl=debug
guest_loglvl=debug com1=57600,8n1,pci console=vga,com1 console_timestamps
dom0_mem=1024M,max:1024M dom0_vcpus_pin=true dom0_max_vcpus=4 sched=credit
apic_verbosity=verbose iommu=verbose cpuidle=1 cpufreq=xen clocksource=acpi
numa=on cpuf
module /vmlinuz-xen log_buf_len=4M console=tty0 console=xvc0,57600n8
xencons=tty earlyprintk=xen vga=0x31a nomodeset=0 root=/dev/VG0/ROOT
rootfstype=ext4 rootflags=journal_checksum noresume showopts selinux=0
SELINUX_INIT=NO apparmor=0 elevator=cfq clocksource=xen mce=off noquiet
module /initrd-xen

of course, let me know if a different set of parameters is more useful/helpful.

we'd need a serial log taken of the shutdown operation, so we can
see whether kernel or hypervisor crash in any way, or how far shutdown
proceeds.

@ `shutdown -r now`, tail of what i *think* is needed/relevant at serial
console output:
---------------------------
...
[ OK ] Reached target Unmount All Filesystems.
[ OK ] Stopped target Local File Systems (Pre).
Stopping Remount Root and Kernel File Systems...
[ OK ] Stopped Remount Root and Kernel File Systems.
Starting Save Random Seed...
Starting Update UTMP about System Shutdown...
Stopping Replay Read-Ahead Data...
[ OK ] Stopped Replay Read-Ahead Data.
Stopping Collect Read-Ahead Data...
[ OK ] Stopped Collect Read-Ahead Data.
Stopping LSB: Start LVM2...
[ OK ] Started Save Random Seed.
[ OK ] Started Update UTMP about System Shutdown.
[ OK ] Stopped LSB: Start LVM2.
Stopping LSB: Multiple Device RAID...
[ OK ] Stopped LSB: Multiple Device RAID.
[ OK ] Reached target Shutdown.
(XEN) [2013-06-27 18:04:39] mm.c:618:d0 Could not get page ref for pfn fec00
(XEN) [2013-06-27 18:04:39] mm.c:618:d0 Could not get page ref for pfn fec00
(XEN) [2013-06-27 18:04:39] mm.c:618:d0 Could not get page ref for pfn fec00
(XEN) [2013-06-27 18:04:39] mm.c:618:d0 Could not get page ref for pfn fec00
(XEN) [2013-06-27 18:04:39] mm.c:618:d0 Could not get page ref for pfn fec00
(XEN) [2013-06-27 18:04:39] mm.c:618:d0 Could not get page ref for pfn fec00
(XEN) [2013-06-27 18:04:39] mm.c:618:d0 Could not get page ref for pfn fec00
(XEN) [2013-06-27 18:04:39] mm.c:618:d0 Could not get page ref for pfn fec00
Sending SIGTERM to remaining processes...
Sending SIGKILL to remaining processes...
(XEN) [2013-06-27 18:04:43] mm.c:618:d0 Could not get page ref for pfn fec00
(XEN) [2013-06-27 18:04:43] mm.c:618:d0 Could not get page ref for pfn fec00
Hardware watchdog 'SP5100 TCO timer', version 0
(XEN) [2013-06-27 18:04:43] mm.c:618:d0 Could not get page ref for pfn fec00
Unmounting file systems.
Unmounting /var/lib/dhcp/proc.
Unmounting /var/run.
Unmounting /dev/mqueue.
All filesystems unmounted.
Deactivating swaps.
All swaps deactivated.
Detaching loop devices.
All loop devices detached.
Detaching DM devices.
Detaching DM 253:7.
Detaching DM 253:6.
Detaching DM 253:5.
Detaching DM 253:4.
Detaching DM 253:3.
Detaching DM 253:2.
Detaching DM 253:0.
Not all DM devices detached, 1 left.
(XEN) [2013-06-27 18:04:43] mm.c:618:d0 Could not get page ref for pfn fec00
Detaching DM devices.
Not all DM devices detached, 1 left.
Cannot finalize remaining file systems and devices, giving up.
(XEN) [2013-06-27 18:04:45] mm.c:618:d0 Could not get page ref for pfn fec00
[ 1399.988852] Restarting system.
---------------------------

at this point it just sits, and goes no further. the system does NOT poweroff.

Does normal shutdown work, or does it also halt the machine without
turning it off?

manual/cold reboot, then @ `shutdown -h now`, it *DOES* successfully poweroff.
Here's the similar, serial console tail:
---------------------------
[ OK ] Reached target Unmount All Filesystems.
[ OK ] Stopped target Local File Systems (Pre).
Stopping Remount Root and Kernel File Systems...
[ OK ] Stopped Remount Root and Kernel File Systems.
Starting Save Random Seed...
Starting Update UTMP about System Shutdown...
Stopping Replay Read-Ahead Data...
[ OK ] Stopped Replay Read-Ahead Data.
Stopping Collect Read-Ahead Data...
[ OK ] Stopped Collect Read-Ahead Data.
Stopping LSB: Start LVM2...
[ OK ] Started Save Random Seed.
[ OK ] Started Update UTMP about System Shutdown.
[ OK ] Stopped LSB: Start LVM2.
Stopping LSB: Multiple Device RAID...
[ OK ] Stopped LSB: Multiple Device RAID.
[ OK ] Reached target Shutdown.
Sending SIGTERM to remaining processes...
Sending SIGKILL to remaining processes...
Unmounting file systems.
Unmounting /var/lib/dhcp/proc.
Unmounting /var/lib/nfs/rpc_pipefs.
Unmounting /var/run.
Unmounting /dev/mqueue.
All filesystems unmounted.
Deactivating swaps.
All swaps deactivated.
Detaching loop devices.
All loop devices detached.
Detaching DM devices.
Detaching DM 253:7.
Detaching DM 253:6.
Detaching DM 253:5.
Detaching DM 253:4.
Detaching DM 253:3.
Detaching DM 253:2.
Detaching DM 253:0.
Not all DM devices detached, 1 left.
Detaching DM devices.
Not all DM devices detached, 1 left.
Cannot finalize remaining file systems and devices, giving up.
(XEN) [2013-06-27 18:16:05] mm.c:618:d0 Could not get page ref for pfn fec00
[ 256.208915] Power down.
(XEN) [2013-06-27 18:16:07] Preparing system for ACPI S5 state.
(XEN) [2013-06-27 18:16:07] Disabling non-boot CPUs ...
(XEN) [2013-06-27 18:16:07] Breaking affinity for d0v1
(XEN) [2013-06-27 18:16:07] Breaking affinity for d0v2
(XEN) [2013-06-27 18:16:08] Breaking affinity for d0v3
(XEN) [2013-06-27 18:16:08] Entering ACPI S5 state.
---------------------------

and, at this point, it's successfully powered-off.

with 12.3 not shipping with Xen 4.3, we'd want you to test with the
shipped version of Xen (and, in case you updated that too, kernel).

Pending

with you apparently knowing that it worked before a recent update,
narrowing down which update this was would also help.

Pending

with the native kernel working, attaching the boot log of the
native kernel (to see eventual log messages regarding applied workarounds)
would be as helpful as providing exact hardware details (namely DMI
information).

not entirely sure what 'boot log' is being asked for in a systemd world, since
boot.*msg no longer appears.

here's `journalctl -b | grep -i kernel`:

http://pastebin.com/raw.php?i=khT1Da6T

--
Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.

< Previous Next >
References