[Bug 919284] New: boot fails with LVMs-on-RAID; systemd "Timed out" and "Dependency failed" errors - referred from systemd-info ML to 'downstream'
http://bugzilla.opensuse.org/show_bug.cgi?id=919284

Bug ID: 919284
Summary: boot fails with LVMs-on-RAID; systemd "Timed out" and "Dependency failed" errors - referred from systemd-info ML to 'downstream'
Classification: openSUSE
Product: openSUSE Distribution
Version: 13.2
Hardware: x86-64
OS: openSUSE 13.2
Status: NEW
Severity: Major
Priority: P5 - None
Component: Basesystem
Assignee: bnc-team-screening@forge.provo.novell.com
Reporter: h15234@mailas.com
QA Contact: qa-bugs@suse.de
Found By: ---
Blocker: ---
If it's about systemd not properly activating LVM volumes, I would open a bug report at openSUSE. It doesn't sound like an upstream issue as such.
From this thread: http://lists.opensuse.org/opensuse/2015-02/msg00602.html
So despite systemd reporting that fscks were not done on HOME and VAR,
I'm working on an openSUSE 13.2 machine running systemd v210. Its disks are all on RAID.

/boot is on RAID1 on /dev/md126. The remaining partitions are on LVM-on-RAID10.

The LVs are:

LV_ROOT VG0 -wi-ao---  20.00g
LV_SWAP VG0 -wi-ao---   8.00g
LV_HOME VG0 -wi-ao--- 100.00g
LV_VAR  VG0 -wi-ao---   1.00g

The system fails to boot, dropping to a maintenance-mode prompt. Simply hitting Ctrl-D to continue finishes booting the system. After boot, checking:

journalctl -b | egrep -i "Timed out|result=dependency" | egrep -i "dev|mount"

Feb 20 08:16:15 ender systemd[1]: Job dev-VG0-LV_HOME.device/start timed out.
Feb 20 08:16:15 ender systemd[1]: Timed out waiting for device dev-VG0-LV_HOME.device.
Feb 20 08:16:15 ender systemd[1]: Job systemd-fsck@dev-VG0-LV_HOME.service/start finished, result=dependency
Feb 20 08:16:15 ender systemd[1]: Dependency failed for File System Check on /dev/VG0/LV_HOME.
Feb 20 08:16:15 ender systemd[1]: Job dev-VG0-LV_VAR.device/start timed out.
Feb 20 08:16:15 ender systemd[1]: Timed out waiting for device dev-VG0-LV_VAR.device.
Feb 20 08:16:15 ender systemd[1]: Job systemd-fsck@dev-VG0-LV_VAR.service/start finished, result=dependency
Feb 20 08:16:15 ender systemd[1]: Dependency failed for File System Check on /dev/VG0/LV_VAR.
Feb 20 08:16:15 ender systemd[1]: Job dev-disk-by\x2did-dm\x2dname\x2dVG0\x2dLV_HOME.device/start timed out.
Feb 20 08:16:15 ender systemd[1]: Timed out waiting for device dev-disk-by\x2did-dm\x2dname\x2dVG0\x2dLV_HOME.device.

This is reported ON the same system, i.e. all the LVs are correctly mounted and fully functional. Why are these time-outs and dependency failures occurring? What do I need to change/fix to make sure it does not happen, and to avoid getting dropped into emergency mode on boot?
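As an aside, the odd-looking "dev-disk-by\x2did-..." unit names in the journal come from systemd's unit-name escaping of device paths: "/" separators become "-", and any literal "-" inside a path component is escaped as "\x2d". A simplified shell sketch of just that part of the rule (the real tool is systemd-escape, which also escapes other special characters):

```shell
# Simplified illustration of systemd device-unit naming; handles only
# "-" and "/" (the two characters appearing in the paths above).
escape_path() {
  # First escape dashes inside components, then strip the leading "/"
  # and turn the remaining "/" separators into "-".
  printf '%s\n' "$1" | sed -e 's/-/\\x2d/g' -e 's/^\///' -e 's/\//-/g'
}

escape_path /dev/VG0/LV_HOME                  # -> dev-VG0-LV_HOME
escape_path /dev/disk/by-id/dm-name-VG0-LV_HOME
# -> dev-disk-by\x2did-dm\x2dname\x2dVG0\x2dLV_HOME
```

This matches the unit names in the timeout messages, e.g. /dev/VG0/LV_HOME corresponds to dev-VG0-LV_HOME.device.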
hanlon FWIW: I originally posted this to the systemd-info mailing list; it got this reply:

On Fri, 20.02.15 08:38, h15234@mailas.com (h15234@mailas.com) wrote:

> I'm working on a machine running systemd v210 (Opensuse 13.2)

This is a really old systemd version, please ask downstream for help on such an old version!

> Its disks are all on RAID.
>
> /boot is on RAID1 on /dev/md126
>
> The remaining partitions are on LVM-on-RAID10
>
> The LVs are
>
> LV_ROOT VG0 -wi-ao--- 20.00g
> LV_SWAP VG0 -wi-ao--- 8.00g
> LV_HOME VG0 -wi-ao--- 100.00g
> LV_VAR VG0 -wi-ao--- 1.00g

Well, LVM and RAID are nothing we support upstream, please talk to the LVM/MD communities or downstream for help.

Sorry,
Lennart

... the volumes are mounted anyway? Yes.
What about ROOT?
same. all volumes are mounted and fully available after boot completes.
Which version of Grub? Of LVM?
grub-0.97-200.1.3.x86_64
lvm2-2.02.98-43.17.1.x86_64
> /boot is on RAID1 on /dev/md126

That seems strange. It doesn't look like an LVM mapper address. I'm a great supporter of LVM, but I never put /boot on LVM. I know I can, but it's too much hassle when things go wrong.
That's not an LV. /boot is on an ext4-formatted RAID volume.
> Simply hitting Ctrl-D to continue, finishes booting the system.

Is this an "every time" occurrence or intermittent?
every time. fully reproducible.
Does it only occur on 'cold boots' when the disks are being spun up or also on reboots when the disks are already spinning?
both.
One "solution" is to edit the GRUB boot command line and add bootdelay=10 and possibly lvmwait=/dev/VG0/LV_HOME
I've already tried both, as well as x-systemd.device-timeout=5m in /etc/fstab. None of them, individually or in any combination, changes the problem.
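For reference, the fstab form of that timeout option looks like the following. The mount points and filesystem type here are assumptions for illustration, not taken from the reporter's actual /etc/fstab:

```
/dev/VG0/LV_HOME  /home  ext4  defaults,x-systemd.device-timeout=5m  0  2
/dev/VG0/LV_VAR   /var   ext4  defaults,x-systemd.device-timeout=5m  0  2
```

The option only extends how long systemd waits for the backing .device unit; it does not by itself cause the LV to be activated.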
If this is an 'every time' problem, I'd first make sure the device mapper module is in the initrd of your boot.
It already is:

cat /etc/sysconfig/kernel
INITRD_MODULES="... raid0 raid1 raid10 raid456 dm-mod ..."
*IF* this is not a disk drive timing problem, then it's a problem with LVM activation.
Agreed, which I'm guessing is why the /etc/systemd/system/lvm_local.service, with its explicit LVM Before/Requires dependencies and ExecStart=/sbin/vgchange --available y, seems to fix the problem. With the fix in place, booting finishes perfectly. Absolutely no more errors or warnings in dmesg, journalctl, etc.
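The workaround unit itself is not attached to the bug; a minimal sketch of what such an /etc/systemd/system/lvm_local.service might look like follows. The exact ordering targets and the udev-settle dependency are assumptions, only the vgchange invocation is taken from the report:

```ini
# Hypothetical reconstruction for illustration; the reporter's actual
# unit may differ in its dependencies.
[Unit]
Description=Local workaround: activate LVM volume groups before local mounts
DefaultDependencies=no
Requires=systemd-udev-settle.service
After=systemd-udev-settle.service
Before=local-fs-pre.target

[Service]
Type=oneshot
RemainAfterExit=yes
ExecStart=/sbin/vgchange --available y

[Install]
WantedBy=local-fs.target
```

The key point is ordering: activating all VGs before local-fs-pre.target means the dev-VG0-*.device units exist before systemd starts waiting on the fsck/mount jobs that depend on them.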
However, I'd also look at /etc/sysconfig/dmraid for timeout values.
It's already set:

DMRAID_DEVICE_TIMEOUT="120"

-- You are receiving this mail because: You are on the CC list for the bug.
Anton Aylward
--- Comment #2 from han l2
The vgchange command is there in
/usr/lib/systemd/system/lvm2-monitor.service
I'd check that on your system and see whether it exists. If so, I would enable and activate it. I'd prefer to use the supplied unit rather than a local/custom one.
Already enabled:

systemctl status lvm2-monitor
lvm2-monitor.service - Monitoring of LVM2 mirrors, snapshots etc. using dmeventd or progress polling
   Loaded: loaded (/usr/lib/systemd/system/lvm2-monitor.service; enabled)
   Active: active (exited) since Mon 2015-02-23 13:17:37 PST; 19h ago
     Docs: man:dmeventd(8)
           man:lvcreate(8)
           man:lvchange(8)
           man:vgchange(8)
  Process: 32103 ExecStart=/sbin/lvm vgchange --monitor y (code=exited, status=0/SUCCESS)
 Main PID: 32103 (code=exited, status=0/SUCCESS)

Enabling/disabling it makes no difference.
Andrei Borzenkov
--- Comment #4 from han l2
Please try to reproduce with the added kernel parameters "systemd.log_level=debug rd.udev.log-priority=debug udev.log-priority=debug" and with "quiet" removed. If you can, upload the output of "journalctl -b" after booting.
Adding those params causes boot to fail.

(1) This boots:

kernel /vmlinuz root=/dev/VG0/ROOT lvmwait=/dev/VG0/ROOT ... noquiet net.ifnames=0 biosdevname=0 systemd.log_target=journal-or-kmsg systemd.log_level=debug loglevel=debug

(2) This fails to boot:

kernel /vmlinuz root=/dev/VG0/ROOT lvmwait=/dev/VG0/ROOT ... noquiet net.ifnames=0 biosdevname=0 systemd.log_target=journal-or-kmsg systemd.log_level=debug loglevel=debug rd.udev.log-priority=debug udev.log-priority=debug

I'm trying to understand why. In case (2), journalctl output ends with:

...
Feb 24 11:33:52 hanl systemd[1]: Starting Trigger Flushing of Journal to Persistent Storage...
Feb 24 11:33:52 hanl systemd[1]: Starting Tell Plymouth To Write Out Runtime Data...
Feb 24 11:33:52 hanl systemd[1856]: Executing: /usr/bin/systemctl kill --kill-who=main --signal=SIGUSR1 systemd-journald.service
Feb 24 11:33:52 hanl systemd-journal[861]: Permanent journal is using 65.0M (max allowed 50.0M, trying to leave 887.9M free of 795.9M available → current limit 65.0M).
Feb 24 11:33:54 hanl systemd-journal[861]: Time spent on flushing to /var is 1.276575s for 18188 entries.
Feb 24 11:33:54 hanl systemd[1]: Got disconnect on private connection.
Feb 24 11:33:54 hanl systemd[1]: Received SIGCHLD from PID 1857 (plymouth).
Feb 24 11:33:54 hanl systemd[1]: Child 1857 (plymouth) died (code=exited, status=0/SUCCESS)
Feb 24 11:33:54 hanl systemd[1]: Child 1857 belongs to plymouth-read-write.service
Feb 24 11:33:54 hanl systemd[1]: plymouth-read-write.service changed start -> dead
Feb 24 11:33:54 hanl systemd[1]: Job plymouth-read-write.service/start finished, result=done
Feb 24 11:33:54 hanl systemd[1]: plymouth-read-write.service: cgroup is empty
Feb 24 11:33:54 hanl systemd[1]: Accepted new private connection.
Feb 24 11:33:54 hanl systemd[1]: sysinit.target changed dead -> active
Feb 24 11:33:54 hanl systemd[1]: Job sysinit.target/start finished, result=done
Feb 24 11:33:54 hanl systemd[1]: ConditionPathExists=/etc/alsa/state-daemon.conf failed for alsa-state.service.
Feb 24 11:33:54 hanl systemd[1]: Starting of alsa-state.service requested but condition failed. Ignoring.
Feb 24 11:33:54 hanl systemd[1]: Job alsa-state.service/start finished, result=done
Feb 24 11:33:54 hanl systemd[1]: ConditionPathExists=!/etc/alsa/state-daemon.conf succeeded for alsa-restore.service.
Feb 24 11:33:54 hanl systemd[1]: alsa-restore.service changed dead -> start
Feb 24 11:33:54 hanl systemd[1]: basic.target changed dead -> active
Feb 24 11:33:54 hanl systemd[1]: Job basic.target/start finished, result=done
Feb 24 11:33:54 hanl systemd[1]: ConditionVirtualization=false succeeded for mcelog.service.
Feb 24 11:33:54 hanl systemd[1]: mcelog.service changed dead -> running
Feb 24 11:33:54 hanl systemd[1]: Job mcelog.service/start finished, result=done
Feb 24 11:33:54 hanl systemd[1]: Got message type=signal sender=n/a destination=n/a object=/org/freedesktop/systemd1/agent interface=org.freedesktop.systemd1.Agent member=Released cookie=1 reply_cookie=0 error=n/a
Feb 24 11:33:54 hanl systemd[1]: Got disconnect on private connection.
Feb 24 11:33:52 hanl systemd[1]: Started Trigger Flushing of Journal to Persistent Storage.
Feb 24 11:33:54 hanl systemd[1860]: Executing: /usr/sbin/alsactl restore
Feb 24 11:33:54 hanl systemd[1861]: Executing: /usr/sbin/mcelog --ignorenodev --daemon --foreground
Feb 24 11:33:54 hanl systemd[1]: Started Restore Sound Card State.
Feb 24 11:34:01 hanl systemd[1]: Starting Stop Read-Ahead Data Collection...
Feb 24 11:34:01 hanl systemd[1864]: Executing: /usr/bin/systemd-notify --readahead=done
Feb 24 11:34:01 hanl systemd[1]: Started Stop Read-Ahead Data Collection.
Feb 24 11:34:34 hanl systemd-journal[861]: Suppressed 7548 messages from /
Feb 24 11:34:34 hanl systemd[1]: Got notification message for unit systemd-journald.service
Feb 24 11:34:34 hanl systemd[1]: systemd-journald.service: Got notification message from PID 861 (WATCHDOG=1...)
Feb 24 11:34:34 hanl systemd[1]: systemd-journald.service: got WATCHDOG=1
Feb 24 11:34:34 hanl systemd[1]: Got notification message for unit systemd-journald.service
Feb 24 11:34:34 hanl systemd[1]: systemd-journald.service: Got notification message from PID 861 (WATCHDOG=1...)
Feb 24 11:34:34 hanl systemd[1]: systemd-journald.service: got WATCHDOG=1
Feb 24 11:35:14 hanl systemd[1]: Got notification message for unit systemd-journald.service
Feb 24 11:35:14 hanl systemd[1]: systemd-journald.service: Got notification message from PID 861 (WATCHDOG=1...)
Feb 24 11:35:14 hanl systemd[1]: systemd-journald.service: got WATCHDOG=1
Feb 24 11:35:14 hanl systemd[1]: Got notification message for unit systemd-journald.service
Feb 24 11:35:14 hanl systemd[1]: systemd-journald.service: Got notification message from PID 861 (WATCHDOG=1...)
Feb 24 11:35:14 hanl systemd[1]: systemd-journald.service: got WATCHDOG=1
Feb 24 11:35:54 hanl systemd[1]: Got notification message for unit systemd-journald.service
Feb 24 11:35:54 hanl systemd[1]: systemd-journald.service: Got notification message from PID 861 (WATCHDOG=1...)
Feb 24 11:35:54 hanl systemd[1]: systemd-journald.service: got WATCHDOG=1
Feb 24 11:35:54 hanl systemd[1]: Got notification message for unit systemd-journald.service
Feb 24 11:35:54 hanl systemd[1]: systemd-journald.service: Got notification message from PID 861 (WATCHDOG=1...)
Feb 24 11:35:54 hanl systemd[1]: systemd-journald.service: got WATCHDOG=1
Feb 24 11:36:34 hanl systemd[1]: Got notification message for unit systemd-journald.service
Feb 24 11:36:34 hanl systemd[1]: systemd-journald.service: Got notification message from PID 861 (WATCHDOG=1...)
Feb 24 11:36:34 hanl systemd[1]: systemd-journald.service: got WATCHDOG=1
Feb 24 11:36:34 hanl systemd[1]: Got notification message for unit systemd-journald.service
Feb 24 11:36:34 hanl systemd[1]: systemd-journald.service: Got notification message from PID 861 (WATCHDOG=1...)
Feb 24 11:36:34 hanl systemd[1]: systemd-journald.service: got WATCHDOG=1

Here, the system hung and required a hard reboot.
Peter B
--- Comment #5 from Andrei Borzenkov
Adding those params causes boot to fail.
(1) This boots
kernel /vmlinuz root=/dev/VG0/ROOT lvmwait=/dev/VG0/ROOT ... noquiet net.ifnames=0 biosdevname=0 systemd.log_target=journal-or-kmsg systemd.log_level=debug loglevel=debug
OK, so attach this one at least.
(2) This fails to boot
kernel /vmlinuz root=/dev/VG0/ROOT lvmwait=/dev/VG0/ROOT ... noquiet net.ifnames=0 biosdevname=0 systemd.log_target=journal-or-kmsg systemd.log_level=debug loglevel=debug rd.udev.log-priority=debug udev.log-priority=debug
Probably too much udev output overloads the journal.
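If the theory is that the udev debug flood overwhelms journald (note the "Suppressed 7548 messages" line above), one hedged way to test it would be to relax journald's rate limiting for a boot. This fragment for /etc/systemd/journald.conf is a suggestion, not something tried in the thread; setting either value to 0 switches rate limiting off:

```ini
[Journal]
# Assumption: disabling rate limiting for a test boot lets the full
# udev debug output through instead of being suppressed.
RateLimitInterval=0
RateLimitBurst=0
```

This only changes what journald keeps; if the hang is caused by the sheer volume of messages rather than their suppression, it may make things worse, so it is only a diagnostic step.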
--- Comment #6 from han l2
OK, so attach this one at least.
I've been cutting out the cruft from the log, step by step, looking for what might matter. Here's the latest version of the filtered log. Look for the timing around 'my' lvm_local.service.
--- Comment #7 from Andrei Borzenkov
I've been cutting out the cruft from the log step by step looking for what might matter.
Please never trim logs you are asked to provide unless explicitly asked to do so. Also, your logs are ellipsized for some reason (they should not be when redirected to a file), which removes vital information; please use journalctl --full in this case.
Here's the latest version of the filtered log.
Look for the timing around 'my' lvm_local.service
Well, by your own words, when you use lvm_local.service the problem is not observed. I asked you to attach logs from when you can reproduce the problem ...
Bernhard Wiedemann
--- Comment #13 from Andrei Borzenkov
Andrei, this looks again like our infamous lvmetad issue.
Yes, it is possible, but the problem is I cannot reproduce it - I tried to create a similar setup and it worked. So it is impossible to say anything without having logs from a boot where it fails.
han l2