[Bug 1189776] New: system fails to boot because mkinitrd fails to include nvme driver
https://bugzilla.suse.com/show_bug.cgi?id=1189776 Bug ID: 1189776 Summary: system fails to boot because mkinitrd fails to include nvme driver Classification: openSUSE Product: openSUSE Tumbleweed Version: Current Hardware: x86-64 OS: Linux Status: NEW Severity: Normal Priority: P5 - None Component: Basesystem Assignee: screening-team-bugs@suse.de Reporter: ohering@suse.com QA Contact: qa-bugs@suse.de Found By: --- Blocker: --- orthos host virt158. Installing SLE15SP3 works fine, nvme.ko is included. Installing TMBLEWEED 20210823 fails, nmve.ko is not included in initrd. No obvious error from "mkinitrd", it does not report the to-be-included drivers anyway. -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1189776
Chenzi Cao
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c1
Antonio Feijoo
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c2
Antonio Feijoo
Specifically, it is failing the check_block_and_slaves_all function in dracut-functions.sh
I was wrong about this guess. Since a certain upstream commit [1], the function to detect a block device changed from: --> [ -e /sys/dev/block/$1 ] && return 0 --< to --> for _mod in $(get_dev_module /dev/block/$1); do --< and the get_dev_module relies on udevadm, which fails to detect the nvme driver in SLE-15 SP3: virt158:~ # ls -l /dev/block total 0 lrwxrwxrwx 1 root root 10 Oct 5 05:53 259:1 -> ../nvme0n1 lrwxrwxrwx 1 root root 12 Oct 5 05:53 259:2 -> ../nvme0n1p1 lrwxrwxrwx 1 root root 12 Oct 5 05:53 259:3 -> ../nvme0n1p2 lrwxrwxrwx 1 root root 12 Oct 5 05:53 259:4 -> ../nvme0n1p3 lrwxrwxrwx 1 root root 12 Oct 5 05:53 259:5 -> ../nvme0n1p4 lrwxrwxrwx 1 root root 12 Oct 5 05:53 259:6 -> ../nvme0n1p5 virt158:~ # udevadm info -a "/dev/block/259:5" | sed -n 's/\s*DRIVERS=="\(\S\+\)"/\1/p' virt158:~ # udevadm info -a "/dev/block/259:2" | sed -n 's/\s*DRIVERS=="\(\S\+\)"/\1/p' virt158:~ # udevadm info -a "/dev/block/259:3" | sed -n 's/\s*DRIVERS=="\(\S\+\)"/\1/p' virt158:~ # udevadm --version 246 But for me the check works in Tumbleweed and the initrd includes the nvme driver: dev@localhost:~/src/dracut/test> ls -l /dev/block lrwxrwxrwx 1 root root 10 Oct 6 09:21 259:0 -> ../nvme0n1 lrwxrwxrwx 1 root root 12 Oct 6 09:21 259:1 -> ../nvme0n1p1 dev@localhost:~/src/dracut/test> udevadm info -a "/dev/block/259:1" | sed -n 's/\s*DRIVERS=="\(\S\+\)"/\1/p' nvme dev@localhost:~/src/dracut/test> udevadm --version 249 Please, could you check if this still fails with the latest snapshot of Tumbleweed? [1] https://github.com/dracutdevs/dracut/commit/6375d5d504c5eac1cc5e7d7e26a8643b... -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c3
--- Comment #3 from Olaf Hering
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c4
Olaf Hering
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c5
--- Comment #5 from Antonio Feijoo
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c6
--- Comment #6 from Olaf Hering
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c7
--- Comment #7 from Olaf Hering
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c8
--- Comment #8 from Antonio Feijoo
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c9
--- Comment #9 from Olaf Hering
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c10
--- Comment #10 from Antonio Feijoo
weird, please force the required drivers into initrd in the TMBLWEED partition, then we can actually boot into this partition and see how a Tumbleweed system behaves.
Done. virt158:/root # lsinitrd /boot/initrd-5.14.6-2-default | grep ko | grep nvme -rw-r--r-- 1 root root 69248 Oct 4 02:57 usr/lib/modules/5.14.6-2-default/kernel/drivers/nvme/host/nvme-core.ko.xz -rw-r--r-- 1 root root 26364 Oct 4 02:57 usr/lib/modules/5.14.6-2-default/kernel/drivers/nvme/host/nvme.ko.xz -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c11
--- Comment #11 from Olaf Hering
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c12
Antonio Feijoo
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c13
--- Comment #13 from Antonio Feijoo
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c14
Franck Bui
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c15
--- Comment #15 from Olaf Hering
Can I have access to the TW system which is affected ?
See comment#0 about which system is affected. -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c16
--- Comment #16 from Franck Bui
See comment#0 about which system is affected.
The credentials are missing and I tried the usual "default" passwords for the orthos machines but with no luck. -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c17
--- Comment #17 from Olaf Hering
The credentials are missing and I tried the usual "default" passwords for the orthos machines but with no luck.
Indeed, it is root/suse -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c18
--- Comment #18 from Franck Bui
Indeed, it is root/suse
Still doesn't work when I tried to log in via ssh: $ ssh root@virt158.devlab.prv.suse.com Can you please make accesses through ssh work ? -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c19
--- Comment #19 from Olaf Hering
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c20
Franck Bui
Installing SLE15SP3 works fine, nvme.ko is included.
Could you show the output of 'ls -l /sys/dev/block/' for SP3 ? -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c21
Olaf Hering
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c22
--- Comment #22 from Olaf Hering
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c23
Franck Bui
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c24
--- Comment #24 from Jean Delvare
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c25
--- Comment #25 from Antonio Feijoo
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c26
Antonio Feijoo
Thanks for your feedback. Actually we've submitted a dracut PR which would solve this issue following symlinks.
The patch was accepted. The fix is in progress. -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c27
Antonio Feijoo
https://bugzilla.suse.com/show_bug.cgi?id=1189776
Frank Kr�ger
https://bugzilla.suse.com/show_bug.cgi?id=1189776
Martin Jambor
https://bugzilla.suse.com/show_bug.cgi?id=1189776
https://bugzilla.suse.com/show_bug.cgi?id=1189776#c29
Antonio Feijoo
https://bugzilla.suse.com/show_bug.cgi?id=1189776
Jeffrey Cheung
participants (1)
-
bugzilla_noreply@suse.com