[Bug 756201] New: boot fails due to MD devices not starting correctly
https://bugzilla.novell.com/show_bug.cgi?id=756201

https://bugzilla.novell.com/show_bug.cgi?id=756201#c0

Summary: boot fails due to MD devices not starting correctly
Classification: openSUSE
Product: openSUSE 12.1
Version: Final
Platform: x86-64
OS/Version: openSUSE 12.1
Status: NEW
Severity: Critical
Priority: P5 - None
Component: Basesystem
AssignedTo: bnc-team-screening@forge.provo.novell.com
ReportedBy: suse@randycushman.com
QAContact: qa-bugs@suse.de
Found By: ---
Blocker: ---

User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:11.0) Gecko/20100101 Firefox/11.0

Boot fails due to inaccessible partitions / MD devices not started. Problem
started after update from 11.4 to 12.1 using zypper dup.

I suspect the new initrd is autodetecting raidsets, finding signatures for
raidsets that no longer exist. This seems to vary from one boot attempt to the
next.

In the most straightforward example, the boot started a nonexistent md8, using
a partition that is now part of md5. md5 was left inactive, preventing a
successful boot.

from initrd:

# cat etc/mdadm.conf
AUTO -all
ARRAY /dev/md0 metadata=0.90 UUID=5aa3525a:7ad6a35f:bc99ad79:a125caa7
ARRAY /dev/md1 metadata=0.90 UUID=828df197:ed191435:9316bb71:60b936be
ARRAY /dev/md2 metadata=0.90 UUID=b57f8ec2:a0f97879:098bac5e:769b2d5c
ARRAY /dev/md3 metadata=0.90 UUID=c1738aec:4471396a:1620bc28:6a0c8b04

from filesystem:

# cat etc/mdadm.conf
ARRAY /dev/md0 level=raid1 UUID=5aa3525a:7ad6a35f:bc99ad79:a125caa7
ARRAY /dev/md1 level=raid1 UUID=828df197:ed191435:9316bb71:60b936be
ARRAY /dev/md2 level=raid1 UUID=b57f8ec2:a0f97879:098bac5e:769b2d5c
ARRAY /dev/md3 level=raid1 UUID=c1738aec:4471396a:1620bc28:6a0c8b04
ARRAY /dev/md4 level=raid1 UUID=efd63b6f:eda21963:ef9ebeac:305c1bed
ARRAY /dev/md5 level=raid1 UUID=c0493179:3706fe69:ef9ebeac:305c1bed
ARRAY /dev/md6 level=raid1 UUID=61cf3e35:6a4d87df:ef9ebeac:305c1bed
ARRAY /dev/md7 level=raid1 UUID=9c4abb01:1a7af84d:ef9ebeac:305c1bed

# mdadm -Ds
ARRAY /dev/md0 metadata=0.90 UUID=5aa3525a:7ad6a35f:bc99ad79:a125caa7
ARRAY /dev/md1 metadata=0.90 UUID=828df197:ed191435:9316bb71:60b936be
ARRAY /dev/md2 metadata=0.90 UUID=b57f8ec2:a0f97879:098bac5e:769b2d5c
ARRAY /dev/md3 metadata=0.90 UUID=c1738aec:4471396a:1620bc28:6a0c8b04
ARRAY /dev/md4 metadata=0.90 UUID=efd63b6f:eda21963:ef9ebeac:305c1bed
ARRAY /dev/md5 metadata=0.90 UUID=c0493179:3706fe69:ef9ebeac:305c1bed
ARRAY /dev/md6 metadata=0.90 UUID=61cf3e35:6a4d87df:ef9ebeac:305c1bed
ARRAY /dev/md7 metadata=0.90 UUID=9c4abb01:1a7af84d:ef9ebeac:305c1bed

I tried removing "AUTO -all" from the initrd version of mdadm.conf--this made
no difference.

I tried booting using sysvinit--the boot did not find the lvm root partition.

Reproducible: Always

Steps to Reproduce:
1. power on PC

Actual Results:
(from memory) type root password to enter single-user mode

Expected Results:
normal boot

- System is currently dual-boot between openSUSE 11.4 and openSUSE 12.1 using
  GRUB
- Separate root partitions are ext3 on RAID1
- Swap partition is on lvm on RAID1
- All other partitions, including separate root and shared home partitions,
  are ext4 on lvm on RAID1

-- 
Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.
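
The two mdadm.conf listings in the report differ in which arrays they name. As a rough illustration of how that difference could be checked mechanically, here is a minimal shell sketch. The function name missing_arrays and its two file arguments are hypothetical, not part of mdadm or of the report; it only compares UUID= fields on ARRAY lines and makes no claim about what the initrd actually does at boot.

```shell
#!/bin/sh
# List ARRAY entries that appear in one mdadm.conf but not in another.
# Usage: missing_arrays FULL_CONF INITRD_CONF
# (hypothetical helper; both arguments are paths to mdadm.conf-style files)
missing_arrays() {
    full_conf=$1
    initrd_conf=$2

    # Print "device uuid" for every ARRAY line of the full config ...
    awk '$1 == "ARRAY" {
             for (i = 3; i <= NF; i++)
                 if ($i ~ /^UUID=/) print $2, substr($i, 6)
         }' "$full_conf" |
    # ... and keep only the entries whose UUID the initrd config lacks.
    while read -r dev uuid; do
        grep -qF "UUID=$uuid" "$initrd_conf" || echo "$dev $uuid"
    done
}

# Run directly when called with two file names.
if [ "$#" -eq 2 ]; then
    missing_arrays "$1" "$2"
fi
```

Given the filesystem copy as the first argument and the initrd copy as the second, this would list md4 through md7 for the files quoted above, since the initrd copy only names md0 through md3.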

https://bugzilla.novell.com/show_bug.cgi?id=756201

https://bugzilla.novell.com/show_bug.cgi?id=756201#c

kk zhang <kkzhang@novell.com> changed:

What       |Removed                                   |Added
----------------------------------------------------------------------------
CC         |                                          |kkzhang@novell.com
AssignedTo |bnc-team-screening@forge.provo.novell.com |nld10-bugs-qa@forge.provo.novell.com

https://bugzilla.novell.com/show_bug.cgi?id=756201

https://bugzilla.novell.com/show_bug.cgi?id=756201#c1

--- Comment #1 from Randy Cushman <suse@randycushman.com> 2012-04-12 00:29:16 UTC ---
I tried replacing the mdadm.conf in the initrd with the mdadm.conf from the
root filesystem. This didn't help.

I tried adding the kernel parameter "raid=noautodetect". This didn't help.

https://bugzilla.novell.com/show_bug.cgi?id=756201

https://bugzilla.novell.com/show_bug.cgi?id=756201#c

Andreas Jaeger <aj@suse.com> changed:

What       |Removed                              |Added
----------------------------------------------------------------------------
AssignedTo |nld10-bugs-qa@forge.provo.novell.com |nfbrown@suse.com

https://bugzilla.novell.com/show_bug.cgi?id=756201

https://bugzilla.novell.com/show_bug.cgi?id=756201#c2

Neil Brown <nfbrown@suse.com> changed:

What         |Removed |Added
----------------------------------------------------------------------------
Status       |NEW     |NEEDINFO
InfoProvider |        |suse@randycushman.com

--- Comment #2 from Neil Brown <nfbrown@suse.com> 2012-07-09 23:12:24 UTC ---
Hi,
sorry that there has been no response for so long - this was only recently
assigned to me (while I was on leave!).

Some changes have been made in the 12.1 mdadm package since April that could
relate to this.
Could you check with the latest mdadm in 12.1 and see if it makes a
difference?

If boot still fails I'll need some more details. If the system is at all
usable when it has failed to boot properly, then gathering "cat /proc/mdstat"
and "mdadm -Dsv" at that point would be helpful.
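
For anyone following along, the two outputs requested above could be captured into a single file from the single-user shell, roughly as sketched here. The function name collect_md_state and the output path are illustrative assumptions; the only commands taken from the comment are "cat /proc/mdstat" and "mdadm -Dsv". The output file has to live somewhere writable, which after a failed boot may mean /tmp or a USB stick rather than the root filesystem.

```shell
#!/bin/sh
# Save the RAID state requested in comment #2 into one file.
# Usage: collect_md_state OUTPUT_FILE   (hypothetical helper name)
collect_md_state() {
    out=$1
    {
        echo "== /proc/mdstat =="
        if [ -r /proc/mdstat ]; then
            cat /proc/mdstat
        else
            echo "(/proc/mdstat not readable)"
        fi

        echo "== mdadm -Dsv =="
        if command -v mdadm >/dev/null 2>&1; then
            # Usually needs root; record the failure instead of aborting.
            mdadm -Dsv 2>&1 || echo "(mdadm -Dsv failed)"
        else
            echo "(mdadm not found)"
        fi
    } > "$out"
}

# Run directly when called with an output file name.
if [ "$#" -eq 1 ]; then
    collect_md_state "$1"
fi
```

Called as, say, collect_md_state /tmp/md-state.txt, it leaves one file that can be attached to the bug.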

https://bugzilla.novell.com/show_bug.cgi?id=756201

https://bugzilla.novell.com/show_bug.cgi?id=756201#c3

--- Comment #3 from Randy Cushman <suse@randycushman.com> 2012-07-26 00:07:46 UTC ---
I'm not clear on how to update the mdadm package on a root partition and
initrd file for a partition that will not boot. Do I need to copy files to the
target partition manually, or is there a shortcut, perhaps involving chroot?

https://bugzilla.novell.com/show_bug.cgi?id=756201

https://bugzilla.novell.com/show_bug.cgi?id=756201#c4

Randy Cushman <suse@randycushman.com> changed:

What         |Removed               |Added
----------------------------------------------------------------------------
Status       |NEEDINFO              |NEW
InfoProvider |suse@randycushman.com |

--- Comment #4 from Randy Cushman <suse@randycushman.com> 2012-08-19 15:41:29 UTC ---
Changing status because I need instruction on how to try the suggestions
provided by Assignee.

https://bugzilla.novell.com/show_bug.cgi?id=756201

https://bugzilla.novell.com/show_bug.cgi?id=756201#c5

Neil Brown <nfbrown@suse.com> changed:

What         |Removed |Added
----------------------------------------------------------------------------
Status       |NEW     |NEEDINFO
InfoProvider |        |suse@randycushman.com

--- Comment #5 from Neil Brown <nfbrown@suse.com> 2012-08-20 08:10:28 UTC ---
Sorry, I don't really know either. I'd probably try something with chroot.

However, if you are comfortable rebuilding the initrd then it might be easier
to guide you through that.
Just remove the file "/lib/udev/rules.d/64-md-raid.rules" from the initrd.
That should get you booting.
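
One way the removal suggested above might be done by hand is sketched below. It assumes the initrd is a gzip-compressed cpio archive in "newc" format, which the thread does not confirm for this system. The function name strip_from_initrd and the image names in the usage note are placeholders. It writes a new image rather than overwriting the original, so the old one stays available as a fallback.

```shell
#!/bin/sh
# Repack a gzip-compressed cpio initrd, leaving out one file.
# Usage: strip_from_initrd SRC_IMAGE DST_IMAGE PATH_INSIDE_IMAGE
# (hypothetical helper; assumes a gzip + newc cpio image)
strip_from_initrd() {
    src=$1
    dst=$2
    victim=${3#/}                 # path relative to the initrd root
    work=$(mktemp -d) || return 1

    # Unpack into a scratch directory. On a real initrd, run as root so
    # that device nodes and file ownership are preserved.
    gzip -dc "$src" | ( cd "$work" && cpio -idm --quiet ) ||
        { rm -rf "$work"; return 1; }

    # Drop the unwanted file; everything else is left untouched.
    rm -f "$work/$victim"

    # Repack in the "newc" cpio format used for initramfs images.
    ( cd "$work" && find . | cpio -o -H newc --quiet | gzip -9 ) > "$dst"
    status=$?
    rm -rf "$work"
    return "$status"
}

# Run directly when called with three arguments.
if [ "$#" -eq 3 ]; then
    strip_from_initrd "$1" "$2" "$3"
fi
```

A call might look like strip_from_initrd /boot/initrd-OLD /boot/initrd-NEW /lib/udev/rules.d/64-md-raid.rules, where initrd-OLD and initrd-NEW stand in for the real image names. The new image would then need its own boot loader entry.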

https://bugzilla.novell.com/show_bug.cgi?id=756201

https://bugzilla.novell.com/show_bug.cgi?id=756201#c6

--- Comment #6 from Neil Brown <nfbrown@suse.com> 2012-09-17 00:31:44 UTC ---
Ping. Have you made progress?

https://bugzilla.novell.com/show_bug.cgi?id=756201

https://bugzilla.novell.com/show_bug.cgi?id=756201#c7

Neil Brown <nfbrown@suse.com> changed:

What         |Removed               |Added
----------------------------------------------------------------------------
Status       |NEEDINFO              |RESOLVED
InfoProvider |suse@randycushman.com |
Resolution   |                      |NORESPONSE

--- Comment #7 from Neil Brown <nfbrown@suse.com> 2012-10-18 03:13:44 UTC ---
No response for 1 month so closing.
If this is still an issue please re-open.