[Bug 772286] New: raid1 sometimes degraded at boot
https://bugzilla.novell.com/show_bug.cgi?id=772286
https://bugzilla.novell.com/show_bug.cgi?id=772286#c0

           Summary: raid1 sometimes degraded at boot
    Classification: openSUSE
           Product: openSUSE 12.2
           Version: Factory
          Platform: Other
        OS/Version: openSUSE 12.2
            Status: NEW
          Severity: Major
          Priority: P5 - None
         Component: Basesystem
        AssignedTo: nfbrown@suse.com
        ReportedBy: suse-beta@cboltz.de
         QAContact: qa-bugs@suse.de
          Found By: Beta-Customer
           Blocker: ---

Created an attachment (id=499302)
 --> (http://bugzilla.novell.com/attachment.cgi?id=499302)
/var/log/messages (bzip2-compressed)

(Factory last updated 2012-07-15 - I'm _not_ using the 12.2 RC1 repo)

Sometimes (maybe every one or two weeks) my laptop boots up with one or more md raid1 arrays degraded. Today was somewhat extreme - 4 raid arrays started up with only one of two disks/partitions.

I think this first happened when I upgraded from 12.1 to factory (around 12.2 beta1). 12.1 didn't have this problem IIRC.

The hardware is most probably OK - I don't see any log entries that point to disk failures, and I can re-add the missing disks/partitions with mdadm, after which they work without problems.
Today's /proc/mdstat after boot:

Personalities : [raid1] [raid0] [raid10] [raid6] [raid5] [raid4]
md1 : active raid1 sda3[0]
      133162712 blocks super 1.0 [2/1] [U_]
      bitmap: 146/254 pages [584KB], 256KB chunk

md4 : active raid1 sda7[0]
      1839396 blocks super 1.0 [2/1] [U_]
      bitmap: 1/8 pages [4KB], 128KB chunk

md3 : active raid1 sda6[0]
      10482308 blocks super 1.0 [2/1] [U_]
      bitmap: 3/160 pages [12KB], 32KB chunk

md0 : active raid1 sda2[0]
      200800 blocks super 1.0 [2/1] [U_]
      bitmap: 2/7 pages [8KB], 16KB chunk

md2 : active raid1 sda5[0] sdb5[1]
      10482308 blocks super 1.0 [2/2] [UU]
      bitmap: 0/160 pages [0KB], 32KB chunk

unused devices: <none>

To relate this to mountpoints:

/dev/md2 on / type ext3
/dev/md0 on /boot type ext3
/dev/md3 on /testroot type ext3
/dev/mapper/cr_home (= /dev/md1) on /home type ext3
/dev/md4 is encrypted (and currently disabled) swap

I'm attaching my /var/log/messages. Search for "Jul 19 19:14:48" to get today's set of 4 degraded arrays.

Note: this bug might be related to bug 760859

--
Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.
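As a rough aid for spotting this state after boot, the status column of /proc/mdstat can be scanned for a "_" (a missing member slot). The following is only a sketch: the function name list_degraded is made up for illustration, and it assumes the usual multi-line /proc/mdstat layout in which the [UU] / [U_] status appears on a line after the array name.

```shell
#!/bin/sh
# Hypothetical helper (not part of mdadm): print the name of every md array
# whose member-status field, e.g. [U_], contains "_", meaning at least one
# member is missing. Reads mdstat-format text from the file given as $1,
# defaulting to /proc/mdstat.
list_degraded() {
    awk '
        # Remember the array name from lines such as "md1 : active raid1 ...".
        /^md[^ ]* :/ { name = $1 }
        # The first status field after an array name decides its state.
        name != "" && match($0, /\[[U_]+\]/) {
            if (index(substr($0, RSTART, RLENGTH), "_") > 0) print name
            name = ""
        }
    ' "${1:-/proc/mdstat}"
}
```

Calling `list_degraded` with no argument reads the live /proc/mdstat; on the listing in this report it would be expected to name md1, md4, md3 and md0, but not md2.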
https://bugzilla.novell.com/show_bug.cgi?id=772286
https://bugzilla.novell.com/show_bug.cgi?id=772286#c1

Neil Brown <nfbrown@suse.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |NEEDINFO
       InfoProvider|                            |fcrozat@suse.com

--- Comment #1 from Neil Brown <nfbrown@suse.com> 2012-09-12 07:50:30 UTC ---
(sorry for the delay in getting to this).

I think I have discovered the problem, and the magic incantation which makes the problem disappear.

Please edit "/etc/init.d/boot.md", find a line near the top which reads:

# Required-Start: boot.udev boot.rootfsck

and add the string

udev-trigger

to the end (with a 'space' to separate it from the previous word).

boot.md uses "udevadm settle" to wait for all udev events to complete, but this requires that the events have been triggered.

In the old sysv-init world, the script "/etc/init.d/boot.udev" would start udev and trigger the events. As boot.md "Requires" "boot.udev", it wouldn't run until the triggering had happened.

In the new "systemd" world, there are two separate services, boot.udev and udev-trigger. boot.md needs to wait for both.

I suspect that we need to do a little more than just that change for a complete solution, as it will probably confuse sysvinit.

Frederic: you are our resident systemd person. What do you suggest here? Is just adding "udev-trigger" correct? Might it confuse sysvinit? Would we care? I just tried booting with sysvinit and it works, but if I run "insserv boot.md" it complains:

FATAL: service udev-trigger has to be enabled to use service boot.md as expected.
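For illustration only, the header edit described in this comment could be scripted roughly as below. The helper name add_required_start is invented for this sketch, and it is meant to be tried on a scratch copy of the init script, not on the live /etc/init.d/boot.md. It assumes the header contains a single "# Required-Start:" line, as quoted above.

```shell
#!/bin/sh
# Hypothetical helper: append a service name to the LSB "# Required-Start:"
# header line of the init script given as $1. $2 is the service to add.
add_required_start() {
    script=$1
    service=$2
    # Leave the file alone if the service is already listed on that line.
    if grep -Eq "^# Required-Start:.*[[:space:]]$service([[:space:]]|\$)" "$script"; then
        return 0
    fi
    tmp=$(mktemp) || return 1
    # Drop trailing blanks on the header line, then add " <service>".
    sed "s/^\(# Required-Start:.*[^[:space:]]\)[[:space:]]*\$/\1 $service/" \
        "$script" > "$tmp" && cat "$tmp" > "$script"
    status=$?
    rm -f "$tmp"
    return $status
}
```

A possible use would be `cp /etc/init.d/boot.md /tmp/boot.md && add_required_start /tmp/boot.md udev-trigger`, followed by a diff against the original before deciding whether to install the change.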
https://bugzilla.novell.com/show_bug.cgi?id=772286
https://bugzilla.novell.com/show_bug.cgi?id=772286#c2

Frederic Crozat <fcrozat@suse.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEEDINFO                    |NEW
                 CC|                            |fcrozat@suse.com
       InfoProvider|fcrozat@suse.com            |

--- Comment #2 from Frederic Crozat <fcrozat@suse.com> 2012-09-18 09:16:03 UTC ---
(In reply to comment #1)
> Frederic: you are our resident systemd person. What do you suggest here? Is
> just adding "udev-trigger" correct? Might it confuse sysvinit? Would we care?
> I just tried booting with sysvinit and it works, but if I run "insserv
> boot.md" it complains:
> FATAL: service udev-trigger has to be enabled to use service boot.md
> as expected.
One option could be to write an md.service file which would be used instead of boot.md (but that is not material for 12.2; better to do it for 12.3).

I would suggest using:

# Should-Start: udev-trigger

This way, insserv wouldn't complain, because the service doesn't exist under sysv, but it would be used under systemd (I didn't test this, but I think it should work properly).
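To make the difference from the earlier Required-Start proposal concrete, the relevant part of the boot.md header would then look roughly like this. This is a fragment, not the full header: only the two lines discussed in this thread are shown, the remaining LSB fields are omitted, and the real script may already list other services on a Should-Start line, in which case udev-trigger would be appended to that line instead.

```shell
### BEGIN INIT INFO
# (other LSB header fields omitted in this sketch)
# Hard dependency, unchanged from the line quoted in comment #1:
# Required-Start: boot.udev boot.rootfsck
# Soft dependency: only ordered after udev-trigger where that service exists.
# Should-Start: udev-trigger
### END INIT INFO
```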
https://bugzilla.novell.com/show_bug.cgi?id=772286
https://bugzilla.novell.com/show_bug.cgi?id=772286#c3

Neil Brown <nfbrown@suse.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |NEEDINFO
       InfoProvider|                            |maintenance@opensuse.org

--- Comment #3 from Neil Brown <nfbrown@suse.com> 2012-09-20 06:26:30 UTC ---
Yes, that works.

Maintenance: could we please schedule an update for mdadm in 12.2 for this bug and for bug #779087. Both cause problems when booting in different situations.
https://bugzilla.novell.com/show_bug.cgi?id=772286
https://bugzilla.novell.com/show_bug.cgi?id=772286#c4

Marcus Meissner <meissner@suse.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEEDINFO                    |NEW
                 CC|                            |meissner@suse.com
       InfoProvider|maintenance@opensuse.org    |

--- Comment #4 from Marcus Meissner <meissner@suse.com> 2012-09-21 12:17:59 UTC ---
we are not doing much scheduling.

feel free to submit an mdadm update as a maintenance request with the bugs you see needing to be fixed :)

osc branch -M openSUSE:12.2:Update mdadm
https://bugzilla.novell.com/show_bug.cgi?id=772286
https://bugzilla.novell.com/show_bug.cgi?id=772286#c5

--- Comment #5 from Bernhard Wiedemann <bwiedemann@suse.com> 2012-09-24 08:00:59 CEST ---
This is an autogenerated message for OBS integration:
This bug (772286) was mentioned in
https://build.opensuse.org/request/show/135563 Factory / mdadm
https://bugzilla.novell.com/show_bug.cgi?id=772286
https://bugzilla.novell.com/show_bug.cgi?id=772286#c6

Neil Brown <nfbrown@suse.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |ASSIGNED

--- Comment #6 from Neil Brown <nfbrown@suse.com> 2012-09-25 02:39:00 UTC ---
Thanks. I've submitted an update.
https://bugzilla.novell.com/show_bug.cgi?id=772286
https://bugzilla.novell.com/show_bug.cgi?id=772286#c7

Neil Brown <nfbrown@suse.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|ASSIGNED                    |RESOLVED
         Resolution|                            |FIXED

--- Comment #7 from Neil Brown <nfbrown@suse.com> 2012-09-26 02:08:24 UTC ---
... and closing bug.
https://bugzilla.novell.com/show_bug.cgi?id=772286
https://bugzilla.novell.com/show_bug.cgi?id=772286#c8

--- Comment #8 from Swamp Workflow Management <swamp@suse.de> 2012-10-04 20:08:55 UTC ---
openSUSE-RU-2012:1291-1: An update that has two recommended fixes can now be installed.

Category: recommended (low)
Bug References: 760859,772286
CVE References:
Sources used:
openSUSE 12.1 (src): mdadm-3.2.2-4.9.1