[Bug 849752] New: 13.1RC2 Cannot boot from RAID1 volume(s)
https://bugzilla.novell.com/show_bug.cgi?id=849752 https://bugzilla.novell.com/show_bug.cgi?id=849752#c0 Summary: 13.1RC2 Cannot boot from RAID1 volume(s) Classification: openSUSE Product: openSUSE 13.1 Version: RC 1 Platform: i586 OS/Version: SUSE Other Status: NEW Severity: Major Priority: P5 - None Component: Installation AssignedTo: bnc-team-screening@forge.provo.novell.com ReportedBy: vmicho@gmail.com QAContact: jsrain@suse.com Found By: --- Blocker: --- Created an attachment (id=566785) --> (http://bugzilla.novell.com/attachment.cgi?id=566785) 12.3 partition setup User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:22.0) Gecko/20100101 Firefox/22.0 Image used: openSUSE-13.1-DVD-Build0084-i586.iso My partition setup is simple: 2 disks, 4 primary partitions on each disk, same setup on both disks, raid1 for each identical partition pair (see attached screen shot): 15gb /, 200mb /boot, 90gb /home, 40gb /mnt/data, no swap, no EFI. Setup + installation + 1st boot to system works OK. Next boot fails though. I've recreated same setup in 12.3 on zeroed disks and all works. The only obvious difference between 12.3 and 13.1rc2 is: - raids are named md124-md127 instead of md0-md3 - you can give the raid a name/label (I've used root, boot, home, data) -> see "KO clean partition setup*" for details. -> on rescue system, all raids look OK and are mountable without errors And of course the sstem doesn't boot -> see the "boot failure*" screen shots Note: Just a maybe a related observation: I've redone installation of rc2 3 or four times, usually on zeroed disks. I've reused the raid setup once from previous installation and (I do not remember precisely) I think after the 1st boot to system after installation, one partition was mounted twice (!). Two md devices somehow 'pointed' to the same partitions (e.g. md124 was mounted on /, md125 on /mnt/data, but in /mnt/data I saw the root filesystem again!) Reproducible: Always Steps to Reproduce: 1. PC with 2 zeroed HDDs (I zeroed few MB on the beginning with dd), no EFI 2. run installation dvd 3. create new custom partition setup as described in this bug 4. click bootloader settings, confirm (just in case, to allow/check redundancy) 5. complete installation, 1st reboot 6. 2nd installation phase and login into desktop 7. reboot and fail Actual Results: 7. No longer able to boot after 1st boot to system Expected Results: Should boot every time -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c1
--- Comment #1 from Michal Veselenyi
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c2
--- Comment #2 from Michal Veselenyi
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c3
--- Comment #3 from Michal Veselenyi
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c4
--- Comment #4 from Michal Veselenyi
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c
Xiyuan Liu
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c5
Joachim Wagner
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c6
--- Comment #6 from Joachim Wagner
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c7
Michal Veselenyi
The link "in this bug" links to show_bug.cgi?id=4 which gives a "Access Denied" message. Is the bug number incomplete or can you re-paste the configuration here? I installed a Raid 1 setup with RC2 last weekend without problems. I didn't open the boot loader page during installation though. Is step 4 essential to replicate the bug? Note that I'm just another user, not an OpenSUSE developer.
PS: the bug word in "in this bug" was translated into a random link (why id 4?) by issue tracker. I really meant to say only "in this bug". PS2: I'll not be able to test this too often as I already installed 12.3 on the PC and I'm using it. Unless it is for testing of something most probably fixed/working. I don't actually have any other PC with 2 hdds. PS3: step 4 is most probably not essential. It was surely a bug before in 12.3-beta or 12.2. PS4: dunno if 'mdadm --zero-superblock' would help as I recreated several times the partitions + raid drives (or what is it called) so the md should ignore any other data not on expected position. And anyway, the very 1st setup I've made was on disks with very random data (was 2 years somewhere on shelf, turned off). -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c
Michal Veselenyi
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c8
--- Comment #8 from Michal Veselenyi
I installed a Raid 1 setup with RC2 last weekend without problems. I didn't open the boot loader page during installation though. Is step 4 essential to replicate the bug? Note that I'm just another user, not an OpenSUSE developer.
Did you usa same / similar setup? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c9
Neil Brown
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c10
--- Comment #10 from Joachim Wagner
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c11
--- Comment #11 from Michal Veselenyi
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c12
Arvin Schnell
Arvin: what do you know about the use of "/dev/md/data" in /etc/fstab?
This was added on purpose for fate #315590. It's reasonable that the user gives a name to the RAID to actually use the name.
Would it be possible to add "--homehost=any" to the "mdadm" command when creating arrays?
That shouldn't be a problem.
It would be best to use the final host name there (rather than 'any'), but 'any' is better than "linux".
The final name is likely not known during installation. E.g. the system might use DHCP during the first steps of the installation and a proper network setup is done later (that's the current workflow of YaST). -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c13
--- Comment #13 from Joachim Wagner
Would it be possible to add "--homehost=any" to the "mdadm" command when creating arrays?
That shouldn't be a problem. And this would be sufficient for feature request https://features.opensuse.org/315590
It would be best to use the final host name there (rather than 'any'), but 'any' is better than "linux".
The final name is likely not known during installation. E.g. the system might use DHCP during the first steps of the installation and a proper network setup is done later (that's the current workflow of YaST). It's a bit more complicated as there is a temporary hostname during startup ("linux-" plus some random characters on my systems) that is overwritten by DHCP either during startup (in a "traditional" network setup) or when triggered by the network manager. In my tests, it is this temporary hostname that is used during md assembly, not the DHCP-supplied hostname. You can see and configure this temporary hostname in YaST - Network Devices - Network Settings - Hostname/DNS - Hostname (assuming a traditional network setup). However, before using this hostname (instead of "any") one would have to check carefully that DHCP is guaranteed to come after md assembly.
Furthermore, users who want the hostname to be stored in the md device (and assembly to be restricted to the respective hostname) probably want a more meaningful hostname stored there than the hostname generated by the installer. (Note that this isn't part of the above feature request on openfate.) Imho, the following is needed to implement this additional feature: (1) check that DHCP always comes later, (2) the installer should ask for the hostname, (3) the md setup should have a checkbox whether to restrict assembly to this hostname (expert option), (4) YaST needs to update the hostname in the md devices and any place it is used if the pre-dhcp startup hostname is changed (there's at least one other place as comment 10 shows that it is not enough to change the md hostname). -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c14
--- Comment #14 from Joachim Wagner
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c
Joachim Wagner
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c15
Neil Brown
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c16
Arvin Schnell
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c17
--- Comment #17 from Michal Veselenyi
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c18
--- Comment #18 from Joachim Wagner
So what's the current workaround? If you want to use named md devices, try Neil's workaround (comment 15). However, it's always a good idea to read the whole discussion, familiarise yourself with the "mdadm" command and the its options mentioned here (--name, --update, --assemble, --examine, --scan, --detail) and be prepared that you may have to experiment a little bit.
Is it sufficient to leave the name blank Yes.
-- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c19
--- Comment #19 from Joachim Wagner
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c20
Günther J. Niederwimmer
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c21
--- Comment #21 from Joachim Wagner
Why is it not possible to insert the Naming from the Bios "Volume1 / Vol1" like in other Distribution ? I added your feature request to openfate as case #316749.
Please vote for it and the related request #315590. You can login on openfate with the same details you use for this page. Note that this is a slightly different issue: You are talking about RAIDs created in the Bios before installation. This bug report is about RAIDs created by YaST during installation. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c22
Christian Treczoks
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c23
--- Comment #23 from Joachim Wagner
BTW: Why is the host part of the device name "linux"? The machine got its name "Bowman" via DHCP from the very beginning [...] See comment 13.
For the purpose of this bug report, I think "--homehost=any" is the way forward. I added a summary of the ideas and comments around the RAID hostname, initial hostname and DHCP to the feature request openfate #315590. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c24
Arvin Schnell
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c25
--- Comment #25 from Arvin Schnell
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c26
Arvin Schnell
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c27
Arvin Schnell
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c28
--- Comment #28 from Joachim Wagner
Fixed for openSUSE 13.2.
Bug 867514 comment 3 mentions a "prerelease of openSUSE 13.2". Is this an installation DVD? Where is it available? Googling it doesn't show any obvious source. I read that the first Milestone is expected around May, and on software.opensuse.org it says "We currently don't have a Factory Snapshot that is more recent than our last openSUSE release." - JJ -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c29
--- Comment #29 from Andrey Borzenkov
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c30
--- Comment #30 from Arvin Schnell
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c31
Andrey Borzenkov
The problem with changing IP address should be fixed by using --homehost any when creating mdadm RAIDs.
That's not a solution. It makes it impossible to have more than one installation on the same system using the same names. MDs are host-specific for a reason. The correct solution is to either use mdadm.conf to force names for the right UUIDs or (in addition) put correct host name; it should be possible with single stage installer which is planned (not sure whether this is already implemented).
The problem with not generating a mdadm.conf is fixed.
There is no /etc/mdadm.conf present after installation. I can provide whatever logs are needed from this VM. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c32
Josef Reidinger
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c33
Arvin Schnell
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c34
Andrey Borzenkov
For the missing mdadm.conf I need log files from the installation.
y2logs attached.
The --homehost=any usage was added after recommendations by our mdadm expert Neil, see comment #15. I you do not agree here I suggest to create a feature request since I cannot change such things all the time.
Neil suggested using *either* --homehost=any *or* /etc/mdadm.conf. I do not know what was implemented, but in any case installer did *not* use --homehost=any because all arrays had as home host temporary IP used during installation. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c35
Arvin Schnell
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c36
--- Comment #36 from Joachim Wagner
create a feature request
Note existence of openfate #315590 https://features.opensuse.org/315590 and openfate #316749 https://features.opensuse.org/316749 -JJ -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c37
--- Comment #37 from Arvin Schnell
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c38
--- Comment #38 from Andrey Borzenkov
Somehow you did not correctly use the NET ISO. Yes, I had to manually give it factory as installation source.
(In reply to comment #37)
With openSUSE-Factory-DVD-x86_64-Build0108-Media.iso (likely only available internal) /etc/mdadm.conf is written.
Did it contain all MD arrays? In my case it had only swap, although three were created - root, swap and home. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c39
--- Comment #39 from Arvin Schnell
https://bugzilla.novell.com/show_bug.cgi?id=849752
https://bugzilla.novell.com/show_bug.cgi?id=849752#c40
--- Comment #40 from Andrey Borzenkov
Yes, in one test I had three named BOOT, ROOT and SWAP.
I retried once more using openSUSE-Factory-NET-i586-Build0108-Media.iso and no /etc/mdadm.conf is created. I still get correct names due to --homehost=any. I did check that I used libstorage 2.25.9 ... Do you need yast logs? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@novell.com