[Bug 227377] New: Can't boot from installed 10.2 final when using software raid (no operatings system found)
https://bugzilla.novell.com/show_bug.cgi?id=227377

Summary: Can't boot from installed 10.2 final when using software raid (no operatings system found)
Product: openSUSE 10.2
Version: Final
Platform: Other
OS/Version: Other
Status: NEW
Severity: Critical
Priority: P5 - None
Component: Installation
AssignedTo: bnc-team-screening@forge.provo.novell.com
ReportedBy: franz.x.maier@aon.at
QAContact: jsrain@novell.com

Problem: When I install 10.2 final with software RAID for /, swap and /home (3 partitions with RAID 1 on two physical disks), the system doesn't boot after install (first boot) and the hardware tells me that there is no operating system present. This configuration worked OK in 10.2 RC1, but no longer works in 10.2 final.

Regards
Franz X. Maier

--
Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=227377

franz.x.maier@aon.at changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|                            |FIXED

------- Comment #1 from franz.x.maier@aon.at 2006-12-10 03:23 MST -------
In the meantime I solved the problem: I partitioned like the automatic partitioning from another PC with a hardware raid (fake raid), namely /boot, swap, / and /home, and it works! You should at least document this special requirement, as it is quite new.

Regards
Franz X. Maier
https://bugzilla.novell.com/show_bug.cgi?id=227377

franz.x.maier@aon.at changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|RESOLVED                    |CLOSED
https://bugzilla.novell.com/show_bug.cgi?id=227377

andreas.pfaller@gmail.com changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |andreas.pfaller@gmail.com
             Status|CLOSED                      |REOPENED
         Resolution|FIXED                       |

------- Comment #2 from andreas.pfaller@gmail.com 2006-12-10 18:39 MST -------
Why close this? The problem exists. I have just installed 10.2 with software raid (2 identically partitioned SATA disks with, among others, /, /boot and /var configured as raid 1) and the system does not boot after the first installation step (after the first reboot after installation). I left the bootloader configuration unchanged from the one suggested by the installer.

I have booted with the rescue system and the content of all relevant partitions looks OK and the raid partitions are currently syncing. This is a NEW installation where all disks were completely repartitioned. So it looks as if the only problem is the bootloader installation. Currently I am trying to figure out how to install grub manually but have not yet been successful.

/etc/grub.conf contains:

    setup --stage2=/boot/grub/stage2 (hd0,0) (hd0,0)
    setup --stage2=/boot/grub/stage2 (hd1,0) (hd0,0)
    quit

and /boot/device.map:

    (hd0) /dev/sda
    (hd1) /dev/sdb

Both look OK to me.
https://bugzilla.novell.com/show_bug.cgi?id=227377

------- Comment #3 from andreas.pfaller@gmail.com 2006-12-10 20:26 MST -------
Some additional info: After messing with the recovery options of the Installation DVD, none of which worked (e.g. the automatic repair options seem to be totally unaware of the existence of raid, and the "bootloader repair" simply failed with an error message), I booted again with the rescue system and simply did

    grub
    root (hd0,0)
    setup
    quit

which upon reboot at least showed the grub menu with the newly installed system. However, trying to boot "Suse 10.2" dropped me into a minimal system (I assume the initrd), which failed to assemble the arrays.
https://bugzilla.novell.com/show_bug.cgi?id=227377

------- Comment #4 from andreas.pfaller@gmail.com 2006-12-11 10:10 MST -------
Created an attachment (id=109185)
 --> (https://bugzilla.novell.com/attachment.cgi?id=109185&action=view)
YaST2 logs

Obtained via rescue system from /var/log on HD of installed system.
https://bugzilla.novell.com/show_bug.cgi?id=227377

------- Comment #5 from andreas.pfaller@gmail.com 2006-12-11 10:12 MST -------
Created an attachment (id=109186)
 --> (https://bugzilla.novell.com/attachment.cgi?id=109186&action=view)
mdadm.conf

Obtained from /etc on HD of installed system. The mdadm.conf included in the initrd is identical.
https://bugzilla.novell.com/show_bug.cgi?id=227377

------- Comment #6 from andreas.pfaller@gmail.com 2006-12-11 10:13 MST -------
Created an attachment (id=109187)
 --> (https://bugzilla.novell.com/attachment.cgi?id=109187&action=view)
fstab

Obtained from /etc on HD of installed system.
https://bugzilla.novell.com/show_bug.cgi?id=227377

------- Comment #7 from andreas.pfaller@gmail.com 2006-12-11 10:15 MST -------
Created an attachment (id=109188)
 --> (https://bugzilla.novell.com/attachment.cgi?id=109188&action=view)
fdisk -l

"fdisk -l" output. Obtained while running rescue system.
https://bugzilla.novell.com/show_bug.cgi?id=227377

------- Comment #8 from andreas.pfaller@gmail.com 2006-12-11 10:16 MST -------
Created an attachment (id=109189)
 --> (https://bugzilla.novell.com/attachment.cgi?id=109189&action=view)
mdadm output

Obtained by running "mdadm --examine /dev/sda*" while running rescue system.
https://bugzilla.novell.com/show_bug.cgi?id=227377

------- Comment #9 from andreas.pfaller@gmail.com 2006-12-11 10:20 MST -------
Created an attachment (id=109190)
 --> (https://bugzilla.novell.com/attachment.cgi?id=109190&action=view)
serial console boot log

Serial console output while trying to boot the system after fixing the grub installation as described in comment #3.

I am running out of ideas now ;) Any hint is appreciated.
https://bugzilla.novell.com/show_bug.cgi?id=227377

------- Comment #10 from andreas.pfaller@gmail.com 2006-12-11 17:16 MST -------
Since I was running out of ideas I opted for a complete reinstall. The differences from my previous attempt are:

- No separate /boot partition
- Only one primary partition (sd[ab]1 - not used for install). The previous attempt had a primary partition (sd[ab]3) physically located behind the extended partition - this worked without problems on the same hardware with 10.1. The reason for this layout was to ease replacement of a raid component drive if one of the drives fails and the replacement has a slightly smaller capacity.

This new installation also did not install a working grub configuration. However, after installing grub with

    root (hd0,4)
    setup (hd0)

from the rescue system, the system booted, unlike my previous attempt where the initrd failed to assemble the root fs (see boot log of comment #9).
https://bugzilla.novell.com/show_bug.cgi?id=227377

chrubis@novell.com changed:

           What    |Removed                                    |Added
----------------------------------------------------------------------------
         AssignedTo|bnc-team-screening@forge.provo.novell.com  |yast2-maintainers@suse.de
             Status|REOPENED                                   |NEW
https://bugzilla.novell.com/show_bug.cgi?id=227377

jsuchome@novell.com changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
         AssignedTo|yast2-maintainers@suse.de   |fehr@novell.com

------- Comment #11 from jsuchome@novell.com 2006-12-18 02:03 MST -------
Thomas, isn't this a partitioner problem?
https://bugzilla.novell.com/show_bug.cgi?id=227377

------- Comment #12 from fehr@novell.com 2006-12-18 02:11 MST -------
The partitioner proposal certainly suggests a separate /boot partition if a fake raid disk is used. I am not sure whether there is a warning if the user wants to be extra smart and removes this /boot partition, or partitions manually without a separate /boot. Will check this.
https://bugzilla.novell.com/show_bug.cgi?id=227377

------- Comment #13 from andreas.pfaller@gmail.com 2006-12-18 07:56 MST -------
Thomas, I think you misread my description. The system WITH a separate /boot was the one that failed in the initrd. My first try had a boot partition (sda1, sdb1); / was the first logical partition inside the extended partition (sda5, sdb5).

In both my tries the partition layout was created completely manually by me with the installation system partitioner. Both attempts also failed to create a working grub configuration (configuration left unchanged from the default proposal of the installer).
https://bugzilla.novell.com/show_bug.cgi?id=227377

fehr@novell.com changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |fehr@novell.com
         AssignedTo|fehr@novell.com             |odabrunz@novell.com

------- Comment #14 from fehr@novell.com 2006-12-19 01:37 MST -------
The attached y2log file contains no dmraid setup at all but a setup using Software Raid (/dev/md*). Sorry that I assumed it was fake raid (dmraid). I was confused by comment #1.

The problem seems to be in grub.conf and the bootloader setup. This is outside of yast2-storage, so I am reassigning this to the bootloader maintainer.
https://bugzilla.novell.com/show_bug.cgi?id=227377

------- Comment #15 from stefan.fent@novell.com 2006-12-20 01:39 MST -------
As the serial output looks OK (the kernel command line is correct) I doubt this is a bootloader problem. For some reason, /dev/md1 seems to be broken.

/boot being a software RAID is problematic, though. The stage2 bootloader needs to be on the same physical blocks on both disks, which can't be guaranteed with YaST partitioning the disks.

/etc/grub.conf looks broken to me, it should be:

    setup --stage2=/boot/grub/stage2 (hd0,0) (hd0,0)
    setup --stage2=/boot/grub/stage2 (hd1,0) (hd1,0)
                                                ^
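[Editorial sketch, not part of the original thread.] The correction proposed in comment #15 amounts to requiring that both device arguments of each setup line be identical. A minimal shell sketch of such a check follows. The function name check_grub_conf and the temporary file are hypothetical, and the "both arguments equal" rule is only Stefan's suggestion for this particular layout, not a general GRUB requirement.

```shell
#!/bin/sh
# Flag "setup" lines whose last two arguments (install device and
# image device) differ. Returns non-zero if any such line is found.
# check_grub_conf is a hypothetical helper name.
check_grub_conf() {
    awk '$1 == "setup" && $(NF-1) != $NF {
             print "mismatch: " $0
             bad = 1
         }
         END { exit bad + 0 }' "$1"
}

# Demo on a copy of the grub.conf quoted in comment #2.
tmp=$(mktemp)
cat > "$tmp" <<'EOF'
setup --stage2=/boot/grub/stage2 (hd0,0) (hd0,0)
setup --stage2=/boot/grub/stage2 (hd1,0) (hd0,0)
quit
EOF
check_grub_conf "$tmp" || echo "grub.conf needs attention"
rm -f "$tmp"
```

The check only reads the file; it does not decide which of the two arguments is the wrong one.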
https://bugzilla.novell.com/show_bug.cgi?id=227377

------- Comment #16 from andreas.pfaller@gmail.com 2006-12-20 07:03 MST -------
I am not really familiar with grub but shouldn't the correct setup for raid-1 be:

    setup --stage2=/boot/grub/stage2 (hd0,0) (hd0,0)
    setup --stage2=/boot/grub/stage2 (hd0,0) (hd0,0)
https://bugzilla.novell.com/show_bug.cgi?id=227377

------- Comment #17 from andreas.pfaller@gmail.com 2006-12-20 07:26 MST -------
Sorry, somehow Firefox sent comment #16 before I was finished.

I am not really familiar with grub but shouldn't the correct setup for raid-1 be:

    setup --stage2=/boot/grub/stage2 (hd0,0) (hd0,0)
    device (hd0) /dev/sdb
    setup --stage2=/boot/grub/stage2 (hd0,0) (hd0,0)

I have not tested it yet but I think the above commands will ensure that sdb will be properly booted if sda completely fails or is removed, as it will then be the new "0" drive for the BIOS. Different physical blocks should not really happen for stage1 because raid-1 should guarantee that they are located in identical blocks.
https://bugzilla.novell.com/show_bug.cgi?id=227377

------- Comment #18 from stefan.fent@novell.com 2006-12-21 01:51 MST -------
No, as then it would only write to the first disk, sda in this case. sdb will be left untouched, so if sda fails, there is no grub at all on the new sda (old sdb).

The problem here is not stage1 but stage2, which is located in the filesystem, and stage2 stores the blocks where some part of it is located. So if you mirror it on different partitions, say sda1 and sdb3 (so the blocks are certainly different), and stage2 from sda1 gets mirrored to sdb3, this data is lost for sdb3. That means: the stage2 files must be different if they are stored on different blocks. (That difference will be "corrected" by the RAID, which leads to an unbootable system.)

Stage1 is located either in the MBR or in the boot record of the partition. Neither is touched by the filesystem (except xfs), so neither is changed by the RAID.
https://bugzilla.novell.com/show_bug.cgi?id=227377

------- Comment #19 from andreas.pfaller@gmail.com 2006-12-21 12:59 MST -------
Stefan, you got me confused: I thought the "device (hd0) /dev/sdb" command essentially makes grub use /dev/sdb whenever the grub "hd0" specification is used. So the 2nd setup command should install grub on /dev/sdb. At least for my current setup (see below) this works - I have verified that the system is bootable from both disks by temporarily removing /dev/sda and, for the next try, /dev/sdb. I even zeroed the relevant disk sectors and they were restored on both disks by grub (the stage1.5 sectors, see below).

Regarding the modification of stage2: Thanks for the hint Stefan, I was not aware of this problem. (Note: I meant stage2 in the last paragraph of my comment #17.)

Currently I have installed grub with

    device (hd0) /dev/sda
    root (hd0,4)
    setup (hd0)

and

    device (hd0) /dev/sdb
    root (hd0,4)
    setup (hd0)

and as far as I interpret grub's output this should be safe, as this installs stage1.5 in the physical sectors (1-15), i.e. outside of any mirrored partition. As stage1.5 understands ext2, it should have no problems finding stage2 even if the sectors occupied by stage2 change or are not identical because of different physical positions of the raid component partitions. And as I said above, I have verified that it works as expected on my system.

Shouldn't YaST enforce something like this automatically as soon as /boot gets installed inside a raid-1 partition?

One thing however still makes me nervous: does grub's e2fs_stage1_5 understand all current ext2 features? dir_index may be a problem.

A further suggestion: All documentation I found regarding the combination of raid and grub while googling is highly contradictory. It would be nice if the SuSE manual provided some details about this quite common scenario.

This leaves the problem with the initrd failing to assemble the root device. In hindsight I should have opened a separate bugzilla entry for this.
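[Editorial sketch, not part of the original thread.] The per-disk installation described in comment #19 can be written as a small generator that prints the GRUB legacy commands for each RAID-1 member. This is only a sketch under assumptions: the helper name grub_commands_for is hypothetical, and the disk names and partition index 4 are simply the values from that comment. The script only prints text and does not touch any disk.

```shell
#!/bin/sh
# Print the GRUB legacy commands that install the boot loader on one
# RAID-1 member by temporarily mapping that disk to (hd0).
# $1: disk device, e.g. /dev/sda
# $2: GRUB partition index holding /boot/grub, e.g. 4 (values from comment #19)
# grub_commands_for is a hypothetical helper name.
grub_commands_for() {
    disk=$1
    part=$2
    printf 'device (hd0) %s\n' "$disk"
    printf 'root (hd0,%s)\n' "$part"
    printf 'setup (hd0)\n'
}

# One command group per mirror member, then leave the GRUB shell.
for disk in /dev/sda /dev/sdb; do
    grub_commands_for "$disk" 4
done
printf 'quit\n'
```

The printed text could be reviewed and then typed into the GRUB shell from the rescue system, or fed to it non-interactively (for example via the GRUB legacy shell's batch mode), once the device names have been checked against the actual machine.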
https://bugzilla.novell.com/show_bug.cgi?id=227377

------- Comment #20 from mmccarthy@lojack.com 2006-12-28 16:57 MST -------
Having spent most of the day trying to get s/w RAID installed on a system I am upgrading, I found that it is indeed a problem with putting /boot into an ext3 filesystem. If you use Reiserfs (as used to be the default) or ext2, then all is well.

If you want a journaling fs just use Reiser (it has served me well for years). If you insist on ext3, create a small RAID md, say 80MB, format it with ext2 and mount it as /boot. Since little ever changes on /boot, journaling doesn't buy you anything there. I tried both and they seem to boot up fine.
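[Editorial sketch, not part of the original thread.] The workaround suggested in comment #20 (a small RAID-1 md device formatted with ext2 and mounted as /boot) might look roughly like the dry run below. All device names are assumptions chosen for illustration, the helper name boot_md_plan is hypothetical, and the script only prints the commands instead of running them. Note that comment #21 reports a different experience with ext2 and ext3, so this is one reporter's workaround rather than a confirmed fix.

```shell
#!/bin/sh
# Dry run: print (do not execute) the steps for a small ext2 /boot on RAID 1.
# $1: md device to create; $2, $3: the two member partitions.
# All three names are hypothetical and must be adapted to the real layout.
boot_md_plan() {
    md=$1
    first=$2
    second=$3
    # Create a two-member RAID-1 array from the given partitions.
    printf 'mdadm --create %s --level=1 --raid-devices=2 %s %s\n' "$md" "$first" "$second"
    # Format it with ext2, as comment #20 suggests for /boot.
    printf 'mkfs.ext2 %s\n' "$md"
    # Matching /etc/fstab entry, shown as a comment line.
    printf '# /etc/fstab: %s /boot ext2 defaults 1 2\n' "$md"
}

boot_md_plan /dev/md0 /dev/sda1 /dev/sdb1
```

Printing the plan first keeps the destructive steps (array creation and formatting) under manual control.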
https://bugzilla.novell.com/show_bug.cgi?id=227377

------- Comment #21 from andreas.pfaller@gmail.com 2006-12-29 00:22 MST -------
The result from comment #20 surprises me. My first attempt had a separate /boot on ext2! While I could fix grub from the rescue system I never figured out how to make it past the initrd.

With my second installation attempt /boot was put on my root partition, which is ext3! That attempt also created a non-working grub installation (fixed again from the rescue system, final version see comment #19) and booted normally after that.
https://bugzilla.novell.com/show_bug.cgi?id=227377

------- Comment #22 from markus.kossmann@gmx.de 2006-12-31 07:55 MST -------
One idea: Some BIOSes require a partition with the "bootable" flag set on the boot disk, and if that is missing they claim that there is no operating system. Might it be that there is/was no partition with the "bootable" flag on your system?
https://bugzilla.novell.com/show_bug.cgi?id=227377

------- Comment #23 from andreas.pfaller@gmail.com 2006-12-31 08:53 MST -------
Markus, if you look at the many details I have provided above you will see that the bootable flag was set on the boot partition (see comment #7).
https://bugzilla.novell.com/show_bug.cgi?id=227377

mhorvath@novell.com changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |seuchato@yahoo.de

------- Comment #24 from mhorvath@novell.com 2007-01-24 05:54 MST -------
*** Bug 233758 has been marked as a duplicate of this bug. ***
https://bugzilla.novell.com/show_bug.cgi?id=227377

odabrunz@novell.com changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |ASSIGNED

------- Comment #25 from odabrunz@novell.com 2007-02-23 05:35 MST -------
I looked into this, but could not find any quick answer. Since software-RAID issues have been postponed until after SLES10 SP1, I will work on this for 10.3 (earliest?).
https://bugzilla.novell.com/show_bug.cgi?id=227377

odabrunz@novell.com changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
          Component|Installation                |Installation
            Product|openSUSE 10.2               |openSUSE 10.3
            Version|Final                       |Alpha 0plus

------- Comment #26 from odabrunz@novell.com 2007-05-10 11:39 MST -------
Moving bug to 10.3.
https://bugzilla.novell.com/show_bug.cgi?id=227377#c27
--- Comment #27 from Richard Creighton
https://bugzilla.novell.com/show_bug.cgi?id=227377#c28
Christoph Thiel
https://bugzilla.novell.com/show_bug.cgi?id=227377#c29
--- Comment #29 from Richard Creighton
https://bugzilla.novell.com/show_bug.cgi?id=227377#c30
--- Comment #30 from Richard Creighton
https://bugzilla.novell.com/show_bug.cgi?id=227377#c31
Stephan Kulow
https://bugzilla.novell.com/show_bug.cgi?id=227377
Stephan Kulow
https://bugzilla.novell.com/show_bug.cgi?id=227377#c32
Leonce Eraly
https://bugzilla.novell.com/show_bug.cgi?id=227377#c33
Michael McCarthy
https://bugzilla.novell.com/show_bug.cgi?id=227377#c35
Stefan Fent
https://bugzilla.novell.com/show_bug.cgi?id=227377#c36
Richard Creighton
The bootloader can't write to /dev/md, it has to write to the devices directly. As the combination of ext3 and md destroys this information again, this doesn't work, so you have to use ext2 / reiserfs. --> invalid.
Sir, I have installed to a pure MD raid environment using exclusively EXT3 filesystems and I assure you it works and is not invalid. What is invalid is the combination of IDE and SATA when trying to do all of this. All I have to do to make it work is remove all of the IDE drives from my system and install using pure MD raid structures and EXT3.

If my understanding is correct, EXT3 is EXT2 with journaling, so if true, what would that have to do with the bootloader anyway? The bootloader most assuredly *can* write to /dev/mdx. It is also obvious that it is capable of writing an MBR to the raw device that contains the MD raid or you wouldn't even get the 'GRUB' in the corner of the screen when you try to boot.

FWIW, I am writing this from a 10.3 RAID-only installation on EXT3 fs so I know it can work. What is nice is that 10.2 did it without jumping through all the hoops.

You can close it again, but that isn't correct, it is expedient.
https://bugzilla.novell.com/show_bug.cgi?id=227377
Matej Horvath
https://bugzilla.novell.com/show_bug.cgi?id=227377
Stefan Fent
https://bugzilla.novell.com/show_bug.cgi?id=227377#c38
Joachim Plack
https://bugzilla.novell.com/show_bug.cgi?id=227377#c39
--- Comment #39 from Richard Creighton
https://bugzilla.novell.com/show_bug.cgi?id=227377
User jplack@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=227377#c40
Joachim Plack
https://bugzilla.novell.com/show_bug.cgi?id=227377
User leo@unstable.be added comment
https://bugzilla.novell.com/show_bug.cgi?id=227377#c41
--- Comment #41 from Leonce Eraly
https://bugzilla.novell.com/show_bug.cgi?id=227377
User jplack@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=227377#c42
Joachim Plack
https://bugzilla.novell.com/show_bug.cgi?id=227377
Stanislav Visnovsky
https://bugzilla.novell.com/show_bug.cgi?id=227377
User jplack@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=227377#c43
Joachim Plack
https://bugzilla.novell.com/show_bug.cgi?id=227377
User juhliarik@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=227377#c44
Jozef Uhliarik