[Bug 483175] New: system never boots from any disks of software raid1 (upgraded from hosed raid1 opensuse 11.0)
https://bugzilla.novell.com/show_bug.cgi?id=483175 Summary: system never boots from any disks of software raid1 (upgraded from hosed raid1 opensuse 11.0) Classification: openSUSE Product: openSUSE 11.1 Version: Final Platform: i386 OS/Version: openSUSE 11.1 Status: NEW Severity: Critical Priority: P5 - None Component: Installation AssignedTo: bnc-team-screening@forge.provo.novell.com ReportedBy: abittner@stud.fh-heilbronn.de QAContact: jsrain@novell.com Found By: --- Created an attachment (id=277855) --> (https://bugzilla.novell.com/attachment.cgi?id=277855) clean 11.0 x86 with raid1 - logfiles. system unable to boot User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9.1b3) Gecko/20090305 Firefox/3.1b3 (.NET CLR 3.5.30729) experimenting with simple raid1 setup (two sata disks, sda and sdb) one / and one swap partition, becoming md0 and md1. an already messed up (same problem: opensuse 11.0 never boots from any of the sata drives, apparently grub never gets installed anywhere) opensuse 11.0 system was taken as a base, and i tried to fix it by applying opensuse 11.1 to it. opensuse 11.1 recognized md0 and md1 properly, upgraded the packages and even executed the new system (kexec instead of real re-boot) fine. did some yast2 online_updates in this first kexec-bootup of the upgraded 11.1 system i have read through a number of bugs and i think first of all this 11.0 bug was never really fixed: https://bugzilla.novell.com/show_bug.cgi?id=398356 as it should have been gone by 11.1 rtm. after the first real reboot of 11.1 after the online_upgrades the system never boots. i tried to manually select my sata disks from bios boot-menu during POST, but it never boots from any of the two sata disks in opensuse 11.1 either. also using the opensuse 11.0 (back during messed up state of 11.0) to try to "boot from harddisk" never works, and "boot from harddisk" in 11.1 also doesnt help any either. i also fail to see how this can have the status of fixed: https://bugzilla.novell.com/show_bug.cgi?id=439518 https://bugzilla.novell.com/show_bug.cgi?id=438644 i can try some additional completely clean 11.1 from scratch installs and see what happens there, but i doubt that it will help. also this whole thread talks about the huge grub/swraid1 mess in recent opensuse versions: http://lists.opensuse.org/opensuse/2009-02/msg00509.html :( Reproducible: Always Steps to Reproduce: 1. upgrade opensuse 11.0 software raid1 to opensuse 11.1 2. try to reboot your 11.1 machine 3. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=483175 User abittner@stud.fh-heilbronn.de added comment https://bugzilla.novell.com/show_bug.cgi?id=483175#c1 --- Comment #1 from andreas bittner <abittner@stud.fh-heilbronn.de> 2009-03-07 06:37:53 MST --- Created an attachment (id=277856) --> (https://bugzilla.novell.com/attachment.cgi?id=277856) 11.0 x86 with raid1 upgraded to 11.1 - logfiles. system unable to boot -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=483175 User abittner@stud.fh-heilbronn.de added comment https://bugzilla.novell.com/show_bug.cgi?id=483175#c2 --- Comment #2 from andreas bittner <abittner@stud.fh-heilbronn.de> 2009-03-07 08:49:40 MST --- clean 11.1 x86 install gives me a bootable system but just as long as the sda (first sata hdd) is connected. also manually selecting during bios/POST to boot system from second sata sdb hdd doesnt give any output. no grub lines, nothing. system just idles there after the POST screens and summary tables and messages of the bios POST. so grub doesnt get installed properly/at all onto the sdb disk. will attach logs. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=483175 User abittner@stud.fh-heilbronn.de added comment https://bugzilla.novell.com/show_bug.cgi?id=483175#c3 --- Comment #3 from andreas bittner <abittner@stud.fh-heilbronn.de> 2009-03-07 08:52:22 MST --- Created an attachment (id=277860) --> (https://bugzilla.novell.com/attachment.cgi?id=277860) logfiles from cleaninstall opensuse 11.1 swraid1 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=483175 User abittner@stud.fh-heilbronn.de added comment https://bugzilla.novell.com/show_bug.cgi?id=483175#c4 --- Comment #4 from andreas bittner <abittner@stud.fh-heilbronn.de> 2009-03-07 09:48:29 MST --- according to http://lists.opensuse.org/opensuse/2009-02/msg00597.html http://de.opensuse.org/SDB:Der_Bootmanager_GRUB#GRUB_manuell_installieren i ran grub as rootuser from the inside the new 11.1 system. and first looked up my two sata harddisks in /boot/grub/device.map
(hd0) /dev/disk/by-id/ata-SAMSUNG_HD753LJ_S13UJDWQ702567 (hd1) /dev/disk/by-id/ata-SAMSUNG_HD753LJ_S13UJDWQ702565 /dev/disk/by-id/ata-SAMSUNG_HD753LJ_S13UJDWQ702567 and then inside grub commandline shell: device (hd0) /dev/disk/by-id/ata-SAMSUNG_HD753LJ_S13UJDWQ702567
and according to mdadm -D /dev/md1 my / (root) filesystem is on /dev/sda5 and /dev/sdb5 partitions so then in grub i do: root (hd0,4) giving me output:
Filesystem type is ext2fs, partition type 0xfd so that / partition on /dev/sda(5) is indeed of partition type 0xFD, Linux RAID
and then inside grub: setup (hd0)
setup (hd0) Checking if "/boot/grub/stage1" exists... yes Checking if "/boot/grub/stage2" exists... yes Checking if "/boot/grub/e2fs_stage1_5" exists... yes Running "embed /boot/grub/e2fs_stage1_5 (hd0)"... 17 sectors are embedded. succeeded Running "install /boot/grub/stage1 (hd0) (hd0)1+17 p (hd0,4)/boot/grub/stage2 /boot/grub/menu.lst"... succeeded Done.
and then same for the other sata disk /dev/sdb in grub: grub> device (hd0) /dev/disk/by-id/ata-SAMSUNG_HD753LJ_S13UJDWQ702565 device (hd0) /dev/disk/by-id/ata-SAMSUNG_HD753LJ_S13UJDWQ702565 grub> root (hd0,4) root (hd0,4) Filesystem type is ext2fs, partition type 0xfd grub> setup (hd0) setup (hd0) Checking if "/boot/grub/stage1" exists... yes Checking if "/boot/grub/stage2" exists... yes Checking if "/boot/grub/e2fs_stage1_5" exists... yes Running "embed /boot/grub/e2fs_stage1_5 (hd0)"... 17 sectors are embedded. succeeded Running "install /boot/grub/stage1 (hd0) (hd0)1+17 p (hd0,4)/boot/grub/stage2 /boot/grub/menu.lst"... succeeded Done. --------- after exiting grub commandline shell and rebooting the 11.1 system, the whole box boots fine even when being selected to boot from second sata disk inside bios/POST screen, or when disconnecting the first sata disk and so on. it would be great if this bug could be fixed, and if there was some way to update the opensuse 11.1 iso images to embed these and other essential bugfixes and newer rpm packages. is there any way to patch/re-build the opensuse iso install images? thanks and regards. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=483175 User abittner@stud.fh-heilbronn.de added comment https://bugzilla.novell.com/show_bug.cgi?id=483175#c5 --- Comment #5 from andreas bittner <abittner@stud.fh-heilbronn.de> 2009-03-08 07:31:17 MST --- update: i have tried a clean install from scratch on opensuse 11.2 alpha0/factory today netinst iso http://download.opensuse.org/factory/iso/openSUSE-Factory-NET-i586-Build0026... and set up a same raid1 system. md0 for swap, md1 for / two sata disks same as above. the 11.2 alpha0 system can once again only boot from the first sata drive. if i manually select the second sata drive in bios/POST screen and want to boot from it nothing happens after the POST bios screens. so this is definitely not fixed anywhere up til opensuse 11.2 alpha0/factory as of now. at least not for me, and i fail to see the validity of all those closed bugreports and fixed markings. this is simply impossible. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=483175 Cyril Hrubis <chrubis@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- AssignedTo|bnc-team-screening@forge.pr |yast2-maintainers@suse.de |ovo.novell.com | -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=483175 User jsrain@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=483175#c6 Jiri Srain <jsrain@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Priority|P5 - None |P3 - Medium CC| |jsrain@novell.com, | |juhliarik@novell.com Component|Installation |Installation Version|Final |Factory AssignedTo|yast2-maintainers@suse.de |bnc-team-screening@forge.pr | |ovo.novell.com Product|openSUSE 11.1 |openSUSE 11.2 Severity|Critical |Normal --- Comment #6 from Jiri Srain <jsrain@novell.com> 2009-03-10 02:12:02 MST --- The reason for having an MD array is having the data safe at first, not having the system bootable if a disk fails (even though it is nice). Therefore lowering the severity. Anyway, I agree that we should improve this for 11.2 and have the system bootable from either of the disks provided some conditions are met (in your case, with two disks partitioned exactly the same way, they are). Note one point: The situation may be different when you disconnect /dev/sda (which is scenario that should work, when you remove broken disk) than when you just reconfigure your BIOS (which, I hope you agree, is not that important) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=483175 User abittner@stud.fh-heilbronn.de added comment https://bugzilla.novell.com/show_bug.cgi?id=483175#c7 --- Comment #7 from andreas bittner <abittner@stud.fh-heilbronn.de> 2009-03-10 05:50:17 MST --- hmm and i thought i was having backups for having data safety and having raid for having system (uptime) safety..... i was looking at the system as whole. i wasnt exactly reconfiguring bios, but my bios allows me via F8 keypress to manually select from detected storage divices which to boot from. now i am no storage, ibm or standards and compatibility expert or engineer and i dont know the exact details and specs and/or differences if there are any between reconfiguring bios to boot from exact device, or letting bios try various devices and skip to others if it cannot boot from a device (because not present, or because not bootable) that has been configured manually, and so forth. but i can try what happens when i let the bootdevice in my bios configured to first sata disk and disconnect it, if that behaves differently. anyways, as there are some solutions about how to install grub properly in raid1 scenarios on all involved disks and there are quite a number of other bugreports and demand of users to enhance this, it would be really great if this could be finally added and enhanced. this would also hopefully mean that opensuse resuce/repair mode bootup from opensuse dvd/cd media and everything that is related to this stuff would also be able to handle these situations and so on. thanks and regards. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=483175 User jsrain@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=483175#c8 Jiri Srain <jsrain@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- AssignedTo|bnc-team-screening@forge.pr |jreidinger@novell.com |ovo.novell.com | --- Comment #8 from Jiri Srain <jsrain@novell.com> 2009-03-10 06:02:15 MST --- You know, RAID is just faster recovery, and unlike back-up it is always up-to-date (while back-up is from yesterday) ;-) If you send me (via private mail) your bootloader configuration files (/etc/grub.conf, /boot/grub/menu.lst, /boot/grub/device.map), I can most likely update them for you and provide instructions on how to move forward. As I said, we should tackle it in 11.2 (or further). -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=483175 Josef Reidinger <jreidinger@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |ASSIGNED -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@novell.com