[Bug 356226] New: something wrong with the metadata detection on NVIDIA-raid
https://bugzilla.novell.com/show_bug.cgi?id=356226 User jos.baudrez@telenet.be added comment https://bugzilla.novell.com/show_bug.cgi?id=356226#c16 Summary: something wrong with the metadata detection on NVIDIA- raid Product: openSUSE 10.3 Version: Final Platform: x86-64 OS/Version: openSUSE 10.3 Status: NEW Severity: Normal Priority: P5 - None Component: Installation AssignedTo: bnc-team-screening@forge.provo.novell.com ReportedBy: jos.baudrez@telenet.be QAContact: jsrain@novell.com Found By: --- On installation, the SuSE 10.3 detects my nvidia STRIPED-array as a MIRRORED-array; even after (warm) boot the BIOS initialises the array as MIRROR, and i have to power-off before the BIOS sees the STRIPED array again! So I am unable to upgrade to SuSE 10.3. Ans that's a pity :-) ! I have formerly commented on the Bug-333753 (#16), but it seems that my problem has nothing to do with (mk)initrd or the problems there, so i was invited to open a new bug report. Hence .... What follows is the result of running dmraid after a fresh start in Rescue mode from the 10.2 and the 10.3 install-DVD's. My machine has 3 disks. Under SuSE 10.2 dmraid -b reports them as: /dev/sda: 145223999 total, "WD-WMANS1687285" /dev/sdb: 145223999 total, "WD-WMANS1617466" /dev/sdc: 976771055 total, "WD-WCAPW3282175" But under 10.3 the same command results in an already different detection: /dev/sda: 145226112 total, "WD-WMANS1687285" /dev/sdb: 145226112 total, "WD-WMANS1617466" /dev/sdc: 976773168 total, "WD-WCAPW3282175" (i do prefer the bigger capacity ;-) ) Under 10.2 as under 10.3 it seems that the nvidia metadata are discovered (and none of the other chipsets). Under 10.2 dmraid -n then further shows: /dev/sda (nvidia): 0x000 NVIDIA 0x008 size: 30 0x00c chksum: 2345429009 0x010 version: 100 0x012 unitNumber: 0 0x013 reserved: 0 0x014 capacity: 290447872 0x018 sectorSize: 512 0x01c productID: STRIPE 138.49G 0x02c productRevision: 100 0x030 unitFlags: 0 0x034 array->version: 6553668 0x038 array->signature[0]: 782688683 0x03c array->signature[1]: 1311357306 0x040 array->signature[2]: 357572781 0x044 array->signature[3]: 1066485961 0x048 array->raidJobCode: 0 0x049 array->stripeWidth: 2 0x04a array->totalVolumes: 2 0x04b array->originalWidth: 2 0x04c array->raidLevel: 128 0x050 array->stripeBlockSize: 128 0x054 array->stripeBlockByteSize: 65536 0x058 array->stripeBlockPower: 7 0x05c array->stripeMask: 127 0x060 array->stripeSize: 256 0x064 array->stripeByteSize: 131072 0x068 array->raidJobMark 0 0x06c array->originalLevel 128 0x070 array->originalCapacity 290447872 0x074 array->flags 0x0 /dev/sdb (nvidia): 0x000 NVIDIA 0x008 size: 30 0x00c chksum: 2345363473 0x010 version: 100 0x012 unitNumber: 1 0x013 reserved: 0 0x014 capacity: 290447872 0x018 sectorSize: 512 0x01c productID: STRIPE 138.49G 0x02c productRevision: 100 0x030 unitFlags: 0 0x034 array->version: 6553668 0x038 array->signature[0]: 782688683 0x03c array->signature[1]: 1311357306 0x040 array->signature[2]: 357572781 0x044 array->signature[3]: 1066485961 0x048 array->raidJobCode: 0 0x049 array->stripeWidth: 2 0x04a array->totalVolumes: 2 0x04b array->originalWidth: 2 0x04c array->raidLevel: 128 0x050 array->stripeBlockSize: 128 0x054 array->stripeBlockByteSize: 65536 0x058 array->stripeBlockPower: 7 0x05c array->stripeMask: 127 0x060 array->stripeSize: 256 0x064 array->stripeByteSize: 131072 0x068 array->raidJobMark 0 0x06c array->originalLevel 128 0x070 array->originalCapacity 290447872 0x074 array->flags 0x0 =============> but, under 10.3 the same command shows:<============= /dev/sda (nvidia): 0x000 NVIDIA 0x008 size: 30 0x00c chksum: 2373758283 0x010 version: 100 0x012 unitNumber: 0 0x013 reserved: 0 0x014 capacity: 145225984 0x018 sectorSize: 512 0x01c productID: MIRROR 69.24G 0x02c productRevision: 100 0x030 unitFlags: 0 0x034 array->version: 6553668 0x038 array->signature[0]: 114032115 0x03c array->signature[1]: 982343681 0x040 array->signature[2]: 395968431 0x044 array->signature[3]: 2137081411 0x048 array->raidJobCode: 0 0x049 array->stripeWidth: 1 0x04a array->totalVolumes: 2 0x04b array->originalWidth: 1 0x04c array->raidLevel: 129 0x050 array->stripeBlockSize: 128 0x054 array->stripeBlockByteSize: 65536 0x058 array->stripeBlockPower: 7 0x05c array->stripeMask: 127 0x060 array->stripeSize: 128 0x064 array->stripeByteSize: 65536 0x068 array->raidJobMark 0 0x06c array->originalLevel 129 0x070 array->originalCapacity 145225984 0x074 array->flags 0x0 /dev/sdb (nvidia): 0x000 NVIDIA 0x008 size: 30 0x00c chksum: 2373692747 0x010 version: 100 0x012 unitNumber: 1 0x013 reserved: 0 0x014 capacity: 145225984 0x018 sectorSize: 512 0x01c productID: MIRROR 69.24G 0x02c productRevision: 100 0x030 unitFlags: 0 0x034 array->version: 6553668 0x038 array->signature[0]: 114032115 0x03c array->signature[1]: 982343681 0x040 array->signature[2]: 395968431 0x044 array->signature[3]: 2137081411 0x048 array->raidJobCode: 0 0x049 array->stripeWidth: 1 0x04a array->totalVolumes: 2 0x04b array->originalWidth: 1 0x04c array->raidLevel: 129 0x050 array->stripeBlockSize: 128 0x054 array->stripeBlockByteSize: 65536 0x058 array->stripeBlockPower: 7 0x05c array->stripeMask: 127 0x060 array->stripeSize: 128 0x064 array->stripeByteSize: 65536 0x068 array->raidJobMark 0 0x06c array->originalLevel 129 0x070 array->originalCapacity 145225984 0x074 array->flags 0x0 Needless to say i am able to activate the raid under 10.2. The command dmraid -ay results in: NOTICE: added /dev/sda to RAID set "nvidia_dbbcaejb" NOTICE: added /dev/sdb to RAID set "nvidia_dbbcaejb" INFO: Activating stripe RAID set "nvidia_dbbcaejb" NOTICE: discovering partitions on "nvidia_dbbcaejb" NOTICE: /dev/mapper/nvidia_dbbcaejb: dos discovering NOTICE: /dev/mapper/nvidia_dbbcaejb: dos metadata discovered NOTICE: created partitioned RAID set(s) for /dev/mapper/nvidia_dbbcaejb INFO: Activating partition RAID set "nvidia_dbbcaejb5" INFO: Activating partition RAID set "nvidia_dbbcaejb6" INFO: Activating partition RAID set "nvidia_dbbcaejb7" INFO: Activating partition RAID set "nvidia_dbbcaejb8" On the other hand this doesn't succeed under 10.3: NOTICE: added /dev/sda to RAID set "nvidia_difeadeg" NOTICE: added /dev/sdb to RAID set "nvidia_difeadeg" INFO: Activating mirror RAID set "nvidia_difeadeg" NOTICE: discovering partitions on "nvidia_difeadeg" NOTICE: /dev/mapper/nvidia_difeadeg: dos discovering NOTICE: /dev/mapper/nvidia_difeadeg: dos metadata discovered NOTICE: created partitioned RAID set(s) for /dev/mapper/nvidia_difeadeg --------->BUT THEN I get an error message on the console saying something like "past end of partition" I hope this can clarify a bit my problem - and if I can be of any help, please let me know, i'll do what i can - but i would like to get a solution for this. Tnx in advance! -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=356226
Matthias Koenig
https://bugzilla.novell.com/show_bug.cgi?id=356226
User mkoenig@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c1
Matthias Koenig
https://bugzilla.novell.com/show_bug.cgi?id=356226
User jos.baudrez@telenet.be added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c2
--- Comment #2 from Jos Baudrez
https://bugzilla.novell.com/show_bug.cgi?id=356226
User jos.baudrez@telenet.be added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c3
Jos Baudrez
https://bugzilla.novell.com/show_bug.cgi?id=356226
User mkoenig@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c4
Matthias Koenig
https://bugzilla.novell.com/show_bug.cgi?id=356226
User jos.baudrez@telenet.be added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c5
--- Comment #5 from Jos Baudrez
https://bugzilla.novell.com/show_bug.cgi?id=356226
User jos.baudrez@telenet.be added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c6
--- Comment #6 from Jos Baudrez
https://bugzilla.novell.com/show_bug.cgi?id=356226
Greg Kroah-Hartman
https://bugzilla.novell.com/show_bug.cgi?id=356226
User jos.baudrez@telenet.be added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c7
--- Comment #7 from Jos Baudrez
https://bugzilla.novell.com/show_bug.cgi?id=356226
User nfbrown@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c8
Neil Brown
https://bugzilla.novell.com/show_bug.cgi?id=356226
User oen@dsl.pipex.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c9
--- Comment #9 from Occo Eric Nolf
https://bugzilla.novell.com/show_bug.cgi?id=356226
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c10
Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=356226
User jos.baudrez@telenet.be added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c11
Jos Baudrez
https://bugzilla.novell.com/show_bug.cgi?id=356226
User jos.baudrez@telenet.be added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c12
--- Comment #12 from Jos Baudrez
https://bugzilla.novell.com/show_bug.cgi?id=356226
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c13
Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=356226
User nfbrown@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c14
--- Comment #14 from Neil Brown
https://bugzilla.novell.com/show_bug.cgi?id=356226
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c15
--- Comment #15 from Tejun Heo
Re: comment #10.
It would not be easy to get dmraid to try multiple device sizes, but probably it is not impossible. However dmraid is a user-space program that reads the metadata then sets up dm-tables in the kernel. So the two different possible sizes need to be available to userspace. Maybe an 'hpa-size' sysfs attribute???
Yeah, probably.
It would possibly be a lot easier if there was an ioctl to set the apparent device size to match the hpa size, or maybe even two different devices, one for each size (OK, that really is asking too much).
Hmmm... Making two different devices was suggested once by Jeff Garzik too but I worry how the userland would cope with it (ie. udev, hal and friends). It can easily confuse userland tools. I'll think about the ioctl but I think it would be better to keep the HPA region to be accessible whether dm is attached or not and if dm is going to revert the setting after scan, it just makes more sense to use the BIOS size directly instead of locking and unlocking HPA for metadata scanning. I suppose that it can be tedious to try to retrieve matching sysfs node. Does dmraid require sysfs for anything else?
Re: comments #11/#13. As libata is loaded as a module, a boot time kernel parameter won't work. You would need to get a line like:
options libata ignore_hpa=0
into the modprobe.conf in the initrd. Maybe try adding that line to /etc/modprobe.conf or /etc/modprobe.d/libata and rerunning mkinitrd ??
Our initrd is smart enough to pass residual xxxx.yyyy params to modules contained in the initrd although it does produce spurious invalid parameter warning during boot. That mechanism doesn't work for modules loaded by udev after root fs is mounted. I don't think it's the case here but nevertheless please try the suggested config files. Thanks. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=356226
User jos.baudrez@telenet.be added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c16
Jos Baudrez
https://bugzilla.novell.com/show_bug.cgi?id=356226
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c17
--- Comment #17 from Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=356226
User jos.baudrez@telenet.be added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c18
--- Comment #18 from Jos Baudrez
https://bugzilla.novell.com/show_bug.cgi?id=356226
User jos.baudrez@telenet.be added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c19
--- Comment #19 from Jos Baudrez
https://bugzilla.novell.com/show_bug.cgi?id=356226
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c20
--- Comment #20 from Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=356226
User jos.baudrez@telenet.be added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c21
--- Comment #21 from Jos Baudrez
grep -i hpa boot.msg <6>Command line: BOOT_IMAGE=linux initrd=initrd,12801024.spl splash=silent rescue=1 vga=0x31a libata.ignore_hpa=0 <5>Kernel command line: BOOT_IMAGE=linux initrd=initrd,12801024.spl splash=silent rescue=1 vga=0x31a libata.ignore_hpa=0 <3>Unknown boot option `libata.ignore_hpa=0': ignoring <6>ata3.00: HPA unlocked: 145223999 -> 145226112, native 145226112 <6>ata4.00: HPA unlocked: 145223999 -> 145226112, native 145226112 <6>ata5.00: HPA unlocked: 976771055 -> 976773168, native 976773168
So the striped-array is never recognized, and I can't install ..... I understand the /etc/sysconfig/kernel is not necessary now ? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=356226
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c22
Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=356226
User jos.baudrez@telenet.be added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c23
Jos Baudrez
https://bugzilla.novell.com/show_bug.cgi?id=356226
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c24
Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=356226
User nfbrown@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c25
Neil Brown
https://bugzilla.novell.com/show_bug.cgi?id=356226
User jos.baudrez@telenet.be added comment
https://bugzilla.novell.com/show_bug.cgi?id=356226#c26
--- Comment #26 from Jos Baudrez
participants (1)
-
bugzilla_noreply@novell.com