Mailinglist Archive: opensuse-bugs (11339 mails)

[Bug 461277] Upgrade to 11.1 GM broke RAID5 XFS
  • From: bugzilla_noreply@xxxxxxxxxx
  • Date: Fri, 26 Dec 2008 14:44:08 -0700 (MST)
  • Message-id: <20081226214408.4BD8F245390@xxxxxxxxxxxxxxxxxxxxxx>
https://bugzilla.novell.com/show_bug.cgi?id=461277

User kdemello@xxxxxxxxx added comment
https://bugzilla.novell.com/show_bug.cgi?id=461277#c9


Kenn de Mello <kdemello@xxxxxxxxx> changed:

What    |Removed                     |Added
----------------------------------------------------------------------------
CC      |                            |kdemello@xxxxxxxxx




--- Comment #9 from Kenn de Mello <kdemello@xxxxxxxxx> 2008-12-26 14:44:06 MST ---
I suspect I'm having the same problem. I have 4 identical WD hard drives
configured in an MD RAID 5 array. Since installing 11.1, all 4 have been kicked
out of the array with errors similar to the following:

Dec 19 10:39:40 loki kernel: ata1.00: exception Emask 0x10 SAct 0x0 SErr 0x4010000 action 0xe frozen
Dec 19 10:39:40 loki kernel: ata1.00: irq_stat 0x00400040, connection status changed
Dec 19 10:39:40 loki kernel: ata1: SError: { PHYRdyChg DevExch }
Dec 19 10:39:40 loki kernel: ata1.00: cmd ea/00:00:00:00:00/00:00:00:00:00/a0 tag 0
Dec 19 10:39:40 loki kernel: res 40/00:04:dd:d5:6c/00:00:08:00:00/40 Emask 0x10 (ATA bus error)
Dec 19 10:39:40 loki kernel: ata1.00: status: { DRDY }
Dec 19 10:39:40 loki kernel: ata1: hard resetting link
Dec 19 10:39:44 loki kernel: ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
Dec 19 10:39:49 loki kernel: ata1.00: qc timeout (cmd 0xec)
Dec 19 10:39:49 loki kernel: ata1.00: failed to IDENTIFY (I/O error, err_mask=0x5)
Dec 19 10:39:49 loki kernel: ata1.00: revalidation failed (errno=-5)
Dec 19 10:39:49 loki kernel: ata1: hard resetting link
Dec 19 10:39:50 loki kernel: ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
Dec 19 10:39:50 loki kernel: ata1.00: configured for UDMA/133
Dec 19 10:39:50 loki kernel: ata1: EH complete
Dec 19 10:39:50 loki kernel: sd 0:0:0:0: [sda] 312581808 512-byte hardware sectors: (160GB/149GiB)
Dec 19 10:39:50 loki kernel: sd 0:0:0:0: [sda] Write Protect is off
Dec 19 10:39:50 loki kernel: sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
Dec 19 10:39:50 loki kernel: sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
Dec 19 10:39:57 loki kernel: end_request: I/O error, dev sda, sector 312576642
Dec 19 10:39:57 loki kernel: md: super_written gets error=-5, uptodate=0
Dec 19 10:39:57 loki kernel: raid5: Disk failure on sda2, disabling device.
Dec 19 10:39:57 loki kernel: raid5: Operation continuing on 3 devices.
Dec 19 10:39:57 loki kernel: RAID5 conf printout:
Dec 19 10:39:57 loki kernel: --- rd:4 wd:3
Dec 19 10:39:57 loki kernel: disk 0, o:1, dev:sdd2
Dec 19 10:39:57 loki kernel: disk 1, o:1, dev:sdc2
Dec 19 10:39:57 loki kernel: disk 2, o:1, dev:sdb2
Dec 19 10:39:57 loki kernel: disk 3, o:0, dev:sda2
Dec 19 10:39:59 loki kernel: RAID5 conf printout:
Dec 19 10:39:59 loki kernel: --- rd:4 wd:3
Dec 19 10:39:59 loki kernel: disk 0, o:1, dev:sdd2
Dec 19 10:39:59 loki kernel: disk 1, o:1, dev:sdc2
Dec 19 10:39:59 loki kernel: disk 2, o:1, dev:sdb2

I see a very similar error in the messages file that Quentin attached. This
happens during periods of heavy writes to the array, and I've been lucky
in that the rebuild didn't cause errors on another disk.
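
For anyone who wants to check their own messages file for the same pattern,
here is a rough sketch in Python. The function name `summarize` and the three
regular expressions are my own assumptions, written only against the message
shapes quoted above; other kernel versions may word these lines differently.

```python
import re

# Patterns modelled only on the kernel messages quoted in this comment;
# they are assumptions, not an official log format.
ATA_EXCEPTION = re.compile(r"kernel: (ata\d+(?:\.\d+)?): exception Emask")
IO_ERROR = re.compile(r"kernel: end_request: I/O error, dev (\w+), sector (\d+)")
MD_FAILURE = re.compile(r"kernel: raid5: Disk failure on (\w+), disabling device")


def summarize(lines):
    """Collect ATA exceptions, block I/O errors and raid5 member failures."""
    summary = {"ata_exceptions": [], "io_errors": [], "failed_members": []}
    for line in lines:
        match = ATA_EXCEPTION.search(line)
        if match:
            summary["ata_exceptions"].append(match.group(1))
            continue
        match = IO_ERROR.search(line)
        if match:
            summary["io_errors"].append((match.group(1), int(match.group(2))))
            continue
        match = MD_FAILURE.search(line)
        if match:
            summary["failed_members"].append(match.group(1))
    return summary


# A few lines taken from the log excerpt above.
sample = [
    "Dec 19 10:39:40 loki kernel: ata1.00: exception Emask 0x10 SAct 0x0 SErr",
    "Dec 19 10:39:57 loki kernel: end_request: I/O error, dev sda, sector 312576642",
    "Dec 19 10:39:57 loki kernel: raid5: Disk failure on sda2, disabling device.",
    "Dec 19 10:39:57 loki kernel: raid5: Operation continuing on 3 devices.",
]
print(summarize(sample))
```

To run it against a real log, pass an open file instead of the sample list,
for example `summarize(open("/var/log/messages"))`.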

My disks are partitioned thusly:

/dev/sda1   *       1      13     104391   fd  Linux raid autodetect
/dev/sda2          14   19457  156183930   fd  Linux raid autodetect

Each disk has a 100MB first partition that is a member of a raid1 array holding
the ext3 /boot filesystem, and the second partition on each disk is a member of
the raid5, which is then carved up using LVM.
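
As a rough sanity check on that layout, the sizes can be worked out as below.
This is only a sketch under stated assumptions: that the fdisk block counts are
in 1 KiB units, that all four disks are partitioned identically, and that md
and LVM metadata overhead is ignored. The helper names are made up for
illustration.

```python
def raid1_usable_kib(member_kib):
    """A mirror holds one member's worth of data, however many copies exist."""
    return member_kib


def raid5_usable_kib(member_kib, members):
    """RAID 5 gives up one member's worth of space to parity."""
    if members < 3:
        raise ValueError("RAID 5 needs at least 3 members")
    return member_kib * (members - 1)


# Block counts from the fdisk listing above (assumed to be 1 KiB blocks).
boot_kib = 104391
data_kib = 156183930
disks = 4

print(f"/boot mirror: {raid1_usable_kib(boot_kib) / 1024:.1f} MiB")
print(f"raid5 usable: {raid5_usable_kib(data_kib, disks) / 1024 ** 2:.1f} GiB")
```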

I did a fresh install of 11.1, except that the partition layout and LVM volumes
had already been created under 11.0.

I'll attach the output of mdadm --detail, hwinfo, dmidecode, fdisk -l,
(pv|vg|lv)display, smartctl -a and /var/log/messages, along with whatever else
might help.
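
In case it helps others gather the same information, the list above could be
collected with something like the following sketch. The function names, the
output path and the md device names passed in are all hypothetical; only the
tool names and the `--detail`, `-l` and `-a` options come from the list above.
Most of these tools need to be run as root.

```python
import subprocess


def diagnostic_commands(md_devices, disks):
    """Build argv lists for the tools named in the comment above."""
    commands = [["mdadm", "--detail", md] for md in md_devices]
    commands += [["hwinfo"], ["dmidecode"], ["fdisk", "-l"]]
    commands += [["pvdisplay"], ["vgdisplay"], ["lvdisplay"]]
    commands += [["smartctl", "-a", disk] for disk in disks]
    return commands


def collect(commands, out_path):
    """Run each command and write its output to a single text file."""
    with open(out_path, "w") as out:
        for argv in commands:
            out.write("### " + " ".join(argv) + "\n")
            try:
                result = subprocess.run(argv, capture_output=True, text=True)
                out.write(result.stdout + result.stderr)
            except OSError as exc:  # for example, the tool is not installed
                out.write(f"could not run: {exc}\n")
```

A call might look like `collect(diagnostic_commands(["/dev/md0"], ["/dev/sda",
"/dev/sdb", "/dev/sdc", "/dev/sdd"]), "raid-report.txt")`, where `/dev/md0`
and the report file name are placeholders.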


