[Bug 573632] New: raid reads beyond disk
http://bugzilla.novell.com/show_bug.cgi?id=573632 http://bugzilla.novell.com/show_bug.cgi?id=573632#c0 Summary: raid reads beyond disk Classification: openSUSE Product: openSUSE 11.2 Version: Final Platform: i686 OS/Version: openSUSE 11.2 Status: NEW Severity: Normal Priority: P5 - None Component: Kernel AssignedTo: kernel-maintainers@forge.provo.novell.com ReportedBy: marvin24@gmx.de QAContact: qa@suse.de Found By: --- Blocker: --- User-Agent: Mozilla/5.0 (compatible; Konqueror/4.3; Linux) KHTML/4.3.4 (like Gecko) My raid5 frequently goes offline with these messages in the log: Jan 25 14:12:31 fb07-iapwap1 kernel: [182469.528871] scsi0: Someone reset channel A Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.878958] end_request: I/O error, dev sdb, sector 72163775 Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.878984] md: super_written gets error=-5, uptodate=0 Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.879005] raid5: Disk failure on sdb1, disabling device. Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.879013] raid5: Operation continuing on 3 devices. Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.879757] end_request: I/O error, dev sda, sector 17912255 Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.879777] md: super_written gets error=-5, uptodate=0 Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.879796] raid5: Disk failure on sda1, disabling device. Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.879803] raid5: Operation continuing on 2 devices. Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.880167] end_request: I/O error, dev sdc, sector 17912255 Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.880186] md: super_written gets error=-5, uptodate=0 Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.880203] raid5: Disk failure on sdc1, disabling device. Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.880211] raid5: Operation continuing on 1 devices. Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.887283] end_request: I/O error, dev sdd, sector 17912255 Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.887312] md: super_written gets error=-5, uptodate=0 Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.887332] raid5: Disk failure on sdd1, disabling device. Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.887340] raid5: Operation continuing on 0 devices. Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.887556] RAID5 conf printout: Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.887572] --- rd:4 wd:0 Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.887587] disk 0, o:0, dev:sda1 Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.887601] disk 1, o:0, dev:sdb1 Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.887614] disk 2, o:0, dev:sdc1 Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.887626] disk 3, o:0, dev:sdd1 Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.893056] RAID5 conf printout: Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.893072] --- rd:4 wd:0 Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.893087] disk 1, o:0, dev:sdb1 Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.893100] disk 2, o:0, dev:sdc1 Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.893113] disk 3, o:0, dev:sdd1 Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.893158] RAID5 conf printout: Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.893169] --- rd:4 wd:0 Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.893182] disk 1, o:0, dev:sdb1 Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.893195] disk 2, o:0, dev:sdc1 Jan 25 14:16:31 fb07-iapwap1 kernel: [182708.893207] disk 3, o:0, dev:sdd1 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.901045] RAID5 conf printout: Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.901060] --- rd:4 wd:0 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.901074] disk 1, o:0, dev:sdb1 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.901087] disk 2, o:0, dev:sdc1 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.901109] RAID5 conf printout: Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.901120] --- rd:4 wd:0 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.901133] disk 1, o:0, dev:sdb1 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.901146] disk 2, o:0, dev:sdc1 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.909044] RAID5 conf printout: Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.909059] --- rd:4 wd:0 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.909074] disk 1, o:0, dev:sdb1 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.909095] RAID5 conf printout: Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.909106] --- rd:4 wd:0 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.909119] disk 1, o:0, dev:sdb1 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.917044] RAID5 conf printout: Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.917059] --- rd:4 wd:0 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.917812] Buffer I/O error on device md0, logical block 2249 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.917829] lost page write due to I/O error on md0 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.917879] Buffer I/O error on device md0, logical block 2250 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.917894] lost page write due to I/O error on md0 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.917938] Buffer I/O error on device md0, logical block 2251 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.917953] lost page write due to I/O error on md0 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.917996] Buffer I/O error on device md0, logical block 2252 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.918011] lost page write due to I/O error on md0 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.918054] Buffer I/O error on device md0, logical block 2253 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.918069] lost page write due to I/O error on md0 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.918112] Buffer I/O error on device md0, logical block 2254 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.918127] lost page write due to I/O error on md0 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.918172] Buffer I/O error on device md0, logical block 2255 Jan 25 14:16:32 fb07-iapwap1 kernel: [182708.918186] lost page write due to I/O error on md0 The raid5 consists of 4 disks (3x9GB,1x36GB{sdb}). Could this be a hardware problem? I wonder why the first i/o errors occur with sectors at the end of the disks. fdisk reports: Platte /dev/sda: 9173 MByte, 9173114880 Byte 255 Köpfe, 63 Sektoren/Spuren, 1115 Zylinder Einheiten = Zylinder von 16065 × 512 = 8225280 Bytes Disk identifier: 0x00000000 Gerät boot. Anfang Ende Blöcke Id System /dev/sda1 1 1115 8956206 fd Linux raid autodetect marc@fb07-iapwap1:~> sudo /sbin/fdisk -l /dev/sd? Platte /dev/sda: 9173 MByte, 9173114880 Byte 255 Köpfe, 63 Sektoren/Spuren, 1115 Zylinder Einheiten = Zylinder von 16065 × 512 = 8225280 Bytes Disk identifier: 0x00000000 Gerät boot. Anfang Ende Blöcke Id System /dev/sda1 1 1115 8956206 fd Linux raid autodetect Platte /dev/sdb: 37.0 GByte, 36951490560 Byte 255 Köpfe, 63 Sektoren/Spuren, 4492 Zylinder Einheiten = Zylinder von 16065 × 512 = 8225280 Bytes Disk identifier: 0x474663db Gerät boot. Anfang Ende Blöcke Id System /dev/sdb1 * 1 4492 36081958+ fd Linux raid autodetect Platte /dev/sdc: 9173 MByte, 9173114880 Byte 255 Köpfe, 63 Sektoren/Spuren, 1115 Zylinder Einheiten = Zylinder von 16065 × 512 = 8225280 Bytes Disk identifier: 0x00000000 Gerät boot. Anfang Ende Blöcke Id System /dev/sdc1 1 1115 8956206 fd Linux raid autodetect Platte /dev/sdd: 9173 MByte, 9173114880 Byte 255 Köpfe, 63 Sektoren/Spuren, 1115 Zylinder Einheiten = Zylinder von 16065 × 512 = 8225280 Bytes Disk identifier: 0x00000000 Gerät boot. Anfang Ende Blöcke Id System /dev/sdd1 1 1115 8956206 fd Linux raid autodetect mdadm --detail /dev/md0 shows: /dev/md0: Version : 0.90 Creation Time : Sun Jun 15 22:44:51 2003 Raid Level : raid5 Array Size : 26868096 (25.62 GiB 27.51 GB) Used Dev Size : 8956032 (8.54 GiB 9.17 GB) Raid Devices : 4 Total Devices : 4 Preferred Minor : 0 Persistence : Superblock is persistent Update Time : Mon Jan 25 19:51:51 2010 State : clean Active Devices : 4 Working Devices : 4 Failed Devices : 0 Spare Devices : 0 Layout : left-symmetric Chunk Size : 128K UUID : 6332406a:06436bd3:d2450da0:2958b394 Events : 0.6656794 Number Major Minor RaidDevice State 0 8 1 0 active sync /dev/sda1 1 8 17 1 active sync /dev/sdb1 2 8 33 2 active sync /dev/sdc1 3 8 49 3 active sync /dev/sdd1 Reproducible: Sometimes Steps to Reproduce: 1. 2. 3. -- Configure bugmail: http://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
http://bugzilla.novell.com/show_bug.cgi?id=573632
http://bugzilla.novell.com/show_bug.cgi?id=573632#c1
Jeff Mahoney
http://bugzilla.novell.com/show_bug.cgi?id=573632
http://bugzilla.novell.com/show_bug.cgi?id=573632#c2
Nikanth K
http://bugzilla.novell.com/show_bug.cgi?id=573632
http://bugzilla.novell.com/show_bug.cgi?id=573632#c3
Neil Brown
http://bugzilla.novell.com/show_bug.cgi?id=573632
http://bugzilla.novell.com/show_bug.cgi?id=573632#c4
Nikanth K
participants (1)
-
bugzilla_noreply@novell.com