http://bugzilla.suse.com/show_bug.cgi?id=917098
Bug ID: 917098
Summary: disk failures with backported mpt3sas driver
Classification: openSUSE
Product: openSUSE Distribution
Version: 13.2
Hardware: x86-64
OS: All
Status: NEW
Severity: Normal
Priority: P5 - None
Component: Kernel
Assignee: kernel-maintainers@forge.provo.novell.com
Reporter: s.priebe@profihost.ag
QA Contact: qa-bugs@suse.de
Found By: ---
Blocker: ---
Hello,
while testing the SLE12 kernel git repository, I noticed a bug in the mpt3sas
driver.
A bisect shows that the following commit introduces the problem:
commit 0be72ecd34fd0b1942e2dd64fa54082140f9214b
Author: Hannes Reinecke
Date: Fri Mar 28 11:50:58 2014 +0100
mpt3sas: Added Reply Descriptor Post Queue (RDPQ) Array support
(FATE#315932,bnc#854817).
suse-commit: e0dfac7e406ad8c7b208b235ebeccc9333df4887
The current git master works fine if I revert
0be72ecd34fd0b1942e2dd64fa54082140f9214b.
Sample log:
mpt3sas0: MSI-X vectors supported: 8, no of cores: 12, max_msix_vectors: -1
mpt3sas 0000:01:00.0: irq 94 for MSI/MSI-X
mpt3sas 0000:01:00.0: irq 95 for MSI/MSI-X
mpt3sas 0000:01:00.0: irq 96 for MSI/MSI-X
mpt3sas 0000:01:00.0: irq 97 for MSI/MSI-X
mpt3sas 0000:01:00.0: irq 98 for MSI/MSI-X
mpt3sas 0000:01:00.0: irq 99 for MSI/MSI-X
mpt3sas 0000:01:00.0: irq 100 for MSI/MSI-X
mpt3sas 0000:01:00.0: irq 101 for MSI/MSI-X
mpt3sas0-msix0: PCI-MSI-X enabled: IRQ 94
mpt3sas0-msix1: PCI-MSI-X enabled: IRQ 95
mpt3sas0-msix2: PCI-MSI-X enabled: IRQ 96
mpt3sas0-msix3: PCI-MSI-X enabled: IRQ 97
mpt3sas0-msix4: PCI-MSI-X enabled: IRQ 98
mpt3sas0-msix5: PCI-MSI-X enabled: IRQ 99
mpt3sas0-msix6: PCI-MSI-X enabled: IRQ 100
mpt3sas0-msix7: PCI-MSI-X enabled: IRQ 101
mpt3sas0: iomem(0x00000000fb200000), mapped(0xffffc90017000000),
size(65536)
mpt3sas0: ioport(0x000000000000e000), size(256)
mpt3sas0: LSISAS3008: FWVersion(06.00.00.00), ChipRevision(0x02),
BiosVersion(07.00.00.00)
mpt3sas0: Protocol=(
Initiator,Target
), Capabilities=(
TLR,EEDP
,Snapshot Buffer,Diag Trace Buffer
,Task Set Full,NCQ
)
mpt3sas0: sending port enable !!
i40e 0000:03:00.1: PCI-Express: Speed 8.0GT/s Width x8
i40e 0000:03:00.1: Features: PF-id[1] VFs: 64 VSIs: 66 QP: 12 RSS
FD_ATR FD_SB NTUPLE PTP
mpt3sas0: host_add: handle(0x0001), sas_addr(0x5003048016af7200), phys(8)
scsi 0:0:0:0: Direct-Access ATA SAMSUNG MZ7WD960 C03Q PQ: 0
ANSI: 6
scsi 0:0:0:0: SATA: handle(0x0009), sas_addr(0x4433221100000000),
phy(0), device_name(0x0000000000000000)
scsi 0:0:0:0: SATA: enclosure_logical_id(0x5003048016af7200), slot(3)
scsi 0:0:0:0: atapi(n), ncq(y), asyn_notify(n), smart(y), fua(y),
sw_preserve(y)
scsi 0:0:0:0: qdepth(32), tagged(1), simple(0), ordered(0),
scsi_level(7), cmd_que(1)
sd 0:0:0:0: Power-on or device reset occurred
sd 0:0:0:0: Attached scsi generic sg2 type 0
sd 0:0:0:0: [sdc] 1782579200 512-byte logical blocks: (912 GB/850 GiB)
sd 0:0:0:0: [sdc] Write Protect is off
sd 0:0:0:0: [sdc] Mode Sense: 9b 00 10 08
sd 0:0:0:0: [sdc] Write cache: enabled, read cache: enabled, supports
DPO and FUA
sdc: unknown partition table
sd 0:0:0:0: [sdc] Attached SCSI disk
scsi 0:0:1:0: Direct-Access ATA SAMSUNG MZ7WD960 C03Q PQ: 0
ANSI: 6
scsi 0:0:1:0: SATA: handle(0x000a), sas_addr(0x4433221101000000),
phy(1), device_name(0x0000000000000000)
scsi 0:0:1:0: SATA: enclosure_logical_id(0x5003048016af7200), slot(2)
scsi 0:0:1:0: atapi(n), ncq(y), asyn_notify(n), smart(y), fua(y),
sw_preserve(y)
scsi 0:0:1:0: qdepth(32), tagged(1), simple(0), ordered(0),
scsi_level(7), cmd_que(1)
sd 0:0:1:0: Power-on or device reset occurred
sd 0:0:1:0: Attached scsi generic sg3 type 0
sd 0:0:1:0: [sdd] 1782579200 512-byte logical blocks: (912 GB/850 GiB)
sd 0:0:1:0: [sdd] Write Protect is off
sd 0:0:1:0: [sdd] Mode Sense: 9b 00 10 08
sd 0:0:1:0: [sdd] Write cache: enabled, read cache: enabled, supports
DPO and FUA
sdd: unknown partition table
sd 0:0:1:0: [sdd] Attached SCSI disk
scsi 0:0:2:0: Direct-Access ATA SAMSUNG MZ7WD960 C03Q PQ: 0
ANSI: 6
scsi 0:0:2:0: SATA: handle(0x000b), sas_addr(0x4433221102000000),
phy(2), device_name(0x0000000000000000)
scsi 0:0:2:0: SATA: enclosure_logical_id(0x5003048016af7200), slot(0)
scsi 0:0:2:0: atapi(n), ncq(y), asyn_notify(n), smart(y), fua(y),
sw_preserve(y)
scsi 0:0:2:0: qdepth(32), tagged(1), simple(0), ordered(0),
scsi_level(7), cmd_que(1)
sd 0:0:2:0: Power-on or device reset occurred
sd 0:0:2:0: Attached scsi generic sg4 type 0
sd 0:0:2:0: [sde] 1782579200 512-byte logical blocks: (912 GB/850 GiB)
sd 0:0:2:0: [sde] Write Protect is off
sd 0:0:2:0: [sde] Mode Sense: 9b 00 10 08
sd 0:0:2:0: [sde] Write cache: enabled, read cache: enabled, supports
DPO and FUA
sde: unknown partition table
sd 0:0:2:0: [sde] Attached SCSI disk
scsi 0:0:3:0: Direct-Access ATA SAMSUNG MZ7WD960 C03Q PQ: 0
ANSI: 6
scsi 0:0:3:0: SATA: handle(0x000c), sas_addr(0x4433221103000000),
phy(3), device_name(0x0000000000000000)
scsi 0:0:3:0: SATA: enclosure_logical_id(0x5003048016af7200), slot(1)
scsi 0:0:3:0: atapi(n), ncq(y), asyn_notify(n), smart(y), fua(y),
sw_preserve(y)
scsi 0:0:3:0: qdepth(32), tagged(1), simple(0), ordered(0),
scsi_level(7), cmd_que(1)
sd 0:0:3:0: Power-on or device reset occurred
sd 0:0:3:0: Attached scsi generic sg5 type 0
sd 0:0:3:0: [sdf] 1782579200 512-byte logical blocks: (912 GB/850 GiB)
sd 0:0:3:0: [sdf] Write Protect is off
sd 0:0:3:0: [sdf] Mode Sense: 9b 00 10 08
sd 0:0:3:0: [sdf] Write cache: enabled, read cache: enabled, supports
DPO and FUA
sdf: unknown partition table
sd 0:0:3:0: [sdf] Attached SCSI disk
mpt3sas0: port enable: SUCCESS
sd 0:0:0:0: attempting task abort! scmd(ffff880c36cf2a80)
sd 0:0:0:0: [sdc] CDB:
Read(10): 28 00 00 00 01 40 00 00 08 00
scsi target0:0:0: handle(0x0009), sas_address(0x4433221100000000), phy(0)
scsi target0:0:0: enclosure_logical_id(0x5003048016af7200), slot(3)
sd 0:0:0:0: task abort: FAILED scmd(ffff880c36cf2a80)
sd 0:0:3:0: attempting task abort! scmd(ffff880c358c4900)
sd 0:0:3:0: [sdf] CDB:
Inquiry: 12 00 00 00 24 00 00 00 00 00 00 00
scsi target0:0:3: handle(0x000c), sas_address(0x4433221103000000), phy(3)
scsi target0:0:3: enclosure_logical_id(0x5003048016af7200), slot(1)
sd 0:0:3:0: task abort: FAILED scmd(ffff880c358c4900)
sd 0:0:2:0: attempting task abort! scmd(ffff880c36cdc300)
sd 0:0:2:0: [sde] CDB:
Inquiry: 12 00 00 00 24 00 00 00 00 00 00 00
scsi target0:0:2: handle(0x000b), sas_address(0x4433221102000000), phy(2)
scsi target0:0:2: enclosure_logical_id(0x5003048016af7200), slot(0)
sd 0:0:2:0: task abort: FAILED scmd(ffff880c36cdc300)
sd 0:0:1:0: attempting task abort! scmd(ffff880c36cdd380)
sd 0:0:1:0: [sdd] CDB:
Read(10): 28 00 00 00 00 18 00 00 08 00
scsi target0:0:1: handle(0x000a), sas_address(0x4433221101000000), phy(1)
scsi target0:0:1: enclosure_logical_id(0x5003048016af7200), slot(2)
sd 0:0:1:0: task abort: FAILED scmd(ffff880c36cdd380)
sd 0:0:0:0: attempting device reset! scmd(ffff880c36cf2a80)
sd 0:0:0:0: [sdc] CDB:
Read(10): 28 00 00 00 01 40 00 00 08 00
scsi target0:0:0: handle(0x0009), sas_address(0x4433221100000000), phy(0)
scsi target0:0:0: enclosure_logical_id(0x5003048016af7200), slot(3)
sd 0:0:0:0: device reset: FAILED scmd(ffff880c36cf2a80)
sd 0:0:1:0: attempting device reset! scmd(ffff880c36cdd380)
sd 0:0:1:0: [sdd] CDB:
Read(10): 28 00 00 00 00 18 00 00 08 00
scsi target0:0:1: handle(0x000a), sas_address(0x4433221101000000), phy(1)
scsi target0:0:1: enclosure_logical_id(0x5003048016af7200), slot(2)
sd 0:0:1:0: device reset: FAILED scmd(ffff880c36cdd380)
sd 0:0:2:0: attempting device reset! scmd(ffff880c36cdc300)
sd 0:0:2:0: [sde] CDB:
Inquiry: 12 00 00 00 24 00 00 00 00 00 00 00
scsi target0:0:2: handle(0x000b), sas_address(0x4433221102000000), phy(2)
scsi target0:0:2: enclosure_logical_id(0x5003048016af7200), slot(0)
sd 0:0:2:0: device reset: FAILED scmd(ffff880c36cdc300)
sd 0:0:3:0: attempting device reset! scmd(ffff880c358c4900)
sd 0:0:3:0: [sdf] CDB:
Inquiry: 12 00 00 00 24 00 00 00 00 00 00 00
scsi target0:0:3: handle(0x000c), sas_address(0x4433221103000000), phy(3)
scsi target0:0:3: enclosure_logical_id(0x5003048016af7200), slot(1)
sd 0:0:3:0: device reset: FAILED scmd(ffff880c358c4900)
scsi target0:0:0: attempting target reset! scmd(ffff880c36cf2a80)
sd 0:0:0:0: [sdc] CDB:
Read(10): 28 00 00 00 01 40 00 00 08 00
scsi target0:0:0: handle(0x0009), sas_address(0x4433221100000000), phy(0)
scsi target0:0:0: enclosure_logical_id(0x5003048016af7200), slot(3)
scsi target0:0:0: target reset: FAILED scmd(ffff880c36cf2a80)
scsi target0:0:1: attempting target reset! scmd(ffff880c36cdd380)
sd 0:0:1:0: [sdd] CDB:
Read(10): 28 00 00 00 00 18 00 00 08 00
scsi target0:0:1: handle(0x000a), sas_address(0x4433221101000000), phy(1)
scsi target0:0:1: enclosure_logical_id(0x5003048016af7200), slot(2)
scsi target0:0:1: target reset: FAILED scmd(ffff880c36cdd380)
scsi target0:0:3: attempting target reset! scmd(ffff880c358c4900)
sd 0:0:3:0: [sdf] CDB:
Inquiry: 12 00 00 00 24 00 00 00 00 00 00 00
scsi target0:0:3: handle(0x000c), sas_address(0x4433221103000000), phy(3)
scsi target0:0:3: enclosure_logical_id(0x5003048016af7200), slot(1)
scsi target0:0:3: target reset: FAILED scmd(ffff880c358c4900)
scsi target0:0:2: attempting target reset! scmd(ffff880c36cdc300)
sd 0:0:2:0: [sde] CDB:
Inquiry: 12 00 00 00 24 00 00 00 00 00 00 00
scsi target0:0:2: handle(0x000b), sas_address(0x4433221102000000), phy(2)
scsi target0:0:2: enclosure_logical_id(0x5003048016af7200), slot(0)
scsi target0:0:2: target reset: FAILED scmd(ffff880c36cdc300)
mpt3sas0: attempting host reset! scmd(ffff880c36cdc300)
sd 0:0:2:0: [sde] CDB:
Inquiry: 12 00 00 00 24 00 00 00 00 00 00 00
mpt3sas0: sending diag reset !!
mpt3sas0: diag reset: SUCCESS
mpt3sas0: LSISAS3008: FWVersion(06.00.00.00), ChipRevision(0x02),
BiosVersion(07.00.00.00)
mpt3sas0: Protocol=(
Initiator,Target
), Capabilities=(
TLR,EEDP
,Snapshot Buffer,Diag Trace Buffer
,Task Set Full,NCQ
)
mpt3sas0: sending port enable !!
mpt3sas0: port enable: SUCCESS
mpt3sas0: search for end-devices: start
scsi target0:0:0: handle(0x0009), sas_addr(0x4433221100000000),
enclosure logical id(0x5003048016af7200), slot(3)
scsi target0:0:1: handle(0x000a), sas_addr(0x4433221101000000),
enclosure logical id(0x5003048016af7200), slot(2)
scsi target0:0:2: handle(0x000b), sas_addr(0x4433221102000000),
enclosure logical id(0x5003048016af7200), slot(0)
scsi target0:0:3: handle(0x000c), sas_addr(0x4433221103000000),
enclosure logical id(0x5003048016af7200), slot(1)
mpt3sas0: search for end-devices: complete
mpt3sas0: search for expanders: start
mpt3sas0: search for expanders: complete
mpt3sas0: host reset: SUCCESS scmd(ffff880c36cdc300)
sd 0:0:2:0: Power-on or device reset occurred
sd 0:0:3:0: Power-on or device reset occurred
sd 0:0:1:0: Power-on or device reset occurred
sd 0:0:0:0: Power-on or device reset occurred
mpt3sas0: removing unresponding devices: start
mpt3sas0: removing unresponding devices: end-devices
mpt3sas0: removing unresponding devices: expanders
mpt3sas0: removing unresponding devices: complete
mpt3sas0: scan devices: start
mpt3sas0: scan devices: expanders start
mpt3sas0: break from expander scan: ioc_status(0x0022),
loginfo(0x310f0400)
mpt3sas0: scan devices: expanders complete
mpt3sas0: scan devices: end devices start
mpt3sas0: break from end device scan: ioc_status(0x0022),
loginfo(0x310f0400)
mpt3sas0: scan devices: end devices complete
mpt3sas0: scan devices: complete
sd 0:0:1:0: attempting task abort! scmd(ffff880c36cdc780)
sd 0:0:1:0: CDB:
Test Unit Ready: 00 00 00 00 00 00
scsi target0:0:1: handle(0x000a), sas_address(0x4433221101000000), phy(1)
scsi target0:0:1: enclosure_logical_id(0x5003048016af7200), slot(2)
sd 0:0:1:0: task abort: FAILED scmd(ffff880c36cdc780)
sd 0:0:1:0: attempting device reset! scmd(ffff880c36cdc780)
sd 0:0:1:0: CDB:
Test Unit Ready: 00 00 00 00 00 00
scsi target0:0:1: handle(0x000a), sas_address(0x4433221101000000), phy(1)
scsi target0:0:1: enclosure_logical_id(0x5003048016af7200), slot(2)
sd 0:0:1:0: device reset: FAILED scmd(ffff880c36cdc780)
scsi target0:0:1: attempting target reset! scmd(ffff880c36cdc780)
sd 0:0:1:0: CDB:
Test Unit Ready: 00 00 00 00 00 00
scsi target0:0:1: handle(0x000a), sas_address(0x4433221101000000), phy(1)
scsi target0:0:1: enclosure_logical_id(0x5003048016af7200), slot(2)
scsi target0:0:1: target reset: FAILED scmd(ffff880c36cdc780)
mpt3sas0: attempting host reset! scmd(ffff880c36cdc780)
sd 0:0:1:0: CDB:
Test Unit Ready: 00 00 00 00 00 00
mpt3sas0: sending diag reset !!
mpt3sas0: diag reset: SUCCESS
mpt3sas0: LSISAS3008: FWVersion(06.00.00.00), ChipRevision(0x02),
BiosVersion(07.00.00.00)
mpt3sas0: Protocol=(
Initiator,Target
), Capabilities=(
TLR,EEDP
,Snapshot Buffer,Diag Trace Buffer
,Task Set Full,NCQ
)
mpt3sas0: sending port enable !!
mpt3sas0: port enable: SUCCESS
mpt3sas0: search for end-devices: start
scsi target0:0:0: handle(0x0009), sas_addr(0x4433221100000000),
enclosure logical id(0x5003048016af7200), slot(3)
scsi target0:0:1: handle(0x000a), sas_addr(0x4433221101000000),
enclosure logical id(0x5003048016af7200), slot(2)
scsi target0:0:2: handle(0x000b), sas_addr(0x4433221102000000),
enclosure logical id(0x5003048016af7200), slot(0)
scsi target0:0:3: handle(0x000c), sas_addr(0x4433221103000000),
enclosure logical id(0x5003048016af7200), slot(1)
mpt3sas0: search for end-devices: complete
mpt3sas0: search for expanders: start
mpt3sas0: search for expanders: complete
mpt3sas0: host reset: SUCCESS scmd(ffff880c36cdc780)
sd 0:0:1:0: Power-on or device reset occurred
sd 0:0:3:0: Power-on or device reset occurred
mpt3sas0: removing unresponding devices: start
mpt3sas0: removing unresponding devices: end-devices
mpt3sas0: removing unresponding devices: expanders
mpt3sas0: removing unresponding devices: complete
mpt3sas0: scan devices: start
mpt3sas0: scan devices: expanders start
mpt3sas0: break from expander scan: ioc_status(0x0022),
loginfo(0x310f0400)
mpt3sas0: scan devices: expanders complete
mpt3sas0: scan devices: end devices start
mpt3sas0: break from end device scan: ioc_status(0x0022),
loginfo(0x310f0400)
mpt3sas0: scan devices: end devices complete
mpt3sas0: scan devices: complete
sd 0:0:2:0: Power-on or device reset occurred
sd 0:0:0:0: Power-on or device reset occurred
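
To help read the timed-out commands in the log above, here is a minimal Python sketch that decodes the two Read(10) CDBs. It assumes the standard 10-byte Read(10) layout (opcode 0x28, big-endian LBA in bytes 2-5, transfer length in bytes 7-8) and the 512-byte logical block size reported by the sd lines. The function name decode_read10 is hypothetical and only for illustration.

```python
def decode_read10(cdb_hex: str) -> dict:
    """Decode a Read(10) CDB given as space-separated hex bytes."""
    raw = bytes.fromhex(cdb_hex.replace(" ", ""))
    if len(raw) != 10 or raw[0] != 0x28:
        raise ValueError("not a 10-byte Read(10) CDB")
    return {
        # Bytes 2-5: logical block address, big-endian.
        "lba": int.from_bytes(raw[2:6], "big"),
        # Bytes 7-8: transfer length in logical blocks, big-endian.
        "blocks": int.from_bytes(raw[7:9], "big"),
    }


if __name__ == "__main__":
    # CDBs copied from the task abort messages for sdc and sdd.
    samples = {
        "sdc": "28 00 00 00 01 40 00 00 08 00",
        "sdd": "28 00 00 00 00 18 00 00 08 00",
    }
    block_size = 512  # assumption: logical block size from the sd lines
    for dev, cdb in samples.items():
        info = decode_read10(cdb)
        print(dev, "LBA", info["lba"], "blocks", info["blocks"],
              "bytes", info["blocks"] * block_size)
```

Under these assumptions both commands are small 8-block (4096-byte) reads near the start of the disk (LBA 320 on sdc, LBA 24 on sdd), so the timeouts do not appear to depend on large or unusual I/O.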
Stefan