I just noticed a 1.02 BIOS go up in the Tyan directory. You might want to wait a bit before trying it out, since it seems to have re-broken DMA to the 3ware card and eaten my / again...
Was this with the latest update kernel ? And do you have more than 3GB of RAM?
This was both after boot and during POST (corrupt card ID messages and system hang), so it's definately a hardware thing. Tyan was quite helpful and pointed me at this: https://www.3ware.com/kbadmin/attachments/TM900-0045-00%20Rev%20A_P.pdf which hints that all these weirdo problems might involve sketchy signalling that's just getting aggravated by some weird timing somewhere or something. Nice, eh? I hope 3ware replaces these 36 cards I just bought. I kindof got the feeling from Tyan that they consider this to be 3ware's problem and that the issue is closed. But, I'm not using risers, and my system is rock solid (up, thrashing disk for over two weeks with no problems) in the 0.01b BIOS and dies about 20% of the time during POST with 1.02, and blows panic or dynamic linking chunks very quickly after boot in that other 80%. And presumably a PCI parity error would be detected; I see no messages mentioning any such thing -- I just see silent data corruption across the bus just like the iommu problem (except that it happens at POST now too). And the 3ware iommu bug also hit Qlogic cards, yes? I can't believe those have signalling problems considering they plug into every weirdo PCI-carrying not-a-PC jumbo datacenter rack monster on the planet without a problem. But recall that the iommu bug (as far as I know) was never explained -- it was merely noted that the flush optimization triggered it, so the optimization was backed out. So it might have been the same tim'rous signalling beastie aggravated in both cases...er, I guess. The upshot at the moment seems to be that if Tyan hears more complaints from more people for more cards than just the 3ware, perhaps we'll here more. Until then, it feels like they're going to treat the card (rightly, perhaps) as not worth engineering thought. Bummer for us folks who bought a zillion and now have to play BIOS highwire trying to get them to work. -mcq