Problems with cciss? (possibly interacting with afs?)
Wondering if anyone else has seen any similar problems.. We've had several different hosts Oops and panic in the last couple of weeks, most of them several times. Every oops seems to be on a different process, but they all seem to relate to the filesystems... sometimes a reiserfs one, sometimes the afs cache filesystem which isn't reiser. The similarities between the systems are: - all HP hosts with smartarray controllers and 2 disks HW mirrored. - All running sles9-sp1 (with kernel 2.6.5-7.147 from suse 9.2 src.rpm... it had autofs4 patches we needed) and openafs-1.3.85. - Not all the same exact model of host, one is a blade, but it's also HW mirrored. We have a ton of similar hosts, but they're not mirroring disk, except our syslog host, which is doing raid-1+0 and isn't running afs. Wondering about possible bug between libafs module and cciss module. Working with novell on cases, but wanted to see if anyone else has seen sporadic oopses with similar hw. We're trying different things like updating kernel and catching up on openafs, but we have to do testing before pushing out to production so it takes time. What's really odd is one of the hosts that's having the issues was up for weeks without any problems, but has crashed multiple times in the last week. No indications of disk failures in logs or in HP agents.. though one blade did have a light on the chassis indicating a disk failure whereas the agents didn't report it. -- Mike Marion-Unix SysAdmin/Staff Engineer-http://www.qualcomm.com "You think it's a conspiracy by the networks to put bad shows on TV. But the shows are bad because that's what people want. It's not like Windows users don't have any power. I think they are happy with Windows, and that's an incredibly depressing thought." -- Steve Jobs
participants (1)
-
Mike Marion