----- Original Message -----
From: "Kelly Burkhart"
I am running SuSE 9.3 on a dual Opteron 246 system (Tyan Thunder K8SR (S2881)).
This machine has been running fine for several weeks using the on board SCSI controllers & SW RAID. We installed the LSI MegaRAID 320-2X controller with the TBBU03 battery backup and reinstalled SuSE 9.3.
The installation did not go flawlessly: - Towards the end of the NFS installation, copying stopped. I switched consoles and noticed that I couldn't ping any other addresses in our network. I brought the ethernet interface down then back up and the installation resumed. - After the install, both megaraid and megaraid_mbox modules were loaded. I removed megaraid from INITRD_MODULES and ran mkinitrd.
At this point we considered the installation suspect, but pressed on with testing.
Now the corruption:
One user was compiling code, another user was loading a database. After some time, g++ started getting internal compiler errors in cc1plus. I compared the checksum of this program with another install and they were different. I reinstalled gcc, verified checksums and everything worked for a while. Then internal compiler errors and bogus checksum again.
I did not see anything alarming in the log files. So I booted into rescue mode and ran reiserfsck --check without any problems. Flashed the controller with the latest firmware and booted again.
This time I set up a 'make clean; make' loop and watched it for about two hours without any problems. Then I started creating 4GB files with dd and deleting them and within 15 minutes another internal compiler error.
Questions:
- Does anyone run a similar setup without problems? - Has anyone seen similar problems? - Can anyone provide me some direction in tracking down this problem?
I assume you are using some type of RAID configuration. Are you running the megaraid monitoring software? If so any errors in the megaserv.log file? I am not sure how doing a compile has anything to do with file corruption. This sounds more like a memory problem to me. Have you done a complete memory test with memtest? That would be the first thing I would do. Also is the card in the middle slot or the left outside slot? Brad Dameron Systems Administrator SeaTab Software www.seatab.com