On Thu, 2005-10-06 at 09:15 +0000, Albert wrote:
Hi all,
I have 3 HP Proliant DL360s:
• 2 x Single Xeon 3.4GHz; 1GB RAM; 72GB SCSI RAID; SuSE 9.1
• 1 x Dual Xeon 3.4GHz; 2GB RAM; 72GB SCSI RAID; SuSE 9.3 (64 bit)
The dual processor machine runs without any problems. Power to this machine comes from a 1KVA UPS with diesel generator backup. It is installed in a server room with about thirty other servers.
The one single processor machine runs without any problems. The other single processor machine stops after running for anything from 4 hours to 2 months. Power to these machines comes from a 5KVA UPS. It is installed in a rack in an empty office. The office does not have an air conditioner.
No AC? How warm does the room get? You may have a thermal problem: the CPU could be overheating.
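If you get the chance to test this before the machine goes back to HP, one way to check the thermal theory is to log temperatures to disk at regular intervals, so that the last reading before a hang survives the reboot. Below is a minimal sketch in Python, not a tested tool for your hardware. The sensor paths and both file formats are assumptions (which one exists, if any, depends on the kernel and the ACPI support on the box), and the log file name and interval are made up for illustration.

```python
import glob
import os
import re
import time

# Assumed sensor locations; adjust to whatever the machine actually exposes.
SENSOR_GLOBS = [
    "/sys/class/thermal/thermal_zone*/temp",   # assumed: bare integer, millidegrees C
    "/proc/acpi/thermal_zone/*/temperature",   # assumed: "temperature:   45 C"
]


def parse_celsius(text):
    """Return degrees Celsius from either assumed format, or None if unparseable."""
    text = text.strip()
    if re.fullmatch(r"-?\d+", text):
        # A bare integer is treated as millidegrees Celsius.
        return int(text) / 1000.0
    match = re.search(r"(-?\d+)\s*C\b", text)
    return float(match.group(1)) if match else None


def read_sensors():
    """Return {path: degrees C} for every readable sensor file found."""
    readings = {}
    for pattern in SENSOR_GLOBS:
        for path in glob.glob(pattern):
            try:
                with open(path) as handle:
                    value = parse_celsius(handle.read())
            except OSError:
                continue  # sensor vanished or is unreadable; skip it
            if value is not None:
                readings[path] = value
    return readings


def log_forever(logfile="/var/log/temps.log", interval=60):
    """Append one timestamped line per sensor every `interval` seconds."""
    while True:
        stamp = time.strftime("%Y-%m-%d %H:%M:%S")
        with open(logfile, "a") as out:
            for path, value in sorted(read_sensors().items()):
                out.write(f"{stamp} {path} {value:.1f}C\n")
            # Force the data to disk so it is not lost if the machine hangs.
            out.flush()
            os.fsync(out.fileno())
        time.sleep(interval)
```

Calling log_forever() as root in the background and reading the tail of the log after the next hang would show whether the temperature was climbing beforehand. If no sensor files exist on that box, the log simply stays empty and this approach tells you nothing.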
All three machines run the same software. Apart from the different OS versions, and the one machine having an additional processor and more RAM, they are identical.
When the machine stops running:
- it still has power
- the power LED is still on
- the CD tray opens
- the link LEDs on both the NIC and the switch are on
- there is no network traffic
- the machine does not respond on the network
- the screen is normally in power save mode and therefore blank
- the machine does not respond to keyboard key presses
- there are no lights flashing on the keyboard
Has anyone heard of this before? Or does anybody have an idea of what it could be?
I have the machine at my disposal for a couple of hours. During this time I need to remove all the data from it so that we can send the machine to HP.
Albert
--
Ken Schneider
UNIX since 1989, linux since 1994, SuSE since 1998