On Tue, 2006-09-26 at 07:54 -0400, Carl Hartung wrote:
On Monday 25 September 2006 23:42, Peter Sjoberg wrote:
I have a dual opteron system running OpenSuse 10.1 and it has aa very bad habit of hanging. With hang I mean that it's nothing on the screen, no oops, sysrq s,e,i,u,m,t doesn't do anything but b did reboot, sometimes at least. I did add "nmi_watchdog=2" and sending syslog error..crit to a different node in the hope that I would catch something but still I have nothing.
This is a classic symptom of marginal memory, Peter. I'd try a different set of RAM modules, preferably a factory matched set guaranteed for that class of host produced by an established and reputable brand manufacturer. Normally I would agree but this happens to be my primary server so I put some extra $$ on it and have matched pairs of Kingston DDR PC3200/ECC/REG (KVR400D8R3AK2/1G) modules. Since ECC is enabled I would expect it to complain somewhere if it discovered ECC errors. Also, as a test I tried to provoke the system to hang by compile the kernel in a loop, worked fine for 35h
IMHO, the supplier should also offer a 'free' (built into the price) or low cost lifetime warranty and 24 hour advance replacement service. Such companies, in my experience, are generally more stable and reliable to work with in the long run.
hth & regards,
Carl --------------------------------------------------------------------- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org