Strange server crash
Hi to all,

We have a Fujitsu Siemens server: dual Xeon 2.8, 160GB SATA HDD configured in software RAID. Apache, PHP, vsftp and a PostgreSQL database are running on it. SUSE 9.1 Professional (64-bit) is installed.

About once a month the server just freezes. Sometimes it still responds to ping, sometimes not. I cannot connect to the server by any means. None of the services respond any more: HTTP, FTP, Samba, etc. I cannot tell whether the console works because I'm not near the server. I have gone over the log files again and again and couldn't see anything wrong. Or maybe my log settings are not right?

Any clues?

Thanks in advance.
Andy
On Wednesday 12 October 2005 17.57, Andy wrote:
[...]
Does it thaw by itself, or do you have to do a hard reboot?
Have you checked whether any cron jobs run on a monthly basis?
What are the last things that happen before the freeze?
Is there anything at all in the logs that could indicate some action or program suddenly dropping?

--
/Rikard
-----------------------------------------------------------------------------
email : rikard.j@rikjoh.com
web   : http://www.rikjoh.com
mob   : +46 (0)736 19 76 25
------------------------ Public PGP fingerprint ----------------------------
< 15 28 DF 78 67 98 B2 16 1F D3 FD C5 59 D4 B6 78 46 1C EE 56 >
Andy wrote:
[...]
Hi,

Is ACPI enabled by accident?

Regards
Reto
Kernel panics? Is the box/CPU overheating?

Tim Rainier
Information Services, Kalsec, INC
trainier@kalsec.com
I will respond to all:

To restart the server we had to do a hard reboot.

We have 1 GB RAM and 1 GB swap space. There is plenty of free space (several GB) on the hard disks, on all partitions except /boot.

The hang-up occurs randomly: sometimes after 3 weeks, sometimes after 2 months, sometimes after 1 month.

free -s -m5:

                   total       used       free     shared    buffers     cached
Mem:             1027964     958260      69704          0      45416     729952
-/+ buffers/cache:           182892     845072
Swap:            1052248      90976     961272

From what I see, ACPI is enabled. I will disable it.

/etc/lilo.conf

image = /boot/vmlinuz
###Don't change this comment - YaST2 identifier: Original name: linux###
    label = Linux
    initrd = /boot/initrd
    root = /dev/md0
    append = "resume=/dev/sda2 splash=silent console=tty0 desktop"
    vga = 0x317

To disable ACPI I change this line:

    append = "resume=/dev/sda2 splash=silent acpi=off console=tty0 desktop"

right?

Overheating is excluded.

I attached the boot.msg. I'm not very experienced in interpreting those messages.

regards,
Andy.

----- Original Message -----
From: <trainier@kalsec.com>
To: <suse-security@suse.com>
Sent: Thursday, October 13, 2005 12:01 AM
Subject: Re: [suse-security] Strange server crash
Kernel panics? Is the box/cpu overheating?

Tim Rainier
Information Services, Kalsec, INC
trainier@kalsec.com
-- Check the headers for your unsubscription address For additional commands, e-mail: suse-security-help@suse.com Security-related bug reports go to security@suse.de, not here
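A note on the acpi=off change to /etc/lilo.conf proposed in Andy's message: LILO only picks up changes to its configuration when /sbin/lilo is run again, so editing the append line alone is not enough. After the next reboot, the parameters the kernel was actually booted with can be read from /proc/cmdline. The following is a minimal sketch of such a check; the function name has_kernel_param is hypothetical and not from the thread.

```shell
#!/bin/sh
# Return success if the given kernel parameter appears on the kernel command line.
# The second argument defaults to /proc/cmdline, which holds the parameters
# the running kernel was booted with.
has_kernel_param() {
    param="$1"
    cmdline_file="${2:-/proc/cmdline}"
    # Split the command line into one word per line and match the whole word.
    tr ' ' '\n' < "$cmdline_file" | grep -qxF -- "$param"
}

if has_kernel_param "acpi=off"; then
    echo "acpi=off is active"
else
    echo "acpi=off is NOT active; was /sbin/lilo re-run after editing lilo.conf?"
fi
```

The match is done on whole words so that a parameter such as pci=noacpi is not mistaken for acpi=off.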
Ohh, just the usual bad kernel release from the SUSE folks ;-|. It happens all the time. We have 2 servers that used to run Red Hat 9.0; usual uptimes were about 80-90 days, with no kernel errors and no crashes, stable as a rock. Now we are running SUSE 9.3 with all YOU updates, and uptimes range from 1 hour to 1 week, followed by a kernel panic or crash...

What kernel should one use with 9.3? A virgin one from kernel.org? Oh no, no, please. Or the buggy one fucked up by the SUSE hackers?
What do you mean by 'overheating is excluded'?

The fact is, the "freezing" you're referring to generally has four very common causes:

1.) The CPU is overheating.
2.) The power supply unit is failing.
3.) The voltage regulator on the motherboard is failing.
4.) The system has run out of memory AND swap.

Instead of saying "that's not possible", at least make sure that none of these is causing the problem.

The memory and swap case is a little hard to determine because once the system has crashed you can't see how much memory and swap was in use. So, write a shell script and cron it to run every 5 or 10 minutes. The script can append the memory/swap usage, with a timestamp, to a log file that you can look at once you bring the system back up after a crash.

The next piece is kernel panics. Type:

cat /proc/sys/kernel/panic

What's the output?

Tim Rainier
Information Services, Kalsec, INC
trainier@kalsec.com

"Andy" <frum@ar-sd.net> wrote on 10/13/2005 04:26:32 AM:
[...]
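The cron-driven logging script Tim describes could look roughly like the sketch below. It assumes a Linux /proc/meminfo; the function name log_mem_usage, the script name memwatch.sh and the log path are hypothetical and not from the thread.

```shell
#!/bin/sh
# Append one timestamped line of memory and swap figures (in kB) to a log file.
# Usage: memwatch.sh LOGFILE   (script name and log path are hypothetical)

log_mem_usage() {
    logfile="$1"
    meminfo="${2:-/proc/meminfo}"   # second argument only exists to ease testing
    awk -v ts="$(date '+%Y-%m-%d %H:%M:%S')" '
        /^MemTotal:/  { mt = $2 }
        /^MemFree:/   { mf = $2 }
        /^Buffers:/   { bu = $2 }
        /^Cached:/    { ca = $2 }
        /^SwapTotal:/ { st = $2 }
        /^SwapFree:/  { sf = $2 }
        END {
            # "used" includes buffers and cache, like the Mem: line of free.
            printf "%s mem_used_kB=%d mem_free_kB=%d buffers_kB=%d cached_kB=%d swap_used_kB=%d swap_free_kB=%d\n", ts, mt - mf, mf, bu, ca, st - sf, sf
        }' "$meminfo" >> "$logfile"
}

# Only log when a log file was given on the command line.
if [ "$#" -gt 0 ]; then
    log_mem_usage "$1"
fi
```

A crontab entry along these lines (paths again hypothetical) would run it every 5 minutes:

    */5 * * * * /usr/local/sbin/memwatch.sh /var/log/memwatch.log

After a crash, the last lines of the log show whether free memory and free swap were both approaching zero. Note that the log is only useful if the partition holding it does not fill up itself.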
Do you have enough main memory?

Try 'free -m -s5' to monitor your memory usage.

How large is your swap partition?

df -h

Check /var/log/boot.msg, messages and warn for anything amiss.

Are you running out of disk space on your log partition?

Keith
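The disk-space question above can also be checked from a script. This is a minimal sketch, assuming a df that supports the POSIX -P option; the function name fs_usage_percent and the 90 percent threshold are assumptions, not values from the thread.

```shell
#!/bin/sh
# Print the usage percentage (without the % sign) of the filesystem holding a path.
fs_usage_percent() {
    # -P forces the POSIX one-line-per-filesystem format; column 5 is "Capacity".
    df -P "$1" | awk 'NR == 2 { sub(/%/, "", $5); print $5 }'
}

LOG_DIR="${LOG_DIR:-/var/log}"
THRESHOLD=90   # assumed warning level, in percent

used=$(fs_usage_percent "$LOG_DIR")
if [ "${used:-0}" -ge "$THRESHOLD" ]; then
    echo "WARNING: filesystem holding $LOG_DIR is ${used}% full"
else
    echo "filesystem holding $LOG_DIR is ${used:-?}% full"
fi
```

The sketch assumes the device name printed by df contains no spaces, which would shift the columns.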
I am a new SUSE SLE 8 user. I installed Samba last week. I restarted the system and X could not be started. I logged in as root and found that all the files are marked read-only. I checked the log files in /var/log and found some log files from unknown machines. Is this a malicious attack? Thanks for any clues.

-peter-

..... starting services at daemon failed
.... startproc: exit status of parent of /usr/sbin/cron:1 failed
.... Failed services in runlevel 3: random atd cron
Check your backup processes. I had kernel panics when using tar to archive to a single file over 20G. It seems the combination of tar, gzip and reiserfs did not like the large archives and would cause a memory condition that sent the machines into kernel panic.

During a kernel panic nothing gets written to disk at all, so your logs just stop dead. I found that shortly before this dead state I was unable to get onto the machine although it would still respond, and then it would die completely, without even ICMP responses.

What scripts do you run?
participants (8)
- Andy
- b@rry
- Reto Inversini
- Rikard Johnels
- suse-list
- suse@karsites.net
- trainier@kalsec.com
- Yixin Luo