[opensuse] comp cant be access, hd failure ?
Friends, I install 11.1 for several days, in last few days my system cant be acces in the morning, I can ping to it, but I cannot ssh and samba didnt work. When I check /var/log/messages, I some errors. Then after I restart, i see almost same errors (you can see it bellow). I that because of Hardware failure ? my Hardisk ? I have 2 disk, 1 with SATA (/dev/sda), 1 with IDE (/dev/sdb). regards, linux-gkfg:/home/arie # mount /dev/sdb5 on / type ext3 (rw,acl,user_xattr) /proc on /proc type proc (rw) sysfs on /sys type sysfs (rw) debugfs on /sys/kernel/debug type debugfs (rw) udev on /dev type tmpfs (rw) devpts on /dev/pts type devpts (rw,mode=0620,gid=5) /dev/sda1 on /back type ext3 (rw,acl,user_xattr) /dev/sdb1 on /boot type ext2 (rw,acl,user_xattr) /dev/sdb6 on /home type xfs (rw) fusectl on /sys/fs/fuse/connections type fusectl (rw) securityfs on /sys/kernel/security type securityfs (rw) none on /proc/sys/fs/binfmt_misc type binfmt_misc (rw) gvfs-fuse-daemon on /home/arie/.gvfs type fuse.gvfs-fuse-daemon (rw,nosuid,nodev,user=arie) /var/log/messages/ Feb 9 21:00:01 linux-gkfg /usr/sbin/cron[4959]: (root) CMD (/root/bin/backup.sh) Feb 9 21:19:39 linux-gkfg smartd[3757]: Device: /dev/sda [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 65 to 66 Feb 9 21:19:39 linux-gkfg smartd[3757]: Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 35 to 34 Feb 9 21:19:40 linux-gkfg smartd[3757]: Device: /dev/sdb [SAT], 1 Currently unreadable (pending) sectors Feb 9 21:19:40 linux-gkfg smartd[3757]: Device: /dev/sdb [SAT], 1 Offline uncorrectable sectors Feb 9 21:39:40 linux-gkfg -- MARK -- Feb 9 21:49:37 linux-gkfg syslog-ng[2018]: Log statistics; dropped='pipe(/dev/xconsole)=0', dropped='pipe(/dev/tty10)=0', processed='center(queued)=22', processed='center(received)=14', processed='destination(newsnotice)=0', processed='destination(acpid)=0', processed='destination(firewall)=0', processed='destination(null)=0', processed='destination(mail)=0', processed='destination(mailinfo)=0', processed='destination(console)=1', processed='destination(newserr)=0', processed='destination(newscrit)=0', processed='destination(messages)=14', processed='destination(mailwarn)=0', processed='destination(localmessages)=1', processed='destination(netmgm)=0', processed='destination(mailerr)=0', processed='destination(xconsole)=1', processed='destination(warn)=5', processed='source(src)=14' Feb 9 21:49:39 linux-gkfg smartd[3757]: Device: /dev/sda [SAT], SMART Prefailure Attribute: 1 Raw_Read_Error_Rate changed from 109 to 111 Feb 9 21:49:39 linux-gkfg smartd[3757]: Device: /dev/sda [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 66 to 65 Feb 9 21:49:39 linux-gkfg smartd[3757]: Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 34 to 35 Feb 9 21:49:39 linux-gkfg smartd[3757]: Device: /dev/sdb [SAT], 1 Currently unreadable (pending) sectors Feb 9 21:49:39 linux-gkfg smartd[3757]: Device: /dev/sdb [SAT], 1 Offline uncorrectable sectors Feb 9 22:09:39 linux-gkfg -- MARK -- Feb 10 08:38:14 linux-gkfg syslog-ng[2035]: syslog-ng starting up; version='2.0.9' Feb 10 08:38:14 linux-gkfg rchal: CPU frequency scaling is not supported by your processor. After I restart (press reset button cause I cannot login to it), I saw these too. Feb 10 08:38:30 linux-gkfg smartd[3607]: Opened configuration file /etc/smartd.conf Feb 10 08:38:30 linux-gkfg smartd[3607]: Drive: DEVICESCAN, implied '-a' Directive on line 26 of file /etc/smartd.conf Feb 10 08:38:30 linux-gkfg smartd[3607]: Configuration file /etc/smartd.conf was parsed, found DEVICESCAN, scanning devices Feb 10 08:38:30 linux-gkfg smartd[3607]: Device: /dev/sda, type changed from 'scsi' to 'sat' Feb 10 08:38:30 linux-gkfg smartd[3607]: Device: /dev/sda [SAT], opened Feb 10 08:38:30 linux-gkfg smartd[3607]: Device: /dev/sda [SAT], found in smartd database. Feb 10 08:38:30 linux-gkfg smartd[3607]: Device: /dev/sda [SAT], is SMART capable. Adding to "monitor" list. Feb 10 08:38:30 linux-gkfg smartd[3607]: Device: /dev/sda [SAT], state read from /var/lib/smartmontools/smartd.ST380211AS-5PS17R71.ata.state Feb 10 08:38:30 linux-gkfg smartd[3607]: Device: /dev/sdb, type changed from 'scsi' to 'sat' Feb 10 08:38:30 linux-gkfg smartd[3607]: Device: /dev/sdb [SAT], opened Feb 10 08:38:30 linux-gkfg smartd[3607]: Device: /dev/sdb [SAT], found in smartd database. Feb 10 08:38:31 linux-gkfg smartd[3607]: Device: /dev/sdb [SAT], is SMART capable. Adding to "monitor" list. Feb 10 08:38:31 linux-gkfg /usr/sbin/cron[3669]: (CRON) STARTUP (V5.0) Feb 10 08:38:31 linux-gkfg smartd[3607]: Device: /dev/sdb [SAT], state read from /var/lib/smartmontools/smartd.ST380011A-5MS0WCCX.ata.state Feb 10 08:38:31 linux-gkfg smartd[3607]: Monitoring 2 ATA and 0 SCSI devices Feb 10 08:38:31 linux-gkfg kernel: eth0: no IPv6 routers present Feb 10 08:38:31 linux-gkfg smartd[3607]: Device: /dev/sda [SAT], SMART Prefailure Attribute: 1 Raw_Read_Error_Rate changed from 111 to 120 Feb 10 08:38:31 linux-gkfg smartd[3607]: Device: /dev/sda [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 65 to 68 Feb 10 08:38:31 linux-gkfg smartd[3607]: Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 35 to 32 Feb 10 08:38:31 linux-gkfg smartd[3607]: Device: /dev/sda [SAT], SMART Usage Attribute: 195 Hardware_ECC_Recovered changed from 47 to 73 Feb 10 08:38:32 linux-gkfg smartd[3607]: Device: /dev/sdb [SAT], 1 Currently unreadable (pending) sectors Feb 10 08:38:32 linux-gkfg smartd[3607]: Device: /dev/sdb [SAT], 1 Offline uncorrectable sectors Feb 10 08:38:32 linux-gkfg smartd[3607]: Device: /dev/sdb [SAT], SMART Prefailure Attribute: 1 Raw_Read_Error_Rate changed from 61 to 63 Feb 10 08:38:32 linux-gkfg smartd[3607]: Device: /dev/sdb [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 32 to 29 Feb 10 08:38:32 linux-gkfg smartd[3607]: Device: /dev/sdb [SAT], SMART Usage Attribute: 195 Hardware_ECC_Recovered changed from 61 to 63 Feb 10 08:38:32 linux-gkfg smartd[3607]: Device: /dev/sda [SAT], state written to /var/lib/smartmontools/smartd.ST380211AS-5PS17R71.ata.state Feb 10 08:38:32 linux-gkfg smartd[3607]: Device: /dev/sdb [SAT], state written to /var/lib/smartmontools/smartd.ST380011A-5MS0WCCX.ata.state Feb 10 08:38:32 linux-gkfg smartd[3691]: smartd has fork()ed into background mode. New PID=3691. Feb 10 08:38:32 linux-gkfg sshd[3722]: Server listening on 0.0.0.0 port 22. Feb 10 08:38:32 linux-gkfg sshd[3722]: Server listening on :: port 22. Feb 10 08:38:33 linux-gkfg kernel: Not cloning cgroup for unused subsystem ns Feb 10 08:39:53 linux-gkfg pulseaudio[4270]: pid.c: Stale PID file, overwriting. -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
On Monday February 9 2009, Arie Reynaldi Z wrote:
Friends,
I install 11.1 for several days, in last few days my system cant be acces in the morning, I can ping to it, but I cannot ssh and samba didnt work. When I check /var/log/messages, I some errors. Then after I restart, i see almost same errors (you can see it bellow). I that because of Hardware failure ? my Hardisk ? I have 2 disk, 1 with SATA (/dev/sda), 1 with IDE (/dev/sdb).
regards,
Please remember to avoid wrapping lines of the sort that follow.
linux-gkfg:/home/arie # mount ...
/var/log/messages/ Feb 9 21:00:01 linux-gkfg /usr/sbin/cron[4959]: (root) CMD (/root/bin/backup.sh) Feb 9 21:19:39 linux-gkfg smartd[3757]: Device: /dev/sda [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 65 to 66 Feb 9 21:19:39 linux-gkfg smartd[3757]: Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 35 to 34 Feb 9 21:19:40 linux-gkfg smartd[3757]: Device: /dev/sdb [SAT], 1 Currently unreadable (pending) sectors Feb 9 21:19:40 linux-gkfg smartd[3757]: Device: /dev/sdb [SAT], 1 Offline uncorrectable sectors
Here you're seeing that your hard drive has errors that cannot be corrected (the data redundancy is not sufficient to correct the errors). You are best advised to avoid using this drive immediately, replace it and restore or reinstall whatever it held.
...
Randall Schulz -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
I install 11.1 for several days, in last few days my system cant be acces in the morning, I can ping to it, but I cannot ssh and samba didnt work. When I check /var/log/messages, I some errors. Then after I restart, i see almost same errors (you can see it bellow). I that because of Hardware failure ? my Hardisk ? I have 2 disk, 1 with SATA (/dev/sda), 1 with IDE (/dev/sdb).
regards,
Please remember to avoid wrapping lines of the sort that follow. Sorry.. :)
linux-gkfg:/home/arie # mount
Feb 9 21:19:39 linux-gkfg smartd[3757]: Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 35 to 34 Feb 9 21:19:40 linux-gkfg smartd[3757]: Device: /dev/sdb [SAT], 1 Currently unreadable (pending) sectors Feb 9 21:19:40 linux-gkfg smartd[3757]: Device: /dev/sdb [SAT], 1 Offline uncorrectable sectors
Here you're seeing that your hard drive has errors that cannot be corrected (the data redundancy is not sufficient to correct the errors).
You are best advised to avoid using this drive immediately, replace it and restore or reinstall whatever it held.
Ok. What if I use fsck.ext3, would be any good ? Anyway i will replace this hard drive tonight, cause it's production server :( Now I run crontab to bacekup almost every hour.. so if anything goes wrong i still can restore to earliest hour..
Randall Schulz
Thank you.. -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
On Monday February 9 2009, Arie Reynaldi Z wrote:
...
Here you're seeing that your hard drive has errors that cannot be corrected (the data redundancy is not sufficient to correct the errors).
You are best advised to avoid using this drive immediately, replace it and restore or reinstall whatever it held.
Ok. What if I use fsck.ext3, would be any good? Anyway i will replace this hard drive tonight, cause it's production server :( Now I run crontab to bacekup almost every hour.. so if anything goes wrong i still can restore to earliest hour..
Fsck cannot help. This is failing / failed hardware. It's the walking dead. I would shutdown anything that might use that drive. It can only get worse if you keep trying to use it, though that's not a certainty. Reading is probably safe, but writing to it in any way is probably not a good idea. And if you copy data from it, that data must be considered suspect until verified. Don't let anything read from it overwrite a backup from before the failure manifested itself. Given that there is currently only one block showing an error, this is probably overstated, but it's better not to exacerbate the problem. Randall Schulz -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On Tuesday, 2009-02-10 at 08:55 +0700, Arie Reynaldi Z wrote:
Feb 9 21:49:39 linux-gkfg smartd[3757]: Device: /dev/sdb [SAT], 1 Currently unreadable (pending) sectors Feb 9 21:49:39 linux-gkfg smartd[3757]: Device: /dev/sdb [SAT], 1 Offline uncorrectable sectors
I think you should run the smart short and long tests. - -- Cheers, Carlos E. R. -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.9 (GNU/Linux) iEYEARECAAYFAkmQ4dEACgkQtTMYHG2NR9WDkgCfTHKcLP7F52Tr6vcxjd84R6FV oEIAmQE300NptiV0hvsWZ5RKe4bf3k0u =0i0l -----END PGP SIGNATURE----- -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
participants (3)
-
Arie Reynaldi Z
-
Carlos E. R.
-
Randall R Schulz