[opensuse] Filesystem corrupted or what?
Hi People, openSuSE 10.2, never had a problem with filesystem but suddenly it seems it has become "bad" as it takes something like 10 or more seconds to open a file, especially the biggest ones, and in the meanwhile it doesn't allow to do anything else but wait. I've already run fsck (ext3) to check it but no error is shown and yet I still have big troubles working with big files (>100MB). In addition, during the wait, the hard disk sounds like it is doing such a big work that it's so busy it can't even move my mouse. Quite strange. Could it even be an hardware (hard disk) problem? Regards, Jan -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
jan kalcic wrote:
In addition, during the wait, the hard disk sounds like it is doing such a big work that it's so busy it can't even move my mouse. Quite strange.
Could it even be an hardware (hard disk) problem?
That seems very likely. Often when I have that kind of behavior it's because there are bad sectors on the disk and it's having to try repeatedly to read data. You may even see I/O errors in your syslog. I would run the SMART self-test, if it's supported -- see the 'smartctl' manpage for details. Most likely your disk is dying. -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
David Brodbeck wrote:
jan kalcic wrote:
In addition, during the wait, the hard disk sounds like it is doing such a big work that it's so busy it can't even move my mouse. Quite strange.
Could it even be an hardware (hard disk) problem?
That seems very likely. Often when I have that kind of behavior it's because there are bad sectors on the disk and it's having to try repeatedly to read data. You may even see I/O errors in your syslog.
I would run the SMART self-test, if it's supported -- see the 'smartctl' manpage for details. Most likely your disk is dying.
So hardware problem, right? :( It reports some errors but it seems to be always the same one and I'm not sure they are what you guess. smartctl version 5.37 [i686-suse-linux-gnu] Copyright (C) 2002-6 Bruce Allen Home page is http://smartmontools.sourceforge.net/ === START OF INFORMATION SECTION === Model Family: Hitachi Travelstar 5K80 family Device Model: HTS548060M9AT00 Serial Number: MRLB55L4HZNEHC Firmware Version: MGBOA5EA User Capacity: 60,011,642,880 bytes Device is: In smartctl database [for details use: -P show] ATA Version is: 6 ATA Standard is: ATA/ATAPI-6 T13 1410D revision 3a Local Time is: Mon Apr 2 02:30:22 2007 CEST SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x85) Offline data collection activity was aborted by an interrupting command from host. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: ( 645) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 46) minutes. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 084 084 062 Pre-fail Always - 2097461 2 Throughput_Performance 0x0005 100 100 040 Pre-fail Offline - 604 3 Spin_Up_Time 0x0007 148 148 033 Pre-fail Always - 2 4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 808 5 Reallocated_Sector_Ct 0x0033 095 095 005 Pre-fail Always - 0 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0 8 Seek_Time_Performance 0x0005 100 100 040 Pre-fail Offline - 0 9 Power_On_Hours 0x0012 096 096 000 Old_age Always - 2120 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 800 191 G-Sense_Error_Rate 0x000a 098 098 000 Old_age Always - 65540 192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 63 193 Load_Cycle_Count 0x0012 093 093 000 Old_age Always - 73182 194 Temperature_Celsius 0x0002 141 141 000 Old_age Always - 39 (Lifetime Min/Max 7/52) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 376 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 201 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0 SMART Error Log Version: 1 ATA Error Count: 21303 (device log contains only the most recent five errors) CR = Command Register [HEX] FR = Features Register [HEX] SC = Sector Count Register [HEX] SN = Sector Number Register [HEX] CL = Cylinder Low Register [HEX] CH = Cylinder High Register [HEX] DH = Device/Head Register [HEX] DC = Device Command Register [HEX] ER = Error register [HEX] ST = Status register [HEX] Powered_Up_Time is measured from power on, and printed as DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes, SS=sec, and sss=millisec. It "wraps" after 49.710 days. Error 21303 occurred at disk power-on lifetime: 2118 hours (88 days + 6 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 03 3c 62 65 e2 Error: UNC 3 sectors at LBA = 0x0265623c = 40198716 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- 25 00 08 37 62 65 e0 00 07:44:17.200 READ DMA EXT e7 00 00 00 00 00 a0 00 07:44:11.600 FLUSH CACHE e7 00 00 00 00 00 a0 00 07:44:11.600 FLUSH CACHE 35 00 08 97 0a 4c e0 00 07:44:11.600 WRITE DMA EXT e7 00 00 00 00 00 a0 00 07:44:11.600 FLUSH CACHE Error 21302 occurred at disk power-on lifetime: 2118 hours (88 days + 6 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 03 3c 62 65 e2 Error: UNC 3 sectors at LBA = 0x0265623c = 40198716 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- 25 00 08 37 62 65 e0 00 07:32:18.000 READ DMA EXT 35 00 10 97 1c 4a e0 00 07:32:18.000 WRITE DMA EXT 35 00 08 f8 3a d3 e0 00 07:32:18.000 WRITE DMA EXT 35 00 10 e0 a4 2c e0 00 07:32:18.000 WRITE DMA EXT 35 00 08 37 1f a2 e0 00 07:32:18.000 WRITE DMA EXT Error 21301 occurred at disk power-on lifetime: 2118 hours (88 days + 6 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 03 3c 62 65 e2 Error: UNC 3 sectors at LBA = 0x0265623c = 40198716 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- 25 00 08 37 62 65 e0 00 07:32:14.100 READ DMA EXT 35 00 08 5f b7 a5 e0 00 07:32:12.800 WRITE DMA EXT 35 00 08 cf b6 a5 e0 00 07:32:12.800 WRITE DMA EXT 35 00 20 3f 1f a2 e0 00 07:32:12.800 WRITE DMA EXT e7 00 00 00 00 00 a0 00 07:32:08.400 FLUSH CACHE Error 21300 occurred at disk power-on lifetime: 2118 hours (88 days + 6 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 03 3c 62 65 e2 Error: UNC 3 sectors at LBA = 0x0265623c = 40198716 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- 25 00 08 37 62 65 e0 00 07:24:15.800 READ DMA EXT 25 00 08 37 62 65 e0 00 07:24:11.900 READ DMA EXT 35 00 08 28 a9 ca e0 00 07:24:11.000 WRITE DMA EXT e7 00 00 00 00 00 a0 00 07:24:06.200 FLUSH CACHE e7 00 00 00 00 00 a0 00 07:24:06.200 FLUSH CACHE Error 21299 occurred at disk power-on lifetime: 2118 hours (88 days + 6 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 03 3c 62 65 e2 Error: UNC 3 sectors at LBA = 0x0265623c = 40198716 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- 25 00 08 37 62 65 e0 00 07:24:11.900 READ DMA EXT 35 00 08 28 a9 ca e0 00 07:24:11.000 WRITE DMA EXT e7 00 00 00 00 00 a0 00 07:24:06.200 FLUSH CACHE e7 00 00 00 00 00 a0 00 07:24:06.200 FLUSH CACHE 35 00 08 d7 eb 4b e0 00 07:24:06.200 WRITE DMA EXT SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Completed without error 00% 1 - # 2 Short offline Completed without error 00% 0 - Warning! SMART Selective Self-Test Log Structure error: invalid SMART checksum. SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 The Monday 2007-04-02 at 02:36 +0200, jan kalcic wrote:
I would run the SMART self-test, if it's supported -- see the 'smartctl' manpage for details. Most likely your disk is dying.
So hardware problem, right? :(
It reports some errors but it seems to be always the same one and I'm not sure they are what you guess.
=== START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED
ok so far
Vendor Specific SMART Attributes with Thresholds:
Nothing mentionable here.
9 Power_On_Hours 0x0012 096 096 000 Old_age Always - 2120
It's a "young" disk.
SMART Error Log Version: 1 ATA Error Count: 21303 (device log contains only the most recent five errors)
Error 21303 occurred at disk power-on lifetime: 2118 hours (88 days + 6 hours)
ie, two hours ago. That's about it.
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 03 3c 62 65 e2 Error: UNC 3 sectors at LBA = 0x0265623c = 40198716
I guess that's an unreadable sector (3 sectors).
SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Completed without error 00% 1 - # 2 Short offline Completed without error 00% 0 -
You haven't done a short/long/ test recently: those seems to be two short tests made by the computer installer (at 1 hour of age). So, do a short test, then a long one. You can continue using your computer somewhat during the test, but it's better you stop all disk intensive activities, even things like mail fetching if possible, or it will take way longer to test. When finished, read the results again. Tricks: Writing to the bad sector will trigerg remapping it. You can detect them all by copying all sectors to /dev/null with dd.
Warning! SMART Selective Self-Test Log Structure error: invalid SMART checksum.
Dunno. Run those tests first. - -- Cheers, Carlos E. R. -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.5 (GNU/Linux) Comment: Made with pgp4pine 1.76 iD8DBQFGEGxBtTMYHG2NR9URArphAJ90I+PuC7IFglw1LRjmx7/zFqLGUQCfZ69A IEQ1RYisZEGQ68OwX1vhQjI= =KacE -----END PGP SIGNATURE----- -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
You haven't done a short/long/ test recently: those seems to be two short tests made by the computer installer (at 1 hour of age). So, do a short test, then a long one. You can continue using your computer somewhat during the test, but it's better you stop all disk intensive activities, even things like mail fetching if possible, or it will take way longer to test. When finished, read the results again.
Tricks: Writing to the bad sector will trigerg remapping it. You can detect them all by copying all sectors to /dev/null with dd.
Warning! SMART Selective Self-Test Log Structure error: invalid SMART checksum.
Dunno. Run those tests first.
Here we are. Short test completed without error whereas the long one shows a read failure error. smartctl version 5.37 [i686-suse-linux-gnu] Copyright (C) 2002-6 Bruce Allen Home page is http://smartmontools.sourceforge.net/ === START OF READ SMART DATA SECTION === SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed: read failure 60% 2121 38168578 # 2 Short offline Completed without error 00% 2121 - # 3 Short offline Completed without error 00% 1 - # 4 Short offline Completed without error 00% 0 - -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 The Monday 2007-04-02 at 13:58 +0200, jan kalcic wrote:
Dunno. Run those tests first.
Here we are. Short test completed without error whereas the long one shows a read failure error.
The long one on recent makes does asurface test.
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed: read failure 60% 2121 38168578 # 2 Short offline Completed without error 00% 2121 -
Ok, try to learn where that LBA=38168578 is, I mean, which partition, and overwrite it fully with dd: this triggers sector remapping. First, backup it completely to another disk (file by file, with rsync). Then, run the test again. Finally, copy files back. Drives have a region allocated to remap bad sectors. When trying to write to a bad sector, it is transparently reallocated to a new one in the manufacturer reserved region: Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 5 Reallocated_Sector_Ct 0x0033 095 095 005 Pre-fail Always - 0 You can also use the manufacturer provided test utility. These usually come as a boot floppy or CD that runs roughly the same spart tests under cpu control so that you can learn verbosely what is wrong. - -- Cheers, Carlos E. R. -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.5 (GNU/Linux) Comment: Made with pgp4pine 1.76 iD4DBQFGEWKotTMYHG2NR9URAuRBAJMH0JQeWWg1tEgxnobQz/UPtWEaAJ9pTGrx lsbclsQXKKy/TrI8Bh9fSg== =SZaD -----END PGP SIGNATURE----- -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
* jan kalcic <jandot@googlemail.com> [04-01-07 19:13]:
openSuSE 10.2, never had a problem with filesystem but suddenly it seems it has become "bad" as it takes something like 10 or more seconds to open a file, especially the biggest ones, and in the meanwhile it doesn't allow to do anything else but wait. I've already run fsck (ext3) to check it but no error is shown and yet I still have big troubles working with big files (>100MB).
In addition, during the wait, the hard disk sounds like it is doing such a big work that it's so busy it can't even move my mouse. Quite strange.
Could it even be an hardware (hard disk) problem?
Sounds like the swap space is full or your file system has ran out of space. -- Patrick Shanahan Registered Linux User #207535 http://wahoo.no-ip.org @ http://counter.li.org HOG # US1244711 Photo Album: http://wahoo.no-ip.org/gallery2 OpenSUSE Linux http://en.opensuse.org/ -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
Patrick Shanahan wrote:
* jan kalcic <jandot@googlemail.com> [04-01-07 19:13]:
openSuSE 10.2, never had a problem with filesystem but suddenly it seems it has become "bad" as it takes something like 10 or more seconds to open a file, especially the biggest ones, and in the meanwhile it doesn't allow to do anything else but wait. I've already run fsck (ext3) to check it but no error is shown and yet I still have big troubles working with big files (>100MB).
In addition, during the wait, the hard disk sounds like it is doing such a big work that it's so busy it can't even move my mouse. Quite strange.
Could it even be an hardware (hard disk) problem?
Sounds like the swap space is full or your file system has ran out of space.
I can exclude this. My guess is something wrong with filesystem or an hardware problem as suggested. # df -h /dev/hda2 8.9G 5.3G 3.2G 63% / /dev/hda3 3.0G 1.1G 1.8G 39% /home # free total used free shared buffers cached Mem: 1035748 560712 475036 0 22640 359068 -/+ buffers/cache: 179004 856744 Swap: 0 0 0 -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
* jan kalcic <jandot@googlemail.com> [04-01-07 20:53]: [...]
I can exclude this. My guess is something wrong with filesystem or an hardware problem as suggested.
# df -h /dev/hda2 8.9G 5.3G 3.2G 63% / /dev/hda3 3.0G 1.1G 1.8G 39% /home
# free total used free shared buffers cached Mem: 1035748 560712 475036 0 22640 359068 -/+ buffers/cache: 179004 856744 Swap: 0 0 0
Agreed gud luk, -- Patrick Shanahan Registered Linux User #207535 http://wahoo.no-ip.org @ http://counter.li.org HOG # US1244711 Photo Album: http://wahoo.no-ip.org/gallery2 OpenSUSE Linux http://en.opensuse.org/ -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
On Monday 02 April 2007 01:12, jan kalcic wrote:
Hi People,
openSuSE 10.2, never had a problem with filesystem but suddenly it seems it has become "bad" as it takes something like 10 or more seconds to open a file, especially the biggest ones, and in the meanwhile it doesn't allow to do anything else but wait. I've already run fsck (ext3) to check it but no error is shown and yet I still have big troubles working with big files (>100MB).
In addition, during the wait, the hard disk sounds like it is doing such a big work that it's so busy it can't even move my mouse. Quite strange.
Could it even be an hardware (hard disk) problem?
How are you working with the files? Is this for all files or just some you open in a particular program? Perhaps this program wants to read in the entire file, which is why it takes so long? Regarding the slow speed and problems doing anything else: are these IDE hard drives? Did you check with hdparm to make sure DMA is enabled on the drives? -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
Hello, On Mon, 02 Apr 2007 01:12:49 +0200 jan kalcic <jandot@googlemail.com> wrote:
Hi People,
openSuSE 10.2, never had a problem with filesystem but suddenly it seems it has become "bad" as it takes something like 10 or more seconds to open a file, especially the biggest ones, and in the meanwhile it doesn't allow to do anything else but wait. I've already run fsck (ext3) to check it but no error is shown and yet I still have big troubles working with big files (>100MB).
In addition, during the wait, the hard disk sounds like it is doing such a big work that it's so busy it can't even move my mouse. Quite strange.
Could it even be an hardware (hard disk) problem?
The problem looks like caused by a bad sector. Did you see a dmesg or /var/log/messages? If It is a bad sector, then you would could use DFT(Drive Fitness Test). First of all, **backup your all data**. Next, download Drive Fitness Test. http://www.hitachigst.com/hdd/support/table.htm Downloads -> [Drive Fitness Test] download a CD image -> burn it to CD-R After boot from CD-R, choose [2. ATA support only] if you won't check for SATA drive. select it Up and Down to test for your drive and start [Advanced Test]. *Caution*: An Advanced Test takes very long time becuase that checks a all sector. (Perhaps it will take about a hour as your HDD) And if that found a bad sector, then correct it. After this check, you would must fsck by using LiveCD or Knoppix before boot system. HTH eshsf -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
eshsf wrote:
Hello,
On Mon, 02 Apr 2007 01:12:49 +0200 jan kalcic <jandot@googlemail.com> wrote:
Hi People,
openSuSE 10.2, never had a problem with filesystem but suddenly it seems it has become "bad" as it takes something like 10 or more seconds to open a file, especially the biggest ones, and in the meanwhile it doesn't allow to do anything else but wait. I've already run fsck (ext3) to check it but no error is shown and yet I still have big troubles working with big files (>100MB).
In addition, during the wait, the hard disk sounds like it is doing such a big work that it's so busy it can't even move my mouse. Quite strange.
Could it even be an hardware (hard disk) problem?
The problem looks like caused by a bad sector. Did you see a dmesg or /var/log/messages? If It is a bad sector, then you would could use DFT(Drive Fitness Test).
Now I'm sure it's an hardware problem. /var/log/messages looks like: Mar 19 16:31:52 venus kernel: hda: dma_intr: status=0x51 { DriveReady SeekComplete Error } Mar 19 16:31:52 venus kernel: hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=40227774, high=2, low=6673342, s ector=40227567 Mar 19 16:31:52 venus kernel: ide: failed opcode was: unknown Mar 19 16:31:52 venus kernel: end_request: I/O error, dev hda, sector 40227567 Mar 19 16:31:56 venus kernel: hda: dma_intr: status=0x51 { DriveReady SeekComplete Error } Mar 19 16:31:56 venus kernel: hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=40227774, high=2, low=6673342, s ector=40227575 Mar 19 16:31:56 venus kernel: ide: failed opcode was: unknown Mar 19 16:31:56 venus kernel: end_request: I/O error, dev hda, sector 40227575 Mar 19 16:32:00 venus kernel: hda: dma_intr: status=0x51 { DriveReady SeekComplete Error } Mar 19 16:32:00 venus kernel: hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=40227774, high=2, low=6673342, s ector=40227583 Mar 19 16:32:00 venus kernel: ide: failed opcode was: unknown Mar 19 16:32:00 venus kernel: end_request: I/O error, dev hda, sector 40227583 Mar 19 16:32:04 venus kernel: hda: dma_intr: status=0x51 { DriveReady SeekComplete Error } Mar 19 16:32:04 venus kernel: hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=40227774, high=2, low=6673342, s ector=40227591 Mar 19 16:32:04 venus kernel: ide: failed opcode was: unknown Mar 19 16:32:04 venus kernel: end_request: I/O error, dev hda, sector 40227591 Mar 19 16:33:31 venus kernel: hda: dma_intr: status=0x51 { DriveReady SeekComplete Error } Mar 19 16:33:31 venus kernel: hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=40227774, high=2, low=6673342, s ector=40227599 Mar 19 16:33:31 venus kernel: ide: failed opcode was: unknown Mar 19 16:33:31 venus kernel: end_request: I/O error, dev hda, sector 40227599 Mar 19 16:33:31 venus kernel: hda: dma_intr: status=0x51 { DriveReady SeekComplete Error } Mar 19 16:33:31 venus kernel: hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=40227774, high=2, low=6673342, s ector=40227607 -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
On Mon, 2 Apr 2007 12:16:35 +0900 eshsf <eshsf@mbj.nifty.com> wrote:
Hello,
On Mon, 02 Apr 2007 01:12:49 +0200 jan kalcic <jandot@googlemail.com> wrote:
Hi People,
openSuSE 10.2, never had a problem with filesystem but suddenly it seems it has become "bad" as it takes something like 10 or more seconds to open a file, especially the biggest ones, and in the meanwhile it doesn't allow to do anything else but wait. I've already run fsck (ext3) to check it but no error is shown and yet I still have big troubles working with big files (>100MB).
In addition, during the wait, the hard disk sounds like it is doing such a big work that it's so busy it can't even move my mouse. Quite strange.
Could it even be an hardware (hard disk) problem?
The problem looks like caused by a bad sector. Did you see a dmesg or /var/log/messages? If It is a bad sector, then you would could use DFT(Drive Fitness Test).
First of all, **backup your all data**. Next, download Drive Fitness Test. http://www.hitachigst.com/hdd/support/table.htm Downloads -> [Drive Fitness Test] download a CD image -> burn it to CD-R After boot from CD-R, choose [2. ATA support only] if you won't check for SATA drive. select it Up and Down to test for your drive and start [Advanced Test].
*Caution*: An Advanced Test takes very long time becuase that checks a all sector. (Perhaps it will take about a hour as your HDD)
And if that found a bad sector, then correct it. After this check, you would must fsck by using LiveCD or Knoppix before boot system.
FYI: if you have a seagate HDD, there is a SeaTools too. http://www.seagate.com/www/en-us/support/downloads/ eshsf -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
eshsf wrote:
On Mon, 2 Apr 2007 12:16:35 +0900 eshsf <eshsf@mbj.nifty.com> wrote:
Hello,
On Mon, 02 Apr 2007 01:12:49 +0200 jan kalcic <jandot@googlemail.com> wrote:
Hi People,
openSuSE 10.2, never had a problem with filesystem but suddenly it seems it has become "bad" as it takes something like 10 or more seconds to open a file, especially the biggest ones, and in the meanwhile it doesn't allow to do anything else but wait. I've already run fsck (ext3) to check it but no error is shown and yet I still have big troubles working with big files (>100MB).
In addition, during the wait, the hard disk sounds like it is doing such a big work that it's so busy it can't even move my mouse. Quite strange.
Could it even be an hardware (hard disk) problem?
The problem looks like caused by a bad sector. Did you see a dmesg or /var/log/messages? If It is a bad sector, then you would could use DFT(Drive Fitness Test).
First of all, **backup your all data**. Next, download Drive Fitness Test. http://www.hitachigst.com/hdd/support/table.htm Downloads -> [Drive Fitness Test] download a CD image -> burn it to CD-R After boot from CD-R, choose [2. ATA support only] if you won't check for SATA drive. select it Up and Down to test for your drive and start [Advanced Test].
*Caution*: An Advanced Test takes very long time becuase that checks a all sector. (Perhaps it will take about a hour as your HDD)
And if that found a bad sector, then correct it. After this check, you would must fsck by using LiveCD or Knoppix before boot system.
FYI: if you have a seagate HDD, there is a SeaTools too. http://www.seagate.com/www/en-us/support/downloads/
eshsf
It's a hitachi disk. I'll try DFT and will see what happens. Thanks to you all so far. Regards, Jan -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
On Mon, 02 Apr 2007 21:05:30 +0200 jan kalcic <jandot@googlemail.com> wrote:
eshsf wrote:
On Mon, 2 Apr 2007 12:16:35 +0900 eshsf <eshsf@mbj.nifty.com> wrote:
Hello,
On Mon, 02 Apr 2007 01:12:49 +0200 jan kalcic <jandot@googlemail.com> wrote:
Hi People,
openSuSE 10.2, never had a problem with filesystem but suddenly it seems it has become "bad" as it takes something like 10 or more seconds to open a file, especially the biggest ones, and in the meanwhile it doesn't allow to do anything else but wait. I've already run fsck (ext3) to check it but no error is shown and yet I still have big troubles working with big files (>100MB).
In addition, during the wait, the hard disk sounds like it is doing such a big work that it's so busy it can't even move my mouse. Quite strange.
Could it even be an hardware (hard disk) problem?
The problem looks like caused by a bad sector. Did you see a dmesg or /var/log/messages? If It is a bad sector, then you would could use DFT(Drive Fitness Test).
First of all, **backup your all data**. Next, download Drive Fitness Test. http://www.hitachigst.com/hdd/support/table.htm Downloads -> [Drive Fitness Test] download a CD image -> burn it to CD-R After boot from CD-R, choose [2. ATA support only] if you won't check for SATA drive. select it Up and Down to test for your drive and start [Advanced Test].
*Caution*: An Advanced Test takes very long time becuase that checks a all sector. (Perhaps it will take about a hour as your HDD)
And if that found a bad sector, then correct it. After this check, you would must fsck by using LiveCD or Knoppix before boot system.
FYI: if you have a seagate HDD, there is a SeaTools too. http://www.seagate.com/www/en-us/support/downloads/
eshsf
It's a hitachi disk. I'll try DFT and will see what happens.
When I had a phenomenon same as this in the past, I resolved this problem by the way I said. However I think the best way is a clean install after backup a data and corrected a bad sector by using DFT, SeaTools, and so on. Thanks, eshsf -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
On 2007-04-01 17:12, jan kalcic wrote:
Hi People,
openSuSE 10.2, never had a problem with filesystem but suddenly it seems it has become "bad" as it takes something like 10 or more seconds to open a file, especially the biggest ones, and in the meanwhile it
As others have mentioned, this could be a sign of a problem with the drive. Note that it also be a problem with the power supply: Insufficient power to the system can mask itself as drive read/write errors or memory errors (and other problems as well, but those are probably the most important ones). -- Moral indignation is jealousy with a halo. -- HG Wells -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org
participants (7)
-
Anders Johansson
-
Carlos E. R.
-
Darryl Gregorash
-
David Brodbeck
-
eshsf
-
jan kalcic
-
Patrick Shanahan