[Bug 386555] New: High number of hard drive load cycles on notebooks
https://bugzilla.novell.com/show_bug.cgi?id=386555 Summary: High number of hard drive load cycles on notebooks Product: openSUSE 11.0 Version: Beta 2 Platform: Other OS/Version: Other Status: NEW Severity: Major Priority: P5 - None Component: Kernel AssignedTo: bnc-team-screening@forge.provo.novell.com ReportedBy: alberto.passalacqua@tin.it QAContact: kernel-maintainers@forge.provo.novell.com Found By: --- Following what I read about some users experiences with the latest release of Fedora and Ubuntu, having "clicks" and high load cycles counts on certain laptops (see https://bugs.launchpad.net/ubuntu/+source/acpi-support/+bug/59695 for example), I asked for some testing in Beta 2. One hundred load cycles per hour were reported on a Lenovo R61 using openSUSE 11 beta 2, as you can read here: http://lists.opensuse.org/opensuse-factory/2008-05/msg00066.html I think further investigation is required, being this issue quite serious, considering its impact on the disk life. Regards, Alberto -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=386555
Andreas Jaeger
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c1
Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User alberto.passalacqua@tin.it added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c3
Alberto Passalacqua
https://bugzilla.novell.com/show_bug.cgi?id=386555
User alberto.passalacqua@tin.it added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c4
Alberto Passalacqua
https://bugzilla.novell.com/show_bug.cgi?id=386555
Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User pavel@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c5
--- Comment #5 from Pavel Machek
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c6
--- Comment #6 from Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User alberto.passalacqua@tin.it added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c7
--- Comment #7 from Alberto Passalacqua
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c8
--- Comment #8 from Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User alberto.passalacqua@tin.it added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c9
--- Comment #9 from Alberto Passalacqua
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c10
--- Comment #10 from Tejun Heo
It's not a question of how seriously you (at Novell) take the problem. I never wrote you're not considering it seriously. But I also think that if this hardware works correctly under Windows and Mac OS X, it should be work correctly without ugly hacks under Linux too.
Other points I can agree to but it working under mac os X is just dumb luck. The setting just isn't healthy. Let's say the vendor sets the unload timeout to 10 seconds and under nominal laptop circumstances Windows issues IOs every 8 secs unless it's completely idle. It will work fine there. Let's say mac does so every 7 to 9 seconds which will work fine too. Now, let's say linux does so every 10-12 seconds. Now you have a problem. Being idle for longer period time is a good thing which we should strive for but in this case it's fast death for the drive. This is why some people are reporting the problem goes away when laptop mode is disabled because then IOs will be issued more frequently. Such short fixed timeouts just can't generically work. It's bound to break. They had to go with longer timeout or adaptive one. So, it's not like something is wrong with linux, it's just different and the setting is way too aggressive for real world. Please think of my friend's laptop I mentioned earlier. That drive is going to be toasted pretty soon even if it's just differently configured windows. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=386555
User trenn@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c11
--- Comment #11 from Thomas Renninger
https://bugzilla.novell.com/show_bug.cgi?id=386555
User seife@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c12
--- Comment #12 from Stefan Seyfried
192: drive does not unload heads from the specs, 255 disables APM
On my machine, i usually set it to 192 when on AC power and to 128 when on battery power. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c13
--- Comment #13 from Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c14
--- Comment #14 from Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c15
Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c16
--- Comment #16 from Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c17
Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User seife@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c18
--- Comment #18 from Stefan Seyfried
https://bugzilla.novell.com/show_bug.cgi?id=386555
User robin.listas@telefonica.net added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c19
Carlos Robinson
https://bugzilla.novell.com/show_bug.cgi?id=386555
User pearson44j@wanadoo.fr added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c20
--- Comment #20 from James PEARSON
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c21
--- Comment #21 from Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User alberto.passalacqua@tin.it added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c22
--- Comment #22 from Alberto Passalacqua
https://bugzilla.novell.com/show_bug.cgi?id=386555
User seife@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c23
Stefan Seyfried
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c24
--- Comment #24 from Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User roland.kletzing@materna.de added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c25
roland kletzing
Aieeeeeeeee.......... This is nasty. The BIOS / drive vendors are setting APM to aggressive values w/ idle IO access pattern of windows on mind. The setting is too aggressive to the point of being fragile and different idle IO pattern causes the drive to unload like crazy && you can't really expect Linux or any other operating system to have similar idle IO pattern as windows. :-(
even worse - there are disks on the market which don`t let you tune the timeouts with standard methods. you need to get a proprietary DOS *sigh* tool to tune the unload interval - and WD support may give that to you - or not.... see these threads: http://marc.info/?l=linux-kernel&m=120777293511872&w=2 http://marc.info/?l=linux-kernel&m=121071269907588&w=2 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=386555
User sontek@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c26
--- Comment #26 from John Anderson
https://bugzilla.novell.com/show_bug.cgi?id=386555
User sontek@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c27
--- Comment #27 from John Anderson
https://bugzilla.novell.com/show_bug.cgi?id=386555
User sontek@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c28
John Anderson
https://bugzilla.novell.com/show_bug.cgi?id=386555
Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c29
--- Comment #29 from Tejun Heo
this doesn`t only happen on notebooks but also with enterprise disks for 24/7hrs useage.
even worse - there are disks on the market which don`t let you tune the timeouts with standard methods. you need to get a proprietary DOS *sigh* tool to tune the unload interval - and WD support may give that to you - or not....
see these threads: http://marc.info/?l=linux-kernel&m=120777293511872&w=2 http://marc.info/?l=linux-kernel&m=121071269907588&w=2
Yeah, I recall the thread. I don't really think we can do something about it tho. The only possible solution is to periodically issue commands to the drive to keep its head from unloading. Yuk.. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c30
Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
Greg Kroah-Hartman
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c31
--- Comment #31 from Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User alberto.passalacqua@tin.it added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c32
Alberto Passalacqua
https://bugzilla.novell.com/show_bug.cgi?id=386555
User roland.kletzing@materna.de added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c33
--- Comment #33 from roland kletzing
This is tough to solve.
yes, probably. so why not creating better "transparency" for the end user here ? smartctl can read the power-on-hours and the load_cycle_count. each value for itself doesn't tell much... you won`t even need the power_on_hours value if you just check for load cycle count increase for specific time intervals. if we want to stop harddisk vendors doing such dumb things, more users must raise an eyebrow. the more noise about this, the more chances vendors will do something about that. but how should an technically inexperienced user know, that he actually suffers from that issue ? one step into that direction would be adding a feature to smartmontools to give out a warning message, when it detects excessive load cycle count number. see http://sourceforge.net/mailarchive/forum.php?thread_name=261451287%40web.de&forum_name=smartmontools-support ---
It works on the HP Pavillion. Moreover, it seems the problem is being addressed also by the manufacturer with a new firmware.
any link/information for that? WD support was pretty ignorant about this issue and it wouldn`t hurt telling them that other vendors do something about that problem. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c34
Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c35
Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User seife@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c36
--- Comment #36 from Stefan Seyfried
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c37
--- Comment #37 from Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User seife@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c38
Stefan Seyfried
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c39
--- Comment #39 from Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c40
Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c41
--- Comment #41 from Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c42
--- Comment #42 from Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User roland.kletzing@materna.de added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c43
--- Comment #43 from roland kletzing
https://bugzilla.novell.com/show_bug.cgi?id=386555
User seife@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c44
Stefan Seyfried
Hmmm... Maybe a separate package then? I'm almost done implementing glob matching w/ hal info caching. Man.. bash is a great programming language. :-(
If separate package is the way to go, I'll package it and put it on OBS. I've never submitted a package before. What else do I need to do?
Add it in the PDB and submit it to autobuild. I can do that for you. If you already have it in OBS, please put the package "source dir" somewhere accessible (hm, i should be able to check it out from the OBS...). Make sure you select a good license, so that we can get past legal quickly ;-) Then we need management's approval to get that still into 11.0, so i'll add relevant people to cc now. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=386555
Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c45
Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c46
--- Comment #46 from Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User seife@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c47
Stefan Seyfried
Here's the OBS project. The license is beerware. Is that something legal can agree with?
Let's hope so, i added Ciaran and Jürgen to CC. Feel free to remove yourself again ;-) If this is not "good enough", we can probably also dual-license it "beerware / BSD 3-clause". BSD is known to go easily through legal ;) https://build.opensuse.org/package/show?package=storage-fixup&project=home%3Ateheo
Every file is accessible through OBS but just in case, the git tree is at..
Yes, i was able to check it out with OSC. Everything is easy if you have an BS account, it's only hard for people without.
http://git.kernel.org/?p=linux/kernel/git/tj/storage-fixup.git;a=shortlog;h=...
I don't know anything about PDB so I would appreciate if you can help me with that.
I'll take care of it. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=386555
User pavel@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c48
--- Comment #48 from Pavel Machek
https://bugzilla.novell.com/show_bug.cgi?id=386555
User seife@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c50
--- Comment #50 from Stefan Seyfried
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c51
--- Comment #51 from Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User seife@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c52
--- Comment #52 from Stefan Seyfried
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c55
--- Comment #55 from Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User seife@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c56
--- Comment #56 from Stefan Seyfried
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c57
Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User alberto.passalacqua@tin.it added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c58
--- Comment #58 from Alberto Passalacqua
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c59
--- Comment #59 from Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=386555
User seife@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c60
Stefan Seyfried
https://bugzilla.novell.com/show_bug.cgi?id=386555
User coolo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c61
Stephan Kulow
https://bugzilla.novell.com/show_bug.cgi?id=386555
User pavel@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c62
--- Comment #62 from Pavel Machek
https://bugzilla.novell.com/show_bug.cgi?id=386555
User coolo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c63
--- Comment #63 from Stephan Kulow
https://bugzilla.novell.com/show_bug.cgi?id=386555
User pacho@condmat1.ciencias.uniovi.es added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c70
--- Comment #70 from Pacho Ramos
https://bugzilla.novell.com/show_bug.cgi?id=386555
User pacho@condmat1.ciencias.uniovi.es added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c71
--- Comment #71 from Pacho Ramos
https://bugzilla.novell.com/show_bug.cgi?id=386555
User seife@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c72
--- Comment #72 from Stefan Seyfried
Then, Is "storage-fixup" the package who fixes this problem? From http://software.opensuse.org , seems that it has versions for factory, SLED 10 and opensuse-10.3, Is save use factory rpm in 11.0 ?
Yes. But there will also be an online update for 11.0 delivered soon, so it should pop up in YOU shortly without the need for adding any buildservice repositories. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c73
--- Comment #73 from Tejun Heo
After reading storage-fixup.conf, seems that -B 255 is being used but, after reading https://wiki.ubuntu.com/DanielHahler/Bug59695 , seems that some drives would need -B 254 instead. Also laptop-mode-tools now uses 254 instead of 255 from 1.35 version as can be read in http://samwel.tk/laptop_mode/changelog
Also, Is fully disabling Power Manager really needed? Wouldn't it cause some problems related with overheat or short battery lifetime?
As discussed above, no one value suits every drive. I hope it were like that but the standard isn't too specific about which value means exactly what. Furthermore, this is ATA and vendors often forget to follow the spec. So, for some drives, 255 is a good value for others 254, yet others 128. That's why storage-fixup matches specific machines and use appropriate commands. We'll need to find out which value is the appropriate one for specific machine and add it machine-by-machine. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=386555
User pacho@condmat1.ciencias.uniovi.es added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c74
--- Comment #74 from Pacho Ramos
https://bugzilla.novell.com/show_bug.cgi?id=386555
User ast@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c75
--- Comment #75 from Anja Stock
https://bugzilla.novell.com/show_bug.cgi?id=386555
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=386555#c76
--- Comment #76 from Tejun Heo
From now on, please use the following wiki page to track this problem and report new ones to linux-ide@vger.kernel.org as this problem is not specific to SUSE.
http://ata.wiki.kernel.org/index.php/Known_issues Thanks. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@novell.com