[Bug 715852] New: data loss with ext4, SSD, kernel 3.0??
https://bugzilla.novell.com/show_bug.cgi?id=715852 https://bugzilla.novell.com/show_bug.cgi?id=715852#c0 Summary: data loss with ext4, SSD, kernel 3.0?? Classification: openSUSE Product: openSUSE 11.4 Version: Final Platform: x86-64 OS/Version: Other Status: NEW Severity: Critical Priority: P5 - None Component: Kernel AssignedTo: kernel-maintainers@forge.provo.novell.com ReportedBy: jnelson-suse@jamponi.net QAContact: qa@suse.de Found By: --- Blocker: --- User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:6.0) Gecko/20100101 Firefox/6.0 I've recently upgraded to a new laptop with an SSD, and also gone with tumbleweed. I am experiencing /significant/ data loss issues when the laptop crashes or hangs. Specifically, I'm seeing files that should have been written *minutes* ago disappear. rpms that I have installed claim they don't exist -- but the files do and they're all 0 bytes - and rpm -Va and rpm -ql know about them. rpm database corruption. Missing files (esp. KDE config files). I am using the *exact* same setup that I've been using for a few years now: ext4 on top of lvm on top of LUKS. I've had more data loss in the last 2 weeks than I've had in 2 years, and I'm not sure what the problem is: the Intel 320 SSD? The new kernel (3.0.4 as of this writing)? The drive: INTEL SSDSA2BW16 Laptop: Lenovo T520i. I'm not using any special mount options or anything. LUKS (and DM) don't support TRIM, so that's not it. barriers are enabled. data=ordered is the mount mode. Reproducible: Always Steps to Reproduce: 1. 2. 3. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=715852 https://bugzilla.novell.com/show_bug.cgi?id=715852#c Jeff Mahoney <jeffm@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- AssignedTo|kernel-maintainers@forge.pr |jack@suse.com |ovo.novell.com | -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
From the first sight I'd say that flusher thread is not properly writing out
https://bugzilla.novell.com/show_bug.cgi?id=715852 https://bugzilla.novell.com/show_bug.cgi?id=715852#c1 Jan Kara <jack@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |NEEDINFO InfoProvider| |jnelson-suse@jamponi.net --- Comment #1 from Jan Kara <jack@suse.com> 2012-03-27 18:23:48 UTC --- Interesting, thanks for report. Is this deterministic? I.e. could you say that all the files written to some point got lost? Maybe you could try the following: mkdir /tmp/testdir; for (( i = 0; i < 1000; i++ )); do echo "file$i"; echo "foobar" >/tmp/testdir/file$i; sleep 5; done And trigger reboot (e.g. "echo b >/proc/sysrq-trigger") after the above is running for 10 minutes or so. Then you could compare how many files got lost compared to what was written to the screen. old files... -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=715852 https://bugzilla.novell.com/show_bug.cgi?id=715852#c2 Jon Nelson <jnelson-suse@jamponi.net> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |NEW InfoProvider|jnelson-suse@jamponi.net | --- Comment #2 from Jon Nelson <jnelson-suse@jamponi.net> 2012-03-27 20:32:28 UTC --- A few thinks have changed since this report was filed: 1. I added "hdparm -W0 /dev/sda" to /etc/init.d/boot.local to disable the drive's write cache (should not be necessary). 2. the kernel is now 3.1.9-1.4-desktop 3. I've largely switched over to btrfs, except for /boot and the root filesystem. I can re-enable the drive's write cache if that'd be useful. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=715852 https://bugzilla.novell.com/show_bug.cgi?id=715852#c3 Jan Kara <jack@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |NEEDINFO InfoProvider| |jnelson-suse@jamponi.net --- Comment #3 from Jan Kara <jack@suse.com> 2012-03-29 10:46:35 UTC --- You are right that drive's write cache shouldn't cause problems. So feel free to reenable it. If the problem was in writeback code, you would actually observe it with btrfs as well. So testing with btrfs would be useful as well. So can you still reproduce the problem? I've tried to reproduce it with a simple test script but I failed. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=715852 https://bugzilla.novell.com/show_bug.cgi?id=715852#c4 Jon Nelson <jnelson-suse@jamponi.net> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |NEW InfoProvider|jnelson-suse@jamponi.net | --- Comment #4 from Jon Nelson <jnelson-suse@jamponi.net> 2012-03-29 13:07:47 UTC --- I am not able to reproduce the issue, however, the machine is vastly more stable than before. I can't think of a reboot that has been necessary except for kernel upgrades or my own reasons. I could try the script above but I can't do it today. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=715852 https://bugzilla.novell.com/show_bug.cgi?id=715852#c5 Jan Kara <jack@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |RESOLVED Resolution| |WORKSFORME --- Comment #5 from Jan Kara <jack@suse.com> 2012-04-03 00:03:53 UTC --- Ok, for now I'll close this bug as WORKSFORME. But in case you happen to see this again or are even able to reproduce the problem, please reopen this bug. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@novell.com