[Bug 223742] New: Suspend to disk broken
https://bugzilla.novell.com/show_bug.cgi?id=223742 Summary: Suspend to disk broken Product: openSUSE 10.2 Version: RC 1 Platform: Other OS/Version: Other Status: NEW Severity: Major Priority: P5 - None Component: Basesystem AssignedTo: hmacht@novell.com ReportedBy: thoenig@novell.com QAContact: qa@suse.de CC: seife@novell.com Suspend to disk broken. The image is being written successfully, "S" is being written in the next line. Then, the system does not shut down. Shutting down the system manually lets the system trying to resume. But it fails with the warning (fuzzy, resume: could not find configuration file). Resume aborts. With Beta2 and/or Beta2 Plus suspend to disk was working, suspend to RAM works fine. Hardware used: Lenovo ThinkPad T60. Tests were done /without/ USB mass storage being attached. No docking or similar schabernack was performed. Please let me know if I can provide logs or anything else which might help. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #1 from thoenig@novell.com 2006-11-25 08:25 MST ------- (In reply to comment #0)
Shutting down the system manually lets the system trying to resume.
This should read: Shutting down the system manually and then powering on again lets the system trying to resume. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 seife@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |fseidel@novell.com, hmacht@novell.com, | |pavel@novell.com AssignedTo|hmacht@novell.com |seife@novell.com Status|NEW |ASSIGNED ------- Comment #2 from seife@novell.com 2006-11-25 15:51 MST ------- (In reply to comment #0)
Suspend to disk broken.
The image is being written successfully, "S" is being written in the next line. Then, the system does not shut down.
Please provide /var/log/pm-suspend.log If there is nothing in there, please provide "lsmod" of the running system before suspend and "cat /proc/sys/kernel/tainted"
Shutting down the system manually lets the system trying to resume. But it fails with the warning (fuzzy, resume: could not find configuration file).
This one is harmless, it appears always.
Resume aborts.
This one is not harmless. Another thing to try: - boot with init=/bin/bash - mount /proc - mount /sys - swapon -a - s2disk -f /var/lib/s2disk.conf If this one also does not work, try it again, but on resume boot with "noresume init=/bin/bash", then do "guessfstype $YOUR_SWAP_PARTITION" -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #3 from thoenig@novell.com 2006-11-26 05:35 MST ------- Created an attachment (id=106951) --> (https://bugzilla.novell.com/attachment.cgi?id=106951&action=view) Debug patch (kernel not tainted, With the attached patch I run into "3b1: bad signature"; the signature check fails. guessfstype for my swap parition: thoenig@nouse:~> sudo guessfstype /dev/sda6 /dev/sda6 *appears* to be: swap -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 thoenig@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Severity|Major |Blocker ------- Comment #4 from thoenig@novell.com 2006-11-26 05:38 MST ------- Raising to blocker as I encountered data loss. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #5 from thoenig@novell.com 2006-11-26 05:40 MST ------- I'm tempted to run mkswap on my swap partition but we probably want to investigate what is going on; I will not do this for now. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #6 from thoenig@novell.com 2006-11-26 05:56 MST ------- - boot with init=/bin/bash - mount /proc - mount /sys - swapon -a - s2disk -f /var/lib/s2disk.conf results in a successful suspension (no error wrt signature of the swap partition). However, resuming does not work. The system does not even try to resume but boots up normally. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #7 from seife@novell.com 2006-11-26 08:07 MST ------- (In reply to comment #5)
I'm tempted to run mkswap on my swap partition but we probably want to investigate what is going on; I will not do this for now.
good idea. please check what your swap signature is (or attach the first 16kb of your swap partition). the signature is at the end of the first page of the swap partition. or just add a printf for swsusp.header_sig -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 seife@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |rjwysocki@sisk.pl ------- Comment #8 from seife@novell.com 2006-11-26 08:12 MST ------- (In reply to comment #4)
Raising to blocker as I encountered data loss.
how? was the filesystem corrupted? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 seife@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |NEEDINFO Info Provider| |thoenig@novell.com ------- Comment #9 from seife@novell.com 2006-11-26 08:17 MST ------- another idea: sanity check the swap_offset / shift / size values. is swap_offset isn´t 0, then something fishy is going on. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #10 from pavel@novell.com 2006-11-26 14:27 MST ------- What data loss? I'd suggest to mkswap and move on. I do not think we have mechanism to get rid of lost swap signature. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #11 from thoenig@novell.com 2006-11-26 15:12 MST ------- (In reply to comment #7)
good idea. please check what your swap signature is (or attach the first 16kb of your swap partition).
the signature is at the end of the first page of the swap partition.
or just add a printf for swsusp.header_sig
(In reply to comment #9)
another idea: sanity check the swap_offset / shift / size values.
is swap_offset isn´t 0, then something fishy is going on.
Yes, I will try that tomorrow morning at work. (In reply to comment #8)
how? was the filesystem corrupted?
(In reply to comment #10)
What data loss?
I had an open file when triggering suspend to disk. This file was corrupt after that unsuccessful attempt.
I'd suggest to mkswap and move on. I do not think we have mechanism to get rid of lost swap signature.
I will do that right *after* we find out what is going on. I did not perform anything special (read: others will are likely to run into this as well). -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #13 from pavel@novell.com 2006-11-26 16:14 MST ------- Rafael said: I bet the swap header is fragmented in which case we can't use the swap file for the suspend. Please suggest the user to try the "swap-offset" program out of suspend-0.5 and see what it says. ..but you are using swap _partition_, not swap _file_, right? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #14 from thoenig@novell.com 2006-11-26 16:29 MST ------- (In reply to comment #13)
I bet the swap header is fragmented in which case we can't use the swap file for the suspend. Please suggest the user to try the "swap-offset" program out of suspend-0.5 and see what it says.
...but you are using swap _partition_, not swap _file_, right?
Yes, OS10.2 is configured to use the swap partition. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 thoenig@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Version|RC 1 |RC 3 ------- Comment #15 from thoenig@novell.com 2006-11-29 04:21 MST ------- Looks like a SATA issue. There are I/O errors I will try to capture with a serial console. Currently we have verified the systems Lenovo ThinkPad T60 and T60P to break with this issue. Still present in RC3. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 fred.blaise@gmail.com changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |fred.blaise@gmail.com Version|RC 3 |Final ------- Comment #16 from fred.blaise@gmail.com 2006-12-10 15:09 MST ------- Got the same issue as Timo. T60 as well, but the version below his ;) s2ram is fine. Also hangs right after S is printed on console. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #17 from fred.blaise@gmail.com 2006-12-10 15:23 MST ------- If you try, in init 3 mode, the following as root: s2disk $YOUR_SWAP_DEV and then switch to console 10, you will see never-ending I/O error on reiserfs it seems (?)... goes to fast to see, and I don't know how to stop them... -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #18 from justforspam@email.it 2006-12-13 14:12 MST ------- Created an attachment (id=109644) --> (https://bugzilla.novell.com/attachment.cgi?id=109644&action=view) susped log (paulatz) I have exactly the same issue here. In a few seconds I will try the command. For now I attach /var/log/pm-suspend.log -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 justforspam@email.it changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |ASSIGNED Info Provider|thoenig@novell.com | ------- Comment #19 from justforspam@email.it 2006-12-13 14:28 MST ------- More information: after reformatting the swap partition now I get a bunch of reiserfs input/output errors. I'm using an HP pavilion dv5200. This is the sata controller: 00:1f.2 SATA controller: Intel Corporation 82801GBM/GHM (ICH7 Family) Serial ATA Storage Controller AHCI (rev 01) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 thoenig@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |NEEDINFO Info Provider| |thoenig@novell.com ------- Comment #20 from thoenig@novell.com 2006-12-13 14:34 MST ------- Lorenzo, this is not the information required to release the NEEDINFO. If you switch through your virtual terminals after the crash has occurred you will see error messages which require a serial console to be captured. If you have a docking station with a serial port feel free to do this. Otherwise we have to wait until I find time to do that. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #21 from pavel@novell.com 2006-12-13 14:37 MST ------- Alternative methods for Lorenzo: 1) digital camera, just take a picture of messages 2) pen and paper. Old-tech, but usually works fine ;-). Actually just give us few of those messages, perhaps that's enough. Has it ever worked? Are you using anything strange such as host-protected area? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #22 from thoenig@novell.com 2006-12-13 14:43 MST ------- Pavel, both options might not be suitable for this case. The errors scroll by *very* quickly. Very unlikely that either a human eye or a CCD is quick enough to capture anything. Probably screen lock still works; I haven't tried that. With regard to your question if it has ever worked: Yes it has reliably with SLE10 and also during the early stages of OS10.2 development. If I recall correctly it broke sometime near the end of the beta phase. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #23 from pavel@novell.com 2006-12-13 15:00 MST ------- Insert mdelay(1000); into printk. Or give me at least picture of last messages. If it worked okay in some beta... can you try to use kernel from that beta if it fixes it, first first broken beta, and run diff -u between those? Should be fairly small and patch that breaks it should be easy to locate. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #24 from justforspam@email.it 2006-12-14 10:58 MST ------- Created an attachment (id=109799) --> (https://bugzilla.novell.com/attachment.cgi?id=109799&action=view) first part of "screen capture" I have managed to take some picture of the kernel messages. I am attaching them as images. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 justforspam@email.it changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |ASSIGNED ------- Comment #25 from justforspam@email.it 2006-12-14 11:02 MST ------- Created an attachment (id=109801) --> (https://bugzilla.novell.com/attachment.cgi?id=109801&action=view) second part after a short while the kernel displayed only this message forever. In case it matters here is my partitioning: Disk /dev/sda: 100.0 GB, 100030242816 bytes 255 heads, 63 sectors/track, 12161 cylinders Units = cilindri of 16065 * 512 = 8225280 bytes Dispositivo Boot Start End Blocks Id System /dev/sda1 * 1 3916 31455238+ 7 HPFS/NTFS /dev/sda2 3918 12030 65167672+ f W95 Ext'd (LBA) /dev/sda3 12031 12161 1052257+ 7 HPFS/NTFS /dev/sda5 3918 11750 62918541 83 Linux /dev/sda6 11751 11881 1052226 83 Linux /dev/sda7 11882 12030 1196811 82 Linux swap / Solaris -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #26 from pavel@novell.com 2006-12-14 12:28 MST ------- Lorenzo: Something goes very wrong with SATA in your case.... strange, you seem to have AHCI in your machine. That should work ok. Does s2ram work okay for you? Timo: so errors scroll and continue scrolling forever, only they are too fast to read? OTOH AHCI timeout messages usually do not scroll _that_ fast. Get a slower graphics card, vesafb usually does the trick ;-). Or press magic-sysrq-0 to silence the console. Messages will freeze and you can read them. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #27 from justforspam@email.it 2006-12-14 15:42 MST ------- s2ram does not work because it is not supported by hardware. Kernel suspend to disk worked like charm with suse 10.1, the problem appeared when I updated. I have tried to revert to kernel suspend with 10.2 but configuration files are really garbled about that and I couldn't make it work fine. I didn't tried harder because it is deprecated, but I can if it is useful. P.S. magic sysrequests have to be enabled in "YaST -> System -> System config" but are difficult to type on a laptop keyboard! -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #28 from seife@novell.com 2006-12-15 06:08 MST ------- (In reply to comment #27)
I have tried to revert to kernel suspend with 10.2 but configuration files are really garbled about that and I couldn't make it work fine. I didn't tried harder because it is deprecated, but I can if it is useful.
/etc/pm/config, but kernel suspend support is broken in 10.2 pm-utils package, see bug 226115 for a trivial patch that fixes it. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #29 from robin.knapp@fujitsu-siemens.com 2007-01-17 13:42 MST ------- You might try the patch in Bug 231088, comment 19 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #30 from justforspam@email.it 2007-01-18 09:50 MST ------- I have had partial success with the kernel patch (make; make modules_install install; mkintrd; reboot): I was able to suspend and recover (software susp) but had kernel returned some SATA errors that slowed down the process. Now I will attach the relevant dmesg output (as soon as I can find the attach form...) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #31 from justforspam@email.it 2007-01-18 09:51 MST ------- Created an attachment (id=113692) --> (https://bugzilla.novell.com/attachment.cgi?id=113692&action=view) kernel output while recovering from suspend, after patching the kernel It was well hidden... -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #32 from pavel@novell.com 2007-01-18 17:19 MST ------- #30 : does that mean problem is fixed? SATA errors are different problem, and recoverable, right? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #33 from justforspam@email.it 2007-01-19 04:15 MST ------- I think the bug could be closed. It seem it was a SATA issue since the beginning I am not 100% sure the new minor problem is unrelated to the old major one. The fact is now suspend works even if a bit oddly (at least for me). -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 pavel@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |RESOLVED Resolution| |FIXED ------- Comment #34 from pavel@novell.com 2007-01-20 12:51 MST ------- Yep, closed. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 ------- Comment #35 from seife@novell.com 2007-01-20 16:41 MST ------- Since this was reported against 10.2: is the fix in the 10.2 kernel repository so that we will push it out with the next update? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=223742 behlert@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|RESOLVED |CLOSED -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
participants (1)
-
bugzilla_noreply@novell.com