[Bug 450256] New: System freezes during resume from disk
https://bugzilla.novell.com/show_bug.cgi?id=450256 Summary: System freezes during resume from disk Product: openSUSE 11.1 Version: RC 1 Platform: i686 OS/Version: SuSE Other Status: NEW Severity: Major Priority: P5 - None Component: Kernel AssignedTo: bnc-team-screening@forge.provo.novell.com ReportedBy: radomir.cernoch@gmail.com QAContact: qa@suse.de Found By: Community User When resuming the system from "suspend to disk", the machine freezes completely. I am using a fresh openSUSE 11.1-RC1 with 2.6.27.7-4-default kernel on IBM ThinkPad T42. It is very likely that the problem is related to the graphical card as it hangs approximately around the time, when X server gets started. Moreover after the freeze CapsLock LED starts blinking, which resemlbes me the struggle with proprietal ATI drivers on a different notebook. The problem does not occur during suspend/resume to/from RAM. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=450256
User radomir.cernoch@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c1
--- Comment #1 from Radomír Černoch
https://bugzilla.novell.com/show_bug.cgi?id=450256
User radomir.cernoch@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c2
--- Comment #2 from Radomír Černoch
https://bugzilla.novell.com/show_bug.cgi?id=450256
Andreas Jaeger
https://bugzilla.novell.com/show_bug.cgi?id=450256
User seife@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c3
Stefan Seyfried
https://bugzilla.novell.com/show_bug.cgi?id=450256
User mhopf@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c4
Matthias Hopf
https://bugzilla.novell.com/show_bug.cgi?id=450256
User radomir.cernoch@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c5
--- Comment #5 from Radomír Černoch
https://bugzilla.novell.com/show_bug.cgi?id=450256
Radomír Černoch
https://bugzilla.novell.com/show_bug.cgi?id=450256
User rjw@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c6
Rafael Wysocki
https://bugzilla.novell.com/show_bug.cgi?id=450256
User radomir.cernoch@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c7
Radomír Černoch
https://bugzilla.novell.com/show_bug.cgi?id=450256
User rjw@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c8
Rafael Wysocki
Suspend/resume (s&r) from battery causes a freeze always. The only difference is that the blinking CapsLock blinks slower as the CPU frequency is lower ;-). I have even tried pluging in the AC during the hibernation, but without a success.
So, the box always crashes during resume from hibernation if it has been hibernated on battery power. Is that correct? If that is the case, have you tried to unload the battery module before hibernation?
Actually the ACPI was the first thing that came to my mind. But why would the system freeze always at the point when the "suspend console" (Does it have a proper name?) switches back to either X server or normal console (tty)?
Devices are being resumed at that point.
The memory corruption of GPU seems more logical, doesn't it?
Not necessarily. Your observations made in comment #5 pretty much exclude an X issue, unless they are incorrect. ;-)
And do you know any way, how this could be debugged?
In case it was a kernel (or X?) bug, who should I contact first? And is there a way how to some kind of kernel oops in such conditions?
First, as I said before, I think it is a kernel issue. Second, as you said the box worked correctly with the kernel from openSUSE 10.3, do you have any experience with building and installing the kernel yourself? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=450256
User radomir.cernoch@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c9
Radomír Černoch
So, the box always crashes during resume from hibernation if it has been hibernated on battery power. Is that correct?
Not exactly. Putting all evidence together the notebook crashes always when it goes on battery (at least for a while) during the hibernation (when it is already turned off). Running the actual process of suspending or hibernation from battery or AC makes no difference.
If that is the case, have you tried to unload the battery module before hibernation?
Wow, this helps! Unloading the battery makes the hibernation safe. (tested 3x)
Devices are being resumed at that point. ... Not necessarily. Your observations made in comment #5 pretty much exclude an X issue, unless they are incorrect. ;-)
Oh, I didn't know. It seems that I'm faster writing than thinking :-) Sorry.
First, as I said before, I think it is a kernel issue. Second, as you said the box worked correctly with the kernel from openSUSE 10.3, do you have any experience with building and installing the kernel yourself?
There should not be any problem about it. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=450256
User trenn@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c10
--- Comment #10 from Thomas Renninger
And do you know any way, how this could be debugged? And is there away how to some kind of kernel oops in such conditions? Serial console? IBM ThinkPad T42 This should still have one? There is a boot option (I do not find it in kernel-parameters?), to keep the console up during suspend/resume. You might want to increase the ACPI output first: echo 0x1f >/sys/module/acpi/parameters/debug_level or even (can be a lot output): echo 0x21f >/sys/module/acpi/parameters/debug_level
Then suspend and check the last lines it executes you see on serial console (with this boot param to not disable serial) when it freezes. If it points to ACPI, attach acpidump output of the system. Rafael/Pavel might want to correct/enhance my comments. BTW: Is this possibly related to a Docking station? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=450256
User radomir.cernoch@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c11
--- Comment #11 from Radomír Černoch
Serial console? This should still have one?
Unfortunately not. A serial port can be accessed through the docking station, which I do not have. Do you think it would be possible to do this through USB-serial converter if the appropriate module was inserted into initrd?
If it points to ACPI, attach acpidump output of the system.
Doesn't the fact, that removal of 'battery' module makes the system work reliably, actually show that it is an ACPI problem? I would like to give you the info from the serial console, but currently I am lacking the equipment..
BTW: Is this possibly related to a Docking station?
The notebook is not attached to a docking station. Nevertheless I cannot rule out the possibility that some part of openSUSE responsible of docking is actually causing the trouble. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=450256
User trenn@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c12
Thomas Renninger
Do you think it would be possible to do this through USB-serial converter No, it could be done via firewire (firescope), but I doubt you have that HW.
I expect all we can do is what Rafael already suggested: - If that is the case, have you tried to unload the battery module before hibernation? - Second, as you said the box worked correctly with the kernel from openSUSE 10.3, do you have any experience with building and installing the kernel yourself For the latter, he probably thinks about git bisecting. You have to get the latest git kernel and use git bisect (passing the last kernel which you know worked). You then have build, install and boot the newly built kernel. Then tell git bisect whether it worked or not and it will choose the next set of patches worth testing until the offending patch is found. On a first round you might want try out with the latest vanilla kernels (.28-rcX -> maybe it's already fixed?, .27 -> maybe it's a SUSE patch?, .26 -> if it works it's a nice starting point for git bisect). For getting the latest Linux kernel git sources do: git clone git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux-2.6 linux-2.6 Finding the offending patch can be cumbersome, maybe someone has a better idea, I do not have any. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=450256
User radomir.cernoch@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c13
Radomír Černoch
No, it could be done via firewire (firescope), but I doubt you have that HW.
Right, I do not have this device. :-(
I expect all we can do is what Rafael already suggested: - If that is the case, have you tried to unload the battery module before hibernation?
As I have already written, unloading the "battery" module helps and the suspend/resume becomes reliable.
For the latter, he probably thinks about git bisecting.
Ok, I will try the bisecting. I already have some experience with building a custom vanilla kernel. But from the last time I tried it (~ 2.6.14), something must have changed in the system: When doing the sequence # make oldconfig # make # make install # make modules_install # mkinitrd the newly installed kernel can no longer find the root partition. Is there a turotial for building vanilla kernels on openSUSE? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=450256
User mhopf@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c14
Matthias Hopf
https://bugzilla.novell.com/show_bug.cgi?id=450256
User radomir.cernoch@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c15
Radomír Černoch
I already have some experience with building a custom vanilla kernel. But from the last time I tried it (~ 2.6.14), something must have changed in the system: When doing the sequence # make oldconfig # make # make install # make modules_install # mkinitrd the newly installed kernel can no longer find the root partition. Is there a turotial for building vanilla kernels on openSUSE?
I do no longer need the help (just a few modules got renamed from vanilla to SUSE flavour). I am starting the bisection... 4000 revisions to go... I will let you know ASAP. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=450256
User radomir.cernoch@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c16
--- Comment #16 from Radomír Černoch
https://bugzilla.novell.com/show_bug.cgi?id=450256
Radomír Černoch
Should I post it to LKML as well That's Rafael to decide, he probably will be the one who will look at this in
https://bugzilla.novell.com/show_bug.cgi?id=450256
User trenn@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c17
Thomas Renninger
https://bugzilla.novell.com/show_bug.cgi?id=450256
User radomir.cernoch@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c18
--- Comment #18 from Radomír Černoch
That's Rafael to decide, he probably will be the one who will look at this in the end anyway?
Ok, so far I am not posting anything anywhere. If you needed some more info or testing, just drop me an e-mail. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=450256
User radomir.cernoch@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c19
Radomír Černoch
https://bugzilla.novell.com/show_bug.cgi?id=450256
User radomir.cernoch@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c20
Radomír Černoch
https://bugzilla.novell.com/show_bug.cgi?id=450256
User rjw@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c21
--- Comment #21 from Rafael Wysocki
https://bugzilla.novell.com/show_bug.cgi?id=450256
User radomir.cernoch@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c22
--- Comment #22 from Radomír Černoch
https://bugzilla.novell.com/show_bug.cgi?id=450256
User rjw@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c23
--- Comment #23 from Rafael Wysocki
https://bugzilla.novell.com/show_bug.cgi?id=450256
User radomir.cernoch@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c24
--- Comment #24 from Radomír Černoch
https://bugzilla.novell.com/show_bug.cgi?id=450256
User rjw@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c25
--- Comment #25 from Rafael Wysocki
Created an attachment (id=258754) --> (https://bugzilla.novell.com/attachment.cgi?id=258754) [details] Output of 'dmidecode'
Thanks, I'm going to take care of this tomorrow.
By the way, do you have an idea, how many systems could be affected, by this? I was just wondering, whether it is worth adding any kind of GUI checkbox somewhere in YaST to activate the workaroung you sent me.
Well, you're the first person reporting it and so far the only one, so I guess not too many. ;-) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=450256
User rjw@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c26
--- Comment #26 from Rafael Wysocki
https://bugzilla.novell.com/show_bug.cgi?id=450256
Luc Verhaegen
https://bugzilla.novell.com/show_bug.cgi?id=450256
User radomir.cernoch@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c27
Radomír Černoch
https://bugzilla.novell.com/show_bug.cgi?id=450256
User rjw@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c28
--- Comment #28 from Rafael Wysocki
https://bugzilla.novell.com/show_bug.cgi?id=450256
User rjw@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c29
Rafael Wysocki
https://bugzilla.novell.com/show_bug.cgi?id=450256
Rafael Wysocki
https://bugzilla.novell.com/show_bug.cgi?id=450256
User lmuelle@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c30
Lars Müller
https://bugzilla.novell.com/show_bug.cgi?id=450256
User lmuelle@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c31
--- Comment #31 from Lars Müller
https://bugzilla.novell.com/show_bug.cgi?id=450256
User lmuelle@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c32
Lars Müller
https://bugzilla.novell.com/show_bug.cgi?id=450256
User seife@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c33
--- Comment #33 from Stefan Seyfried
https://bugzilla.novell.com/show_bug.cgi?id=450256
User lmuelle@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c34
Lars Müller
https://bugzilla.novell.com/show_bug.cgi?id=450256
Lars Müller
https://bugzilla.novell.com/show_bug.cgi?id=450256
User radomir.cernoch@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c35
Radomír Černoch
https://bugzilla.novell.com/show_bug.cgi?id=450256
User rjw@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c36
--- Comment #36 from Rafael Wysocki
https://bugzilla.novell.com/show_bug.cgi?id=450256
User radomir.cernoch@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c37
--- Comment #37 from Radomír Černoch
https://bugzilla.novell.com/show_bug.cgi?id=450256
User rjw@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c38
--- Comment #38 from Rafael Wysocki
https://bugzilla.novell.com/show_bug.cgi?id=450256
User radomir.cernoch@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c39
--- Comment #39 from Radomír Černoch
https://bugzilla.novell.com/show_bug.cgi?id=450256
User rjw@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c40
--- Comment #40 from Rafael Wysocki
https://bugzilla.novell.com/show_bug.cgi?id=450256
User rodelrod@hotmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c41
Rodrigo Daunoravicius
https://bugzilla.novell.com/show_bug.cgi?id=450256
User radomir.cernoch@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c42
--- Comment #42 from Radomír Černoch
https://bugzilla.novell.com/show_bug.cgi?id=450256
User rodelrod@hotmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c43
--- Comment #43 from Rodrigo Daunoravicius
https://bugzilla.novell.com/show_bug.cgi?id=450256
User rjw@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=450256#c44
--- Comment #44 from Rafael Wysocki
Without 'acpi_sleep=s4_nohwsig':
Resume from hibernate always fails if the AC power is unplugged at any point, even if no periferals are attached, wether or not there is a battery in the unit.
With 'acpi_sleep=s4_nohwsig':
Resume works even if AC power is unplugged but (this I found later) there is one case where resume from both suspend and hibernate fail: when I resume/hibernate while docked, it won't resume after undocked, wether or not there is a battery on the unit and wether or not there are any peripherals (one external monitor, network, AC power) attached to the UltraDock. The reverse case works fine, I can hibernate/suspend while undocked and resume once docked.
I think this is an upstream bug. To verify this you can test the kernel from ftp://ftp.suse.com/pub/projects/kernel/kotd/HEAD/ (ie. 2.6.29-rc7+). -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@novell.com