[Bug 1029634] New: kernel panic after wakeup from STR / on switching terminals / displays with 4.4.49-16-default
http://bugzilla.suse.com/show_bug.cgi?id=1029634 Bug ID: 1029634 Summary: kernel panic after wakeup from STR / on switching terminals / displays with 4.4.49-16-default Classification: openSUSE Product: openSUSE Distribution Version: Leap 42.2 Hardware: Other OS: Other Status: NEW Severity: Major Priority: P5 - None Component: Kernel Assignee: kernel-maintainers@forge.provo.novell.com Reporter: okurz@suse.com QA Contact: qa-bugs@suse.de CC: anton.smorodskyi@suse.com, sebastian.chlad@novell.com, tiwai@suse.com, wvvelzen@gmail.com Depends on: 1018911 Found By: --- Blocker: --- Created attachment 717661 --> http://bugzilla.suse.com/attachment.cgi?id=717661&action=edit screenshot showing kernel panic stack trace +++ This bug was initially created as a clone of Bug #1018911 +++ ## observation On DELL Latitude E7250 after waking up the system from STR in the docking station after some seconds the system freezes and only shows a blinking caps lock LED, i.e. kernel panic. I assume it can also be reproduced by switching displays with xrandr and/or also when switching from vt7 with X to vt1 under these circumstances, i.e. after wakeup from standby. Once so far I managed to switch to text terminal and observe an OOM message and a stack trace (see screenshot). As the problem commonly was triggered by switching displays and the stack trace mentions drm I assume the problem is related to the graphics driver but because it takes some seconds for the system to crash it might also be a critical OOM happening. kdump is enabled, no crash dump was recorded in the filesystem so far. Reason might be I am running cryptlvm? ## reproducible For me the procedure which seems to reproduce it consistently is as follows * put system to STR in the docking station * wait for system to be in STR * close lid * remove from dock * put it back to dock * open lid (system starts up) * call `xrandr --outpxrandr --output eDP1 --primary --auto --output DP1-1 --off; xrandr --output eDP1 --primary --auto --output DP1-1 --auto --left-of eDP1` * wait some seconds * observe the problem (blinking caps lock led, system does not react anymore) ## expected results last good should be linux 4.4.36, at least I do not recall observing this problem in before the last kernel update, at least not that often. Expected: Obviously the kernel should not crash here. ## problem H1. regression in kernel update H1.1 4.4.27-2.1 -> 4.4.49-16 H1.2 4.4.36-5.1 -> 4.4.49-16 H1.3 4.4.36-8.1 -> 4.4.49-16 H1.4 4.4.46-11.1 -> 4.4.49-16 H2. regression in graphics driver H3. OOM since some package update ## further information The problem might be related to what Anton is observing since 4.4.36 in bug 1018911 BIOS: [ 0.000000] DMI: Dell Inc. Latitude E7250/0TVD2T, BIOS A07 09/01/2015 kernel command line: [ 0.000000] Kernel command line: BOOT_IMAGE=/vmlinuz-4.4.49-16-default root=/dev/mapper/system-root ro loader=syslinux quiet resume=/dev/system/swap splash=silent quiet showopts crashkernel=103M,high crashkernel=72M,low graphics controller: 00:02.0 VGA compatible controller: Intel Corporation Broadwell-U Integrated Graphics (rev 09) -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1029634
Wilfred van Velzen
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c1
--- Comment #1 from Oliver Kurz
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c2
--- Comment #2 from Oliver Kurz
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c3
Oliver Kurz
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c4
--- Comment #4 from Oliver Kurz
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c5
--- Comment #5 from Oliver Kurz
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c6
Oliver Kurz
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c7
--- Comment #7 from Oliver Kurz
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c8
--- Comment #8 from Benjamin Brunner
http://bugzilla.suse.com/show_bug.cgi?id=1029634
Benjamin Brunner
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c9
--- Comment #9 from Oliver Kurz
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c10
Takashi Iwai
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c11
Oliver Kurz
Could you try with kernel-vanilla 4.4.49 to check whether the issue is reproducible or not?
good idea. It is *not* reproducible with kernel-vanilla-4.4.49 -> SUSE patches.
If it's about our own patches, I can give a kernel with the recent i915 changes reverted later.
I would appreciate that. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c12
--- Comment #12 from Oliver Kurz
If it's about our own patches, I can give a kernel with the recent i915 changes reverted later.
I would appreciate that.
Would it be more efficient if I build the kernel on my own to bisect? But for that probably I would a little hint where to start. I guess building a package from OBS locally and systematically commenting/uncommenting in a spec file might work? -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c13
--- Comment #13 from Takashi Iwai
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c14
--- Comment #14 from Oliver Kurz
I'm building a kernel with a few i915 patches reverted in IBS home:tiwai:test:bnc1029634 repo. Could you give it a try later?
tested, FAILED. same symptoms. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c15
--- Comment #15 from Takashi Iwai
4.4.36-8.1 can not reproduce the problem
REJECT H1.1 4.4.27-2.1 -> 4.4.49-16 REJECT H1.2 4.4.36-5.1 -> 4.4.49-16 SUPPORT H1.3, H1.4
4.4.46-11 can reproduce the problem
REJECT H1.3 4.4.36-8.1 -> 4.4.49-16 ACCEPT H1.4 4.4.46-11.1 -> 4.4.49-16
... but the subject states it's a regression from 4.4.46 to 4.4.49. Which is the correct information? If 4.4.46 works, the range is much narrower than 4.4.36. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c16
--- Comment #16 from Oliver Kurz
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c17
Oliver Kurz
http://bugzilla.suse.com/show_bug.cgi?id=1029634
Oliver Kurz
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c18
Takashi Iwai
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c19
Takashi Iwai
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c20
--- Comment #20 from Takashi Iwai
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c21
--- Comment #21 from Oliver Kurz
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c22
--- Comment #22 from Takashi Iwai
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c23
--- Comment #23 from Oliver Kurz
I managed to bisect and found the culprit patch. Ironically, it's a fix for S3 resume with SKL MST (bsc#1015359).
funny ;-) linux-4.4.56-2.1.g0b839db-default PASSED -> no crash (crosschecked that the crash still appears with kernel-default-4.4.49) so I confirm your revert of the patch fixes the problem for me. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1029634
Takashi Iwai
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c24
--- Comment #24 from Takashi Iwai
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c25
--- Comment #25 from Takashi Iwai
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c26
Takashi Iwai
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c27
Lyude Paul
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c28
--- Comment #28 from Takashi Iwai
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c29
--- Comment #29 from Takashi Iwai
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c30
--- Comment #30 from Takashi Iwai
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c31
--- Comment #31 from Takashi Iwai
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c32
--- Comment #32 from Takashi Iwai
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c33
--- Comment #33 from Takashi Iwai
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c34
--- Comment #34 from Takashi Iwai
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c35
--- Comment #35 from Takashi Iwai
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c36
Takashi Iwai
However, unfortunately, this doesn't fix the crash with 4.4.x kernel with the S3 fix backport, by some reason. It works for 4.11-rc4. Still investigating...
It turned out that I overlooked the stray update module. The fix itself works with 4.4.x backport, too, but the second patch seems not triggering the intel_dp_check_mst_status(), so the first patch would be needed for 4.4.x. Oliver, could you check whether the kernel in IBS home:tiwai:test:bnc1029634-2 repo works? -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1029634
Swamp Workflow Management
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c37
Oliver Kurz
Oliver, could you check whether the kernel in IBS home:tiwai:test:bnc1029634-2 repo works?
problem reproduced with 4.4.56-2.g0b839db-default -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c38
--- Comment #38 from Takashi Iwai
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c39
--- Comment #39 from Oliver Kurz
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c40
Takashi Iwai
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c41
--- Comment #41 from Oliver Kurz
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c42
Oliver Kurz
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c43
--- Comment #43 from Oliver Kurz
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c44
--- Comment #44 from Takashi Iwai
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c45
--- Comment #45 from Bernhard Wiedemann
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c46
--- Comment #46 from Oliver Kurz
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c47
Oliver Kurz
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c48
--- Comment #48 from Swamp Workflow Management
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c49
--- Comment #49 from Swamp Workflow Management
http://bugzilla.suse.com/show_bug.cgi?id=1029634
Swamp Workflow Management
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c50
Oliver Kurz
http://bugzilla.suse.com/show_bug.cgi?id=1029634
Swamp Workflow Management
http://bugzilla.suse.com/show_bug.cgi?id=1029634
http://bugzilla.suse.com/show_bug.cgi?id=1029634#c51
--- Comment #51 from Swamp Workflow Management
http://bugzilla.suse.com/show_bug.cgi?id=1029634
Swamp Workflow Management
http://bugzilla.suse.com/show_bug.cgi?id=1029634
Oliver Kurz
participants (1)
-
bugzilla_noreply@novell.com