[kernel-bugs] [Bug 1178474] New: Short freezes for up to tens of seconds
http://bugzilla.opensuse.org/show_bug.cgi?id=1178474 Bug ID: 1178474 Summary: Short freezes for up to tens of seconds Classification: openSUSE Product: openSUSE Tumbleweed Version: Current Hardware: x86-64 OS: openSUSE Tumbleweed Status: NEW Severity: Major Priority: P5 - None Component: Kernel Assignee: kernel-bugs@opensuse.org Reporter: bengan@bag.org QA Contact: qa-bugs@suse.de Found By: --- Blocker: --- Since I zypper duped to TW 20201030 on my laptop[1] I've got freezes for some seconds up to tens of seconds. Nothing conclusive in the logs and nothing weird in dmesg. I suspect the kernel as I rebooted with the old kernel, 5.8.12-1-default, and everything worked as before. A weird work around from the freezes is that if I (in plasma) hit alt-tab the system goes back to being responsive, but only until next short freeze. Still nothing conclusive in the logs. [1] https://linux-hardware.org/?probe=543c444fdf -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1178474 http://bugzilla.opensuse.org/show_bug.cgi?id=1178474#c1 --- Comment #1 from Thomas Zimmermann <tzimmermann@suse.com> --- I have this bug since TW 20201119. I have Gnome 3.28 plus Kernel 5.9
Linux linux-uq9g 5.9.8-2-default #1 SMP Thu Nov 12 07:43:32 UTC 2020 (ea93937) x86_64 x86_64 x86_64 GNU/Linux
-- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1178474 http://bugzilla.opensuse.org/show_bug.cgi?id=1178474#c2 --- Comment #2 from Thomas Zimmermann <tzimmermann@suse.com> --- I have a T450 with Intel chipset: 00:02.0 VGA compatible controller: Intel Corporation HD Graphics 5500 (rev 09) (prog-if 00 [VGA controller]) Subsystem: Lenovo Device 5036 Flags: bus master, fast devsel, latency 0, IRQ 47 Memory at e0000000 (64-bit, non-prefetchable) [size=16M] Memory at c0000000 (64-bit, prefetchable) [size=512M] I/O ports at 3000 [size=64] Expansion ROM at 000c0000 [virtual] [disabled] [size=128K] Capabilities: [90] MSI: Enable+ Count=1/1 Maskable- 64bit- Capabilities: [d0] Power Management version 2 Capabilities: [a4] PCI Advanced Features Kernel driver in use: i915 Kernel modules: i915 I'll try to boot with the old kernel 5.8. -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1178474 Thomas Zimmermann <tzimmermann@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Summary|Short freezes for up to |Display freezes for up to |tens of seconds |tens of seconds with kernel | |5.9 -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1178474 http://bugzilla.opensuse.org/show_bug.cgi?id=1178474#c3 --- Comment #3 from Thomas Zimmermann <tzimmermann@suse.com> --- Quoting myself from the email to opensuse-factory
In regular intervals, gvim stops updating the screen and freezes. I have to flip to separate workspace, which un-freezes gvim. All my key strokes were processes, as I can see the characters after unfreezing. Also, sometimes the overview screen doesn't update the small workspace previews.
I suspect this is a bug in GNOME shell 3.38, which maybe doesn't compose the updated windows correctly.
I suspected a GNOME issue, although the bug report points in a different direction. -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1178474 http://bugzilla.opensuse.org/show_bug.cgi?id=1178474#c6 --- Comment #6 from Bengt G�rd�n <bengan@bag.org> --- Some new findings. I saw an email about fedora 33 working with 5.9 and so tried a live version with Rawhide and 5.10.0. No lockup. One difference was Wayland. So I rebooted my Opensuse TW with kernel 5.9.8 and switched to Wayland in SDDM. No lockups for 1h last night and no lockups for 1h this morning. Anyone got some suggestions how to proceed with the fault isolation with kernel 5.9, i915 and X? -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1178474 http://bugzilla.opensuse.org/show_bug.cgi?id=1178474#c7 --- Comment #7 from Thomas Zimmermann <tzimmermann@suse.com> --- (In reply to Bengt G�rd�n from comment #6)
Some new findings. I saw an email about fedora 33 working with 5.9 and so tried a live version with Rawhide and 5.10.0. No lockup. One difference was Wayland. So I rebooted my Opensuse TW with kernel 5.9.8 and switched to Wayland in SDDM. No lockups for 1h last night and no lockups for 1h this morning.
Great find. So it's probably in how the Xorg driver interacts with the kernel.
Anyone got some suggestions how to proceed with the fault isolation with kernel 5.9, i915 and X?
Ideally, you could bisect the issue with the upstream kernel. I'm trying this myself, but it goes really slow as I can only reproduce the bug on my daily work machine. What I found so far it that the issue got introduced somewhere between v5.8 and v5.9. Although others reported problems with v5.8, I can't reproduce them. -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1178474 http://bugzilla.opensuse.org/show_bug.cgi?id=1178474#c8 --- Comment #8 from Thomas Zimmermann <tzimmermann@suse.com> --- (In reply to Thomas Zimmermann from comment #7)
What I found so far it that the issue got introduced somewhere between v5.8 and v5.9. Although others reported problems with v5.8, I can't reproduce them.
I think I just saw one of these errors on v5.8. It's just a lot less often. I have to verify. :/ -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1178474 http://bugzilla.opensuse.org/show_bug.cgi?id=1178474#c9 --- Comment #9 from Bengt G�rd�n <bengan@bag.org> --- Yesterday I upgraded to TW 20201129 and there where still lockups. I googled around and found this link: https://linuxreviews.org/Linux_5.9_Is_Released_With_New_Drivers,_Improved_AMD_GPU_Support,_And_Support_The_x86-64_FSGSBASE_CPU_Instructions#Intel's_Also_In_The_GPU_Game_Now I started reading and there is ahci.mobile_lpm_policy=1 and intel_idle.max_cstate=1. I rebooted with those statements. Now I've been running 5.9.10 for 1.5h and no lockups. I'm not sure if both statements are needed. Will try later when everything seems stable enough. -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1178474 http://bugzilla.opensuse.org/show_bug.cgi?id=1178474#c10 --- Comment #10 from Bengt G�rd�n <bengan@bag.org> --- (In reply to Bengt G�rd�n from comment #9)
Will try later when everything seems stable enough.
Unfortunately it isn't. Uptime is 4h51m and I've had one short lockup so far. It's much better than before but not gone. -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1178474 http://bugzilla.opensuse.org/show_bug.cgi?id=1178474#c11 --- Comment #11 from Thomas Zimmermann <tzimmermann@suse.com> --- Found it in v5.7 -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1178474 http://bugzilla.opensuse.org/show_bug.cgi?id=1178474#c12 Martin Wilck <martin.wilck@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |martin.wilck@suse.com --- Comment #12 from Martin Wilck <martin.wilck@suse.com> --- (In reply to Bengt G�rd�n from comment #9)
I started reading and there is ahci.mobile_lpm_policy=1 and intel_idle.max_cstate=1.
What does this do to your battery life? -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1178474 http://bugzilla.opensuse.org/show_bug.cgi?id=1178474#c13 --- Comment #13 from Bengt G�rd�n <bengan@bag.org> --- (In reply to Martin Wilck from comment #12)
(In reply to Bengt G�rd�n from comment #9)
I started reading and there is ahci.mobile_lpm_policy=1 and intel_idle.max_cstate=1.
What does this do to your battery life?
Not sure right now but I will measure it after the weekend. Need to have this laptop going without a hiccup for a few days. But I suspect it's going to decrease the battery life. At least what I've read so far. I ran 5.9.10 over night and when I woke up this morning it was completely frozen. So the boot statements didn't help at all except it seems to have put off the inevitably for some time. Last night I ran 5.9.10 for about 6h with just one temporary lockup (around 10 sec). -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1178474 http://bugzilla.opensuse.org/show_bug.cgi?id=1178474#c14 Mark Draheim <rickscafe.casablanca@gmx.net> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |rickscafe.casablanca@gmx.ne | |t --- Comment #14 from Mark Draheim <rickscafe.casablanca@gmx.net> --- I wondered if I am the only one experiencing this. I have no hard lockups but I have keystrokes appearing after pauses and YT video freezing and unfreezing at regular intervals while audio keeps playing. The delayed showing of keys I typed is irritating without end. This started with kernel 5.9. Luckily, I pinned kernel 5.8.15, that shows none of these problems, and am now waiting for a fix. Laptop is a Lenovo Thinkbook 13s-IML -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1178474 http://bugzilla.opensuse.org/show_bug.cgi?id=1178474#c16 --- Comment #16 from Mark Draheim <rickscafe.casablanca@gmx.net> --- (In reply to Patrik Jakobsson from comment #15)
Can you try setting i915.enable_dc=0 and i915.enable_psr=0. Try them one at a time so we know which one (if any) helps.
for me, neither has any positive effect. Disabling dc made video playback stuttering a tiny tad less annoying but it came with visual artefacts like old window decorations blinking in. Disabling psr got me a hard lockup 30 seconds into the session. On a general note, I had screen lockups occasionally with kernels before 5.9 on this laptop i7-10th but not on an old Skylake laptop with integrated intel graphics. But the video stuttering definitely started with kernel 5.9. It shows in video playback freeze framing every few seconds. On the desktop I have window-fade-on-close which quite often yields the window stopping at half transparent and then closing as in the fading started, then screen does not update, then window is gone. It is probably the same with the typing keys lag, ie screen is simply not updating for a second, then the chars I typed appear at once. -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1178474 http://bugzilla.opensuse.org/show_bug.cgi?id=1178474#c17 --- Comment #17 from Takashi Iwai <tiwai@suse.com> --- If the bug is related with the power managing of i915: there is the update of i915 firmware files in the last week, and it might be worth to try. The updated kernel-firmware-* packages are found in OBS home:tiwai:branches:Kernel:HEAD/kernel-firmware repo, http://download.opensuse.org/repositories/home:/tiwai:/branches:/Kernel:/HEA... Can anyone test this? -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1178474 http://bugzilla.opensuse.org/show_bug.cgi?id=1178474#c18 --- Comment #18 from Thomas Zimmermann <tzimmermann@suse.com> --- Hi (In reply to Mark Draheim from comment #14)
I wondered if I am the only one experiencing this. I have no hard lockups but I have keystrokes appearing after pauses and YT video freezing and unfreezing at regular intervals while audio keeps playing. The delayed showing of keys I typed is irritating without end.
This started with kernel 5.9. Luckily, I pinned kernel 5.8.15, that shows none of these problems, and am now waiting for a fix.
I did some testing of older kernels and was able to see this issue with kernels at least as old as 5.7. I seems to have increased in frequency with 5.9, though. -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1178474 http://bugzilla.opensuse.org/show_bug.cgi?id=1178474#c19 --- Comment #19 from Takashi Iwai <tiwai@suse.com> --- Then we might be looking for multiple issues that appear as the similar behavior: the screen lockup. But one case is (supposedly happening often after the system resume) a complete screen freeze, while another one is a temporary freeze until the key stroke or some other action. But who knows... -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1178474 http://bugzilla.opensuse.org/show_bug.cgi?id=1178474#c20 --- Comment #20 from Mark Draheim <rickscafe.casablanca@gmx.net> --- for lack of ideas, I tried the old disable-vsync. And it does make a difference for me. I had plasma compositor vsync set to auto. Now switched to never. YT video now plays without freezeframing every few seconds. Typing seems fine, too. What puzzles me is that the micro freezes are not present with kernel 5.8 but they do show with every 5.9 kernel when vsync is set to auto. Anyway, worksforme and I am happy for now. -- You are receiving this mail because: You are the assignee for the bug.
participants (1)
-
bugzilla_noreply@suse.com