[Bug 1028575] New: System randomly freezes or crashes to the login screen, glitches until rebooted
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575 Bug ID: 1028575 Summary: System randomly freezes or crashes to the login screen, glitches until rebooted Classification: openSUSE Product: openSUSE Tumbleweed Version: Current Hardware: x86-64 OS: Other Status: NEW Severity: Major Priority: P5 - None Component: Kernel Assignee: kernel-maintainers@forge.provo.novell.com Reporter: sonichedgehog_hyperblast00@yahoo.com QA Contact: qa-bugs@suse.de Found By: --- Blocker: --- Approximately once every 1 to 3 days of uptime, the system experiences a sudden and inexplicable crash: The image completely freezes in place, although unlike similar crashes in the past I can keep moving the mouse pointer around. A few seconds afterward, I find myself in a black console... and a few seconds after that, I'm back at the login screen. If I attempt to log back in however, the image either freezes again or desktop effects are no longer working without any error message as to why. Not even forcefully restarting X11 (control + alt + backspace twice) fixes the remaining glitches, and the only way to truly recover the system is to also reboot. Although the crashes are completely random, I vaguely get the impression they might be happening when an event triggers certain desktop effects. Several times the crash occurred as a system tray notification popped up, whereas just now the system crashed while I was switching desktops in the middle of the desktop cube animation. Certain games might also have a probability of causing this. I use the free video drivers and default system packages, all latest versions of openSUSE Tumbleweed. My card is a Radeon R7 370, GCN 1.0 on RadeonSI. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c1
Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c2
--- Comment #2 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c3
--- Comment #3 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c4
--- Comment #4 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c5
--- Comment #5 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c6
--- Comment #6 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c7
--- Comment #7 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c8
--- Comment #8 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c9
--- Comment #9 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c10
--- Comment #10 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c11
--- Comment #11 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c13
Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c14
Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c15
--- Comment #15 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c16
Ismail Donmez
I feel that at this point, I should express my disappointment regarding the lack of attention this report has received over the course of a month. I marked it as high priority (which I believe it is), posted about this on the forum, and wrote about it to the Factory mailing list (where it was completely ignored). To this day, I still have no idea what this is or when and how it might be fixed.
Sorry for my silence but I am already working on upgrading llvm to 4.0 which I hope will fix this issue, if not then we can ask for a revert on the llvm mailing list for the problematic commit. Randomly reverting commits on llvm can have bad consequences. Sadly the Open Build Service seems to be slow these days and it takes a lot of back and forth to see if my commits work or not. You can see the progress in https://build.opensuse.org/package/show/devel:tools:compiler/llvm4 Thanks, ismail -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c17
--- Comment #17 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c18
kevin Zhu
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c19
--- Comment #19 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c20
--- Comment #20 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c21
--- Comment #21 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c22
--- Comment #22 from Ismail Donmez
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c23
--- Comment #23 from kevin Zhu
llvm4 is now accepted into Factory and should be in the next update.
Thanks Ismail. As an enthusiastic non-developer Linux user, I wonder whether there is any way to apply the patch myself without waiting for a build? Is there any guide, or just roughly the direction, that you may kindly show me? :) Many thanks, Kevin -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c24
--- Comment #24 from Ismail Donmez
(In reply to Ismail Donmez from comment #22)
llvm4 is now accepted into Factory and should be in the next update.
Thanks Ismail. As an enthusiastic non-developer Linux user, I wonder whether there is any way to apply the patch myself without waiting for a build? Is there any guide, or just roughly the direction, that you may kindly show me? :)
Since more than one package is involved here (llvm, Mesa, ...) I would really want you to expect the next update. It should happen this week, if not I'll create a test repo for you people. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c25
--- Comment #25 from Mircea Kitsune
llvm4 is now accepted into Factory and should be in the next update.
Thank you very much, that is great news! I'll monitor its progress at http://tumbleweed.boombatower.com and immediately update once it hits Tumbleweed. Let's just hope it truly makes the lockup go away... it's been happening for so long now, I'll only draw a verdict after I see it gone for at least a month. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c26
--- Comment #26 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c27
--- Comment #27 from Ismail Donmez
llvm 4.0.0 is now in openSUSE Tumbleweed: I have preformed a 'zypper dup', installed it, and restarted. Now it's time to see if this really makes the freeze go away.
Sadly Mesa was built with llvm 3.9 still, I asked the Factory maintainers to recompile all llvm dependencies and that should hopefully will be available soon. Thank you! -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c28
--- Comment #28 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c29
--- Comment #29 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c30
--- Comment #30 from Ismail Donmez
A 'zypper dup' earlier today upgraded my machine to Kernel 4.10.10 and reinstalled Mesa 17.0.3. I understand this means that llvm 4.0.0 should be in effect on both of these components, and if llvm was at fault the issue should now disappear. I've had more freezes during the last few days, and if somehow they continue even after this we'll seriously need to dig further. I will immediately let everyone know if I see another freeze from this point on.
Yes, please. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c31
--- Comment #31 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c32
--- Comment #32 from kevin Zhu
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c33
--- Comment #33 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c34
Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c35
--- Comment #35 from Ismail Donmez
Once again, I'm dealing with at least one system crash per day. The latest one happens even after upgrading to the 4.11.0 Kernel, meaning the error was ported to it as well.
I understand your frustration but basically logs show that you are getting random GPU lockups, searching Google for "radeon r7 370 lockup" shows a lot of results even from Windows. So at this point I believe this might be a hardware problem. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c36
Takashi Iwai
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c37
--- Comment #37 from Mircea Kitsune
I understand your frustration but basically logs show that you are getting random GPU lockups, searching Google for "radeon r7 370 lockup" shows a lot of results even from Windows. So at this point I believe this might be a hardware problem.
I considered a hardware problem, but that seems unlikely for many reasons: The video card is still very new (about an year now) and a solid Gigabyte model... I've had another Radeon / Gigabyte card before it, which lasted me for nearly a decade. This issue is also triggered only by the KDE desktop, when certain desktop effects play or I select some windows (even with effects turned off)... I play high-end games on my machine, yet even they don't trigger this exact crash! The behavior of the crash also closely indicates a software problem... hardware issues usually cause a total freeze (Kernel panic), whereas this only freezes the desktop (without mouse pointer) then causes the monitor to shut down. (In reply to Takashi Iwai from comment #36)
If it's a GPU lockup, and it's in the fairly recent upstream code, could you rather raise the report to upstream (e.g. bugzilla.freedesktop.org)? There you may get more attention.
Also, please don't touch the priority field of bugzilla. It's not the field the reporter may modify. You can adjust the severity, but not the priority. Thanks.
I have a report there as well, mirroring this one: https://bugs.freedesktop.org/show_bug.cgi?id=100306 Since only windows in KDE seem to trigger this specific crash, I'll probably make a third mirror on KDE's bug tracker as well... need to consider if it's related closely enough to KDE for that. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c38
--- Comment #38 from Ismail Donmez
(In reply to Ismail Donmez from comment #35)
I understand your frustration but basically logs show that you are getting random GPU lockups, searching Google for "radeon r7 370 lockup" shows a lot of results even from Windows. So at this point I believe this might be a hardware problem.
I considered a hardware problem, but that seems unlikely for many reasons: The video card is still very new (about an year now) and a solid Gigabyte model... I've had another Radeon / Gigabyte card before it, which lasted me for nearly a decade. This issue is also triggered only by the KDE desktop, when certain desktop effects play or I select some windows (even with effects turned off)... I play high-end games on my machine, yet even they don't trigger this exact crash! The behavior of the crash also closely indicates a software problem... hardware issues usually cause a total freeze (Kernel panic), whereas this only freezes the desktop (without mouse pointer) then causes the monitor to shut down.
I can't see if in this bug report but did you try to set kwin to use XRender instead of OpenGL? -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c39
--- Comment #39 from Mircea Kitsune
I can't see if in this bug report but did you try to set kwin to use XRender instead of OpenGL?
If you're referring to desktop compositing, I tried disabling it altogether. This might make the problem slightly rarer, but it is not the root of the cause: The freezes can still occur even without effects enabled at all. The trigger always seems to be selecting another window or popping up a new panel: If I put the mouse pointer near a panel to make it appear, or alt-tab switch to another window, or switch to a different desktop... that's when there's a chance that it happens. It never happens when I do something in any application, only when I do something that changes the desktop. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
Tomáš Chvátal
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
Tomáš Chvátal
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c40
--- Comment #40 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c41
--- Comment #41 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c42
--- Comment #42 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c43
--- Comment #43 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c44
--- Comment #44 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c45
--- Comment #45 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c46
--- Comment #46 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c47
--- Comment #47 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c48
--- Comment #48 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c49
--- Comment #49 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c50
--- Comment #50 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c51
--- Comment #51 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c52
Michal Srb
I feel that at this point, I should express my disappointment regarding the lack of attention this report has received over the course of a month. I marked it as high priority (which I believe it is), posted about this on the forum, and wrote about it to the Factory mailing list (where it was completely ignored). To this day, I still have no idea what this is or when and how it might be fixed.
You are running Tumbleweed - bleeding edge. And this issue is likely hardware specific - if we don't have your hardware, all we can do is guess what may be wrong. (In reply to Mircea Kitsune from comment #46)
Mesa: User error: GL_INVALID_OPERATION in glTexSubImage2D(invalid texture image)
Some application is using OpenGL badly. It shouldn't cause issue like this. (In reply to Mircea Kitsune from comment #48)
I find it remarkable how the cause of the crash appears to have immediately changed after me making the comment above yesterday; I tested my theory that desktop effects are the root for 2 months, yet the moment I publish my observations the behavior changes in less than a day. This further makes me concerned that someone might be deliberately programming this crash using vulnerabilities in system components, solely for how strange this coincidence is.
Nah, locking issues are typically random and rare. That makes them hard to reproduce and debug. Nobody is hacking you. (In reply to Mircea Kitsune from comment #49)
Created attachment 735281 [details] Memtest86 screenshot
Your system memory is ok, that's good, but says nothing about GPU. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c53
--- Comment #53 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c54
--- Comment #54 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c55
--- Comment #55 from Michal Srb
How do I get the full log? "dmesg -w" prints the data in the console, which has a character limit by default. If it's simply the output of "dmesg" after rebooting and logging back in, I already attached that here... however it is pretty old, so next time it happens I will make a new one.
You can redirect the output to a file, then get the file after reboot: dmesg -w > dmesg.txt Alternatively use journalctl to retrieve some of the older logs: journalctl --list-boots journalctl --boot=<ID> --dmesg > dmesg.txt -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c56
Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c57
--- Comment #57 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c58
--- Comment #58 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c59
--- Comment #59 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c60
--- Comment #60 from Mircea Kitsune
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575
http://bugzilla.opensuse.org/show_bug.cgi?id=1028575#c61
Michal Srb
participants (1)
-
bugzilla_noreply@novell.com