[Bug 1173248] New: Kernel 5.7.2 - nvidia Installer hangs with high system load
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248 Bug ID: 1173248 Summary: Kernel 5.7.2 - nvidia Installer hangs with high system load Classification: openSUSE Product: openSUSE Tumbleweed Version: Current Hardware: Other OS: Other Status: NEW Severity: Major Priority: P5 - None Component: Kernel Assignee: kernel-bugs@opensuse.org Reporter: axel.braun@gmx.de QA Contact: qa-bugs@suse.de Found By: --- Blocker: --- Created attachment 839025 --> http://bugzilla.opensuse.org/attachment.cgi?id=839025&action=edit Installation log Installation of current TW image causes issue if Nvidia-driver is installed: At installation of nvidia-glG05-440.82-30.1.x86_64 , system hangs under high system load. Previous compilation of driver fails, see attachment -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248#c1
--- Comment #1 from Axel Braun
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248#c5
Axel Braun
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248#c6
--- Comment #6 from Axel Braun
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248#c7
Stefan Dirsch
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248#c8
Axel Braun
That's still NVIDIA 440.82. Please try with NVIDIA 440.100 (repos have been updated yesterday).
Please scroll down in the attachment, nvidia 440.100 is loaded, but too late: kernel--default-devel-5.7.5-1.2.x86_64 is installed in step 82, and compiles against the old nvidia driver 176/182) Installieren: nvidia-gfxG05-kmp-default-440.100_k5.7.2_1-26.1.x86_64 comes much later around, delivering the new version, and compiles the new modules. The issue that it hangs at step 178 (178/182) Installieren: nvidia-glG05-440.100-26.1.x86_64 is probably the fact that it cant unload the nvidia modules completely. What brings me to this conclusion? I did the update again today, and before that I switched to the intel graphics: X1E:/home/docb # prime-select intel X1E:/home/docb # glxinfo | grep 'OpenGL renderer string' OpenGL renderer string: Mesa DRI Intel(R) UHD Graphics 630 (CFL GT2) When doing so, the message Cant unload nvidia.drm (or similar) scrolled through the terminal (After switching graphics you need to log off and on again to get into Intel) My guess is that this causes the system to hang. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248#c9
--- Comment #9 from Stefan Dirsch
(178/182) Installieren: nvidia-glG05-440.100-26.1.x86_64
Can't find this in the attached logfile. Indeed seems installation of 440.100 worked fine. Maybe you should check if none of the NVIDIA packages is installed twice in different versions. tumbleweed/x86_64/x11-video-nvidiaG05-440.100-26.1.x86_64.rpm tumbleweed/x86_64/nvidia-glG05-440.100-26.1.x86_64.rpm tumbleweed/x86_64/nvidia-computeG05-440.100-26.1.x86_64.rpm tumbleweed/x86_64/nvidia-gfxG05-kmp-default-440.100_k5.7.2_1-26.1.x86_64.rpm These should be installed. Mabye you need to uninstall a mess of nvidia packages and reinstall them proper again. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248
Stefan Dirsch
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248#c10
--- Comment #10 from Axel Braun
(178/182) Installieren: nvidia-glG05-440.100-26.1.x86_64
Can't find this in the attached logfile. Indeed seems installation of 440.100 worked fine. Maybe you should check if none of the NVIDIA packages is installed twice in different versions.
tumbleweed/x86_64/x11-video-nvidiaG05-440.100-26.1.x86_64.rpm tumbleweed/x86_64/nvidia-glG05-440.100-26.1.x86_64.rpm tumbleweed/x86_64/nvidia-computeG05-440.100-26.1.x86_64.rpm tumbleweed/x86_64/nvidia-gfxG05-kmp-default-440.100_k5.7.2_1-26.1.x86_64.rpm
X1E:/home/docb # rpm -qa | grep nvidia nvidia-glG05-440.100-26.1.x86_64 nvidia-gfxG05-kmp-default-440.100_k5.7.2_1-26.1.x86_64 x11-video-nvidiaG05-440.100-26.1.x86_64 nvidia-computeG05-440.100-26.1.x86_64 kernel-firmware-nvidia-20200610-1.1.noarch
These should be installed. Mabye you need to uninstall a mess of nvidia packages and reinstall them proper again.
Hm, that should not be the idea behind zypper dup ;-) BTW, the message when switching to intel driver is: modprobe: FATAL: Module nvidia_drm is in use. Best guess is that this module causes the issue -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248#c11
Stefan Dirsch
BTW, the message when switching to intel driver is:
What do you mean with switching to intel? Do you have an Optimus system with Intel/NVIDIA combo and are trying to use suse-prime?
modprobe: FATAL: Module nvidia_drm is in use. Best guess is that this module causes the issue
-- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248#c12
Axel Braun
What do you mean with switching to intel? Do you have an Optimus system with Intel/NVIDIA combo and are trying to use suse-prime?
Correct. Using suse-prime-bbswitch -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248
Stefan Dirsch
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248#c13
--- Comment #13 from Stefan Dirsch
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248#c14
Stefan Dirsch
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248#c15
Axel Braun
I'm not sure what you're trying to achieve here. NVIDIA mode or Intel mode or Intel mode with NVIDIA GPU completely off to save more power?
The upgrade (zypper dup) should work independent which GPU is activated. I wonder how people deal with the issue that have only a Nvidia card (changing it to ATI/AMD is not the answer here ;-) as they cant deactivate nvidia driver. Or maybe they do it in init 3. So, not sure if zypper people should look into this , or how we can find out why zypper hangs. I'm happy to have some more broken upgrades if it helps.... -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248#c16
--- Comment #16 from Stefan Dirsch
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248#c17
Stefan Dirsch
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248
http://bugzilla.opensuse.org/show_bug.cgi?id=1173248#c18
--- Comment #18 from Axel Braun
I'm afraid you need to report this to the zypper guys with zypper log, etc. They will tell you. I suggest to open a new bug once you can reproduce the issue with "zypper dup", because you won't find any longer the appropriated zypper logs meanwhile (overwritten by subsequent zypper runs). :-(
OK, will do. Thanks for your help! -- You are receiving this mail because: You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@suse.com