[Bug 1188745] New: kexec reboot fails since 5.13.4
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745 Bug ID: 1188745 Summary: kexec reboot fails since 5.13.4 Classification: openSUSE Product: openSUSE Tumbleweed Version: Current Hardware: Other OS: Other Status: NEW Severity: Normal Priority: P5 - None Component: Kernel Assignee: kernel-bugs@opensuse.org Reporter: mrueckert@suse.com QA Contact: qa-bugs@suse.de CC: sndirsch@suse.com Found By: --- Blocker: --- Jul 26 01:47:44 fortress kernel: iommu ivhd0: AMD-Vi: Event logged [INVALID_DEVICE_REQUEST device=0a:00.0 pasid=0x00000 address=0xfffffffdf8000000 flags=0x0a00] Jul 26 01:47:48 fortress kernel: NVRM: GPU 0000:0a:00.0: RmInitAdapter failed! (0x23:0x65:1204) Jul 26 01:47:48 fortress kernel: NVRM: GPU 0000:0a:00.0: rm_init_adapter failed, device minor number 0 Jul 26 01:47:48 fortress kernel: [drm:nv_drm_load [nvidia_drm]] *ERROR* [nvidia-drm] [GPU ID 0x00000a00] Failed to allocate NvKmsKapiDevice Jul 26 01:47:48 fortress kernel: [drm:nv_drm_probe_devices [nvidia_drm]] *ERROR* [nvidia-drm] [GPU ID 0x00000a00] Failed to register device The same worked fine with the same version of the nvidia driver (470.57.02) on 5.13.2. I can provide full boot log if needed. -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745#c1
--- Comment #1 from Stefan Dirsch
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745#c2
--- Comment #2 from Marcus R�ckert
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745#c3
--- Comment #3 from Stefan Dirsch
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745
Stefan Dirsch
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745#c4
Marcus R�ckert
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745#c5
--- Comment #5 from Stefan Dirsch
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745#c6
Marcus R�ckert
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745#c7
Stefan Dirsch
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745
Stefan Dirsch
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745#c10
--- Comment #10 from Joerg Roedel
Jul 26 01:47:44 fortress kernel: iommu ivhd0: AMD-Vi: Event logged [INVALID_DEVICE_REQUEST device=0a:00.0 pasid=0x00000 address=0xfffffffdf8000000 flags=0x0a00]
This is an interrupt translation request while IRQ remapping is disabled for the device. Do the command line parameters differ between the first and the kexec kernel? There are no AMD IOMMU driver changes between 5.13.2 and 5.13.4, only ARM-SMMU and Intel VT-d fixes. So nothing changed on the AMD driver side.
I can provide full boot log if needed.
What hardware does this happen on? A full boot log might also help. -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745#c12
--- Comment #12 from Marcus R�ckert
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745#c13
--- Comment #13 from Joerg Roedel
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745#c14
--- Comment #14 from Marcus R�ckert
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745#c15
--- Comment #15 from Joerg Roedel
bisect would mean rebuilding the kernel over and over and rebooting the machine for each kernel?
Yes, there are 610 commits between 5.13.2 and 5.13.4, so ~9-10 rounds of compiling/testing. I don't see another way to find the offending commit, sorry. -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745#c16
--- Comment #16 from Marcus R�ckert
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745#c17
--- Comment #17 from Marcus R�ckert
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745#c18
--- Comment #18 from Joerg Roedel
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745
http://bugzilla.opensuse.org/show_bug.cgi?id=1188745#c19
--- Comment #19 from Stefan Dirsch
participants (1)
-
bugzilla_noreply@suse.com