[Bug 1201205] Kernel-default 5.18.9 hangs on boot
https://bugzilla.suse.com/show_bug.cgi?id=1201205 https://bugzilla.suse.com/show_bug.cgi?id=1201205#c18 Thomas Zimmermann <tzimmermann@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Flags|needinfo? | --- Comment #18 from Thomas Zimmermann <tzimmermann@suse.com> --- Hi (In reply to Dronskowski from comment #13)
Without attaching the full journal, on failing boots I find this kernel bug:
Jul 05 18:36:06 localhost kernel: [drm] amdgpu kernel modesetting enabled. Jul 05 18:36:06 localhost kernel: amdgpu: Ignoring ACPI CRAT on non-APU system Jul 05 18:36:06 localhost kernel: amdgpu: Virtual CRAT table created for CPU Jul 05 18:36:06 localhost kernel: amdgpu: Topology: Add CPU node Jul 05 18:36:06 localhost kernel: Console: switching to colour dummy device 80x25 Jul 05 18:36:06 localhost systemd-udevd[363]: Worker [364] terminated by signal 9 (KILL) Jul 05 18:36:06 localhost systemd-udevd[363]: 0000:01:00.0: Worker [364] failed Jul 05 18:36:06 localhost kernel: BUG: kernel NULL pointer dereference, address: 0000000000000008 Jul 05 18:36:06 localhost kernel: #PF: supervisor read access in kernel mode Jul 05 18:36:06 localhost kernel: #PF: error_code(0x0000) - not-present page Jul 05 18:36:06 localhost kernel: PGD 0 P4D 0 Jul 05 18:36:06 localhost kernel: Oops: 0000 [#1] PREEMPT SMP NOPTI Jul 05 18:36:06 localhost kernel: CPU: 3 PID: 364 Comm: systemd-udevd Tainted: G OE 5.18.9-6.ge00841d-default #1 openSUSE Tumbleweed (unreleased) 4bea8a49b83056be532b84db201d290b7bce70fb Jul 05 18:36:06 localhost kernel: Hardware name: System manufacturer System Product Name/PRIME X370-PRO, BIOS 6042 04/28/2022 Jul 05 18:36:06 localhost kernel: RIP: 0010:kernfs_find_and_get_ns+0x11/0x70 Jul 05 18:36:06 localhost kernel: Code: 08 48 83 40 40 01 49 8b 46 08 48 83 40 58 01 31 c0 eb d1 66 0f 1f 44 00 00 0f 1f 44 00 00 41 55 49 89 d5 41 54 49 89 f4 55 53 <48> 8b 47 08 48 89 fb 48 85 c0 48 0f 44 c7 48 8b 68 50 48 83 c5 60 Jul 05 18:36:06 localhost kernel: RSP: 0018:ffffa5a802083a38 EFLAGS: 00010246 Jul 05 18:36:06 localhost kernel: RAX: 0000000000000000 RBX: ffffffffb119fc20 RCX: ffffa5a802083a10 Jul 05 18:36:06 localhost kernel: RDX: 0000000000000000 RSI: ffffffffb119fd68 RDI: 0000000000000000 Jul 05 18:36:06 localhost kernel: RBP: 0000000000000000 R08: 0000000000000040 R09: 00000000fcf00000 Jul 05 18:36:06 localhost kernel: R10: 0000000000000000 R11: ffff88c0c77e829c R12: ffffffffb119fd68 Jul 05 18:36:06 localhost kernel: R13: 0000000000000000 R14: ffff88c0d0c8d7c0 R15: 0000000000000000 Jul 05 18:36:06 localhost kernel: FS: 00007f1114895b00(0000) GS:ffff88c7ceac0000(0000) knlGS:0000000000000000 Jul 05 18:36:06 localhost kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Jul 05 18:36:06 localhost kernel: CR2: 0000000000000008 CR3: 0000000107a46000 CR4: 00000000003506e0 Jul 05 18:36:06 localhost kernel: Call Trace: Jul 05 18:36:06 localhost kernel: <TASK> Jul 05 18:36:06 localhost kernel: sysfs_unmerge_group+0x18/0x60 Jul 05 18:36:06 localhost kernel: dpm_sysfs_remove+0x20/0x60 Jul 05 18:36:06 localhost kernel: device_del+0xb2/0x3f0 Jul 05 18:36:06 localhost kernel: platform_device_del.part.0+0x13/0x70 Jul 05 18:36:06 localhost kernel: platform_device_unregister+0x1c/0x30 Jul 05 18:36:06 localhost kernel: sysfb_disable+0x2b/0x60 Jul 05 18:36:06 localhost kernel: remove_conflicting_framebuffers+0x1b/0xc0 Jul 05 18:36:06 localhost kernel: remove_conflicting_pci_framebuffers+0xce/0x120 Jul 05 18:36:06 localhost kernel: drm_aperture_remove_conflicting_pci_framebuffers+0x57/0x80 Jul 05 18:36:06 localhost kernel: amdgpu_pci_probe+0x126/0x3c0 [amdgpu ab2a35e28bca10ea2bed443b5ef9d0bdfa6ec825] Jul 05 18:36:06 localhost kernel: local_pci_probe+0x41/0x80 Jul 05 18:36:06 localhost kernel: pci_device_probe+0xc3/0x220 Jul 05 18:36:06 localhost kernel: really_probe+0x1a1/0x370 Jul 05 18:36:06 localhost kernel: __driver_probe_device+0xfc/0x170 Jul 05 18:36:06 localhost kernel: driver_probe_device+0x1f/0x90 Jul 05 18:36:06 localhost kernel: __driver_attach+0xbb/0x190 Jul 05 18:36:06 localhost kernel: ? __device_attach_driver+0xe0/0xe0 Jul 05 18:36:06 localhost kernel: bus_for_each_dev+0x72/0xb0 Jul 05 18:36:06 localhost kernel: bus_add_driver+0x159/0x200 Jul 05 18:36:06 localhost kernel: driver_register+0x89/0xd0 Jul 05 18:36:06 localhost kernel: ? 0xffffffffc1113000 Jul 05 18:36:06 localhost kernel: do_one_initcall+0x44/0x200 Jul 05 18:36:06 localhost kernel: ? kmem_cache_alloc_trace+0x177/0x350 Jul 05 18:36:06 localhost kernel: do_init_module+0x4a/0x250 Jul 05 18:36:06 localhost kernel: __do_sys_init_module+0x138/0x1b0 Jul 05 18:36:06 localhost kernel: do_syscall_64+0x5b/0x80 Jul 05 18:36:06 localhost kernel: ? __vm_munmap+0x90/0x110 Jul 05 18:36:06 localhost kernel: ? syscall_exit_to_user_mode+0x17/0x40 Jul 05 18:36:06 localhost kernel: ? do_syscall_64+0x67/0x80 Jul 05 18:36:06 localhost kernel: entry_SYSCALL_64_after_hwframe+0x44/0xae Jul 05 18:36:06 localhost kernel: RIP: 0033:0x7f11153a202e Jul 05 18:36:06 localhost kernel: Code: 48 8b 0d fd 9d 0e 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 49 89 ca b8 af 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d ca 9d 0e 00 f7 d8 64 89 01 48 Jul 05 18:36:06 localhost kernel: RSP: 002b:00007ffd92dad4a8 EFLAGS: 00000246 ORIG_RAX: 00000000000000af Jul 05 18:36:06 localhost kernel: RAX: ffffffffffffffda RBX: 0000562b842b8ef0 RCX: 00007f11153a202e Jul 05 18:36:06 localhost kernel: RDX: 0000562b842b9370 RSI: 00000000010cd35f RDI: 00007f11128c7010 Jul 05 18:36:06 localhost kernel: RBP: 0000562b842b9370 R08: 0000000000261000 R09: 85ebca77c2b2ae63 Jul 05 18:36:06 localhost kernel: R10: 00000000000331a1 R11: 0000000000000246 R12: 0000000000020000 Jul 05 18:36:06 localhost kernel: R13: 0000000000000000 R14: 0000562b842fa050 R15: 0000562b842b9370 Jul 05 18:36:06 localhost kernel: </TASK> Jul 05 18:36:06 localhost kernel: Modules linked in: amdgpu(+) crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel crypto_simd cryptd sp5100_tco xhci_pci xhci_pci_renesas ccp nvme xhci_hcd nvme_core drm_ttm_helper ttm usbcore iommu_v2 gpu_sched drm_dp_helper video wmi btrfs blake2b_generic libcrc32c crc32c_intel xor raid6_pq l2tp_ppp l2tp_netlink l2tp_core ip6_udp_tunnel udp_tunnel pppox ppp_generic slhc v4l2loopback(OE) videodev mc sg dm_multipath dm_mod scsi_dh_rdac scsi_dh_emc scsi_dh_alua msr efivarfs Jul 05 18:36:06 localhost kernel: CR2: 0000000000000008 Jul 05 18:36:06 localhost kernel: ---[ end trace 0000000000000000 ]--- Jul 05 18:36:06 localhost kernel: RIP: 0010:kernfs_find_and_get_ns+0x11/0x70 Jul 05 18:36:06 localhost kernel: Code: 08 48 83 40 40 01 49 8b 46 08 48 83 40 58 01 31 c0 eb d1 66 0f 1f 44 00 00 0f 1f 44 00 00 41 55 49 89 d5 41 54 49 89 f4 55 53 <48> 8b 47 08 48 89 fb 48 85 c0 48 0f 44 c7 48 8b 68 50 48 83 c5 60 Jul 05 18:36:06 localhost kernel: RSP: 0018:ffffa5a802083a38 EFLAGS: 00010246 Jul 05 18:36:06 localhost kernel: RAX: 0000000000000000 RBX: ffffffffb119fc20 RCX: ffffa5a802083a10 Jul 05 18:36:06 localhost kernel: RDX: 0000000000000000 RSI: ffffffffb119fd68 RDI: 0000000000000000 Jul 05 18:36:06 localhost kernel: RBP: 0000000000000000 R08: 0000000000000040 R09: 00000000fcf00000 Jul 05 18:36:06 localhost kernel: R10: 0000000000000000 R11: ffff88c0c77e829c R12: ffffffffb119fd68 Jul 05 18:36:06 localhost kernel: R13: 0000000000000000 R14: ffff88c0d0c8d7c0 R15: 0000000000000000 Jul 05 18:36:06 localhost kernel: FS: 00007f1114895b00(0000) GS:ffff88c7ceac0000(0000) knlGS:0000000000000000 Jul 05 18:36:06 localhost kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Jul 05 18:36:06 localhost kernel: CR2: 0000000000000008 CR3: 0000000107a46000 CR4: 00000000003506e0
Thank you so much for this stacktrace. This looks like a problem that we recently fixed in the upstream kernel. Yesterday, I backported the patch into out stable branch. So this problem should be fixed in the next kernel update. Sorry for all the inconvenience with the graphics stack in recent weeks. We try to modernize it, but it's not a trivial issue. -- You are receiving this mail because: You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@suse.com