Thomas Zimmermann changed bug 1201205
What Removed Added
Flags needinfo?  

Comment # 18 on bug 1201205 from
Hi

(In reply to Dronskowski from comment #13)
> Without attaching the full journal, on failing boots I find this kernel bug:
> 
> > Jul 05 18:36:06 localhost kernel: [drm] amdgpu kernel modesetting enabled.
> > Jul 05 18:36:06 localhost kernel: amdgpu: Ignoring ACPI CRAT on non-APU system
> > Jul 05 18:36:06 localhost kernel: amdgpu: Virtual CRAT table created for CPU
> > Jul 05 18:36:06 localhost kernel: amdgpu: Topology: Add CPU node
> > Jul 05 18:36:06 localhost kernel: Console: switching to colour dummy device 80x25
> > Jul 05 18:36:06 localhost systemd-udevd[363]: Worker [364] terminated by signal 9 (KILL)
> > Jul 05 18:36:06 localhost systemd-udevd[363]: 0000:01:00.0: Worker [364] failed
> > Jul 05 18:36:06 localhost kernel: BUG: kernel NULL pointer dereference, address: 0000000000000008
> > Jul 05 18:36:06 localhost kernel: #PF: supervisor read access in kernel mode
> > Jul 05 18:36:06 localhost kernel: #PF: error_code(0x0000) - not-present page
> > Jul 05 18:36:06 localhost kernel: PGD 0 P4D 0 
> > Jul 05 18:36:06 localhost kernel: Oops: 0000 [#1] PREEMPT SMP NOPTI
> > Jul 05 18:36:06 localhost kernel: CPU: 3 PID: 364 Comm: systemd-udevd Tainted: G           OE     5.18.9-6.ge00841d-default #1 openSUSE Tumbleweed (unreleased) 4bea8a49b83056be532b84db201d290b7bce70fb
> > Jul 05 18:36:06 localhost kernel: Hardware name: System manufacturer System Product Name/PRIME X370-PRO, BIOS 6042 04/28/2022
> > Jul 05 18:36:06 localhost kernel: RIP: 0010:kernfs_find_and_get_ns+0x11/0x70
> > Jul 05 18:36:06 localhost kernel: Code: 08 48 83 40 40 01 49 8b 46 08 48 83 40 58 01 31 c0 eb d1 66 0f 1f 44 00 00 0f 1f 44 00 00 41 55 49 89 d5 41 54 49 89 f4 55 53 <48> 8b 47 08 48 89 fb 48 85 c0 48 0f 44 c7 48 8b 68 50 48 83 c5 60
> > Jul 05 18:36:06 localhost kernel: RSP: 0018:ffffa5a802083a38 EFLAGS: 00010246
> > Jul 05 18:36:06 localhost kernel: RAX: 0000000000000000 RBX: ffffffffb119fc20 RCX: ffffa5a802083a10
> > Jul 05 18:36:06 localhost kernel: RDX: 0000000000000000 RSI: ffffffffb119fd68 RDI: 0000000000000000
> > Jul 05 18:36:06 localhost kernel: RBP: 0000000000000000 R08: 0000000000000040 R09: 00000000fcf00000
> > Jul 05 18:36:06 localhost kernel: R10: 0000000000000000 R11: ffff88c0c77e829c R12: ffffffffb119fd68
> > Jul 05 18:36:06 localhost kernel: R13: 0000000000000000 R14: ffff88c0d0c8d7c0 R15: 0000000000000000
> > Jul 05 18:36:06 localhost kernel: FS:  00007f1114895b00(0000) GS:ffff88c7ceac0000(0000) knlGS:0000000000000000
> > Jul 05 18:36:06 localhost kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> > Jul 05 18:36:06 localhost kernel: CR2: 0000000000000008 CR3: 0000000107a46000 CR4: 00000000003506e0
> > Jul 05 18:36:06 localhost kernel: Call Trace:
> > Jul 05 18:36:06 localhost kernel:  <TASK>
> > Jul 05 18:36:06 localhost kernel:  sysfs_unmerge_group+0x18/0x60
> > Jul 05 18:36:06 localhost kernel:  dpm_sysfs_remove+0x20/0x60
> > Jul 05 18:36:06 localhost kernel:  device_del+0xb2/0x3f0
> > Jul 05 18:36:06 localhost kernel:  platform_device_del.part.0+0x13/0x70
> > Jul 05 18:36:06 localhost kernel:  platform_device_unregister+0x1c/0x30
> > Jul 05 18:36:06 localhost kernel:  sysfb_disable+0x2b/0x60
> > Jul 05 18:36:06 localhost kernel:  remove_conflicting_framebuffers+0x1b/0xc0
> > Jul 05 18:36:06 localhost kernel:  remove_conflicting_pci_framebuffers+0xce/0x120
> > Jul 05 18:36:06 localhost kernel:  drm_aperture_remove_conflicting_pci_framebuffers+0x57/0x80
> > Jul 05 18:36:06 localhost kernel:  amdgpu_pci_probe+0x126/0x3c0 [amdgpu ab2a35e28bca10ea2bed443b5ef9d0bdfa6ec825]
> > Jul 05 18:36:06 localhost kernel:  local_pci_probe+0x41/0x80
> > Jul 05 18:36:06 localhost kernel:  pci_device_probe+0xc3/0x220
> > Jul 05 18:36:06 localhost kernel:  really_probe+0x1a1/0x370
> > Jul 05 18:36:06 localhost kernel:  __driver_probe_device+0xfc/0x170
> > Jul 05 18:36:06 localhost kernel:  driver_probe_device+0x1f/0x90
> > Jul 05 18:36:06 localhost kernel:  __driver_attach+0xbb/0x190
> > Jul 05 18:36:06 localhost kernel:  ? __device_attach_driver+0xe0/0xe0
> > Jul 05 18:36:06 localhost kernel:  bus_for_each_dev+0x72/0xb0
> > Jul 05 18:36:06 localhost kernel:  bus_add_driver+0x159/0x200
> > Jul 05 18:36:06 localhost kernel:  driver_register+0x89/0xd0
> > Jul 05 18:36:06 localhost kernel:  ? 0xffffffffc1113000
> > Jul 05 18:36:06 localhost kernel:  do_one_initcall+0x44/0x200
> > Jul 05 18:36:06 localhost kernel:  ? kmem_cache_alloc_trace+0x177/0x350
> > Jul 05 18:36:06 localhost kernel:  do_init_module+0x4a/0x250
> > Jul 05 18:36:06 localhost kernel:  __do_sys_init_module+0x138/0x1b0
> > Jul 05 18:36:06 localhost kernel:  do_syscall_64+0x5b/0x80
> > Jul 05 18:36:06 localhost kernel:  ? __vm_munmap+0x90/0x110
> > Jul 05 18:36:06 localhost kernel:  ? syscall_exit_to_user_mode+0x17/0x40
> > Jul 05 18:36:06 localhost kernel:  ? do_syscall_64+0x67/0x80
> > Jul 05 18:36:06 localhost kernel:  entry_SYSCALL_64_after_hwframe+0x44/0xae
> > Jul 05 18:36:06 localhost kernel: RIP: 0033:0x7f11153a202e
> > Jul 05 18:36:06 localhost kernel: Code: 48 8b 0d fd 9d 0e 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 49 89 ca b8 af 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d ca 9d 0e 00 f7 d8 64 89 01 48
> > Jul 05 18:36:06 localhost kernel: RSP: 002b:00007ffd92dad4a8 EFLAGS: 00000246 ORIG_RAX: 00000000000000af
> > Jul 05 18:36:06 localhost kernel: RAX: ffffffffffffffda RBX: 0000562b842b8ef0 RCX: 00007f11153a202e
> > Jul 05 18:36:06 localhost kernel: RDX: 0000562b842b9370 RSI: 00000000010cd35f RDI: 00007f11128c7010
> > Jul 05 18:36:06 localhost kernel: RBP: 0000562b842b9370 R08: 0000000000261000 R09: 85ebca77c2b2ae63
> > Jul 05 18:36:06 localhost kernel: R10: 00000000000331a1 R11: 0000000000000246 R12: 0000000000020000
> > Jul 05 18:36:06 localhost kernel: R13: 0000000000000000 R14: 0000562b842fa050 R15: 0000562b842b9370
> > Jul 05 18:36:06 localhost kernel:  </TASK>
> > Jul 05 18:36:06 localhost kernel: Modules linked in: amdgpu(+) crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel crypto_simd cryptd sp5100_tco xhci_pci xhci_pci_renesas ccp nvme xhci_hcd nvme_core drm_ttm_helper ttm usbcore iommu_v2 gpu_sched drm_dp_helper video wmi btrfs blake2b_generic libcrc32c crc32c_intel xor raid6_pq l2tp_ppp l2tp_netlink l2tp_core ip6_udp_tunnel udp_tunnel pppox ppp_generic slhc v4l2loopback(OE) videodev mc sg dm_multipath dm_mod scsi_dh_rdac scsi_dh_emc scsi_dh_alua msr efivarfs
> > Jul 05 18:36:06 localhost kernel: CR2: 0000000000000008
> > Jul 05 18:36:06 localhost kernel: ---[ end trace 0000000000000000 ]---
> > Jul 05 18:36:06 localhost kernel: RIP: 0010:kernfs_find_and_get_ns+0x11/0x70
> > Jul 05 18:36:06 localhost kernel: Code: 08 48 83 40 40 01 49 8b 46 08 48 83 40 58 01 31 c0 eb d1 66 0f 1f 44 00 00 0f 1f 44 00 00 41 55 49 89 d5 41 54 49 89 f4 55 53 <48> 8b 47 08 48 89 fb 48 85 c0 48 0f 44 c7 48 8b 68 50 48 83 c5 60
> > Jul 05 18:36:06 localhost kernel: RSP: 0018:ffffa5a802083a38 EFLAGS: 00010246
> > Jul 05 18:36:06 localhost kernel: RAX: 0000000000000000 RBX: ffffffffb119fc20 RCX: ffffa5a802083a10
> > Jul 05 18:36:06 localhost kernel: RDX: 0000000000000000 RSI: ffffffffb119fd68 RDI: 0000000000000000
> > Jul 05 18:36:06 localhost kernel: RBP: 0000000000000000 R08: 0000000000000040 R09: 00000000fcf00000
> > Jul 05 18:36:06 localhost kernel: R10: 0000000000000000 R11: ffff88c0c77e829c R12: ffffffffb119fd68
> > Jul 05 18:36:06 localhost kernel: R13: 0000000000000000 R14: ffff88c0d0c8d7c0 R15: 0000000000000000
> > Jul 05 18:36:06 localhost kernel: FS:  00007f1114895b00(0000) GS:ffff88c7ceac0000(0000) knlGS:0000000000000000
> > Jul 05 18:36:06 localhost kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> > Jul 05 18:36:06 localhost kernel: CR2: 0000000000000008 CR3: 0000000107a46000 CR4: 00000000003506e0

Thank you so much for this stacktrace. This looks like a problem that we
recently fixed in the upstream kernel. Yesterday, I backported the patch into
out stable branch. So this problem should be fixed in the next kernel update.

Sorry for all the inconvenience with the graphics stack in recent weeks. We try
to modernize it, but it's not a trivial issue.


You are receiving this mail because: