Bug ID | 1219444 |
---|---|
Summary | amdgpu critical error |
Classification | openSUSE |
Product | openSUSE Distribution |
Version | Leap 15.5 |
Hardware | x86-64 |
OS | openSUSE Leap 15.5 |
Status | NEW |
Severity | Normal |
Priority | P5 - None |
Component | Kernel |
Assignee | kernel-bugs@opensuse.org |
Reporter | teuniz@protonmail.com |
QA Contact | qa-bugs@suse.de |
Target Milestone | --- |
Found By | --- |
Blocker | --- |
Created attachment 872372 [details]
Output of dmesg
The kernel crashes approx every 5 minutes.
I reverted back to kernel 5.14.21-150500.55.19-default because with that one it
crashes approx once a day.
Operating System: openSUSE Leap 15.5
KDE Plasma Version: 5.27.9
KDE Frameworks Version: 5.103.0
Qt Version: 5.15.8
Kernel Version: 5.14.21-150500.55.44-default (64-bit)
Graphics Platform: X11
Processors: 32 × 13th Gen Intel Core i9-13900K
Memory: 31.0 GiB of RAM
Graphics Processor: AMD Radeon Pro W6600
Manufacturer: HP
Product Name: HP Z2 Tower G9 Workstation Desktop PC
dmesg | grep amdgpu
[ 1.540640] [drm] amdgpu kernel modesetting enabled.
[ 1.540703] amdgpu: CRAT table not found
[ 1.540705] amdgpu: Virtual CRAT table created for CPU
[ 1.540712] amdgpu: Topology: Add CPU node
[ 1.542670] amdgpu 0000:03:00.0: amdgpu: Fetched VBIOS from VFCT
[ 1.542671] amdgpu: ATOM BIOS: 113-D5330400-100
[ 1.542770] amdgpu 0000:03:00.0: vgaarb: deactivate vga console
[ 1.542771] amdgpu 0000:03:00.0: amdgpu: Trusted Memory Zone (TMZ) feature
disabled as experimental (default)
[ 1.542799] amdgpu 0000:03:00.0: amdgpu: VRAM: 8176M 0x0000008000000000 -
0x00000081FEFFFFFF (8176M used)
[ 1.542800] amdgpu 0000:03:00.0: amdgpu: GART: 512M 0x0000000000000000 -
0x000000001FFFFFFF
[ 1.542801] amdgpu 0000:03:00.0: amdgpu: AGP: 267894784M 0x0000008400000000
- 0x0000FFFFFFFFFFFF
[ 1.542845] [drm] amdgpu: 8176M of VRAM memory ready
[ 1.542845] [drm] amdgpu: 15892M of GTT memory ready.
[ 1.548699] amdgpu 0000:03:00.0: amdgpu: PSP runtime database doesn't exist
[ 1.548704] amdgpu 0000:03:00.0: amdgpu: PSP runtime database doesn't exist
[ 2.854516] amdgpu 0000:03:00.0: amdgpu: STB initialized to 2048 entries
[ 2.895100] amdgpu 0000:03:00.0: amdgpu: Will use PSP to load VCN firmware
[ 3.094413] amdgpu 0000:03:00.0: amdgpu: RAS: optional ras ta ucode is not
available
[ 3.115717] amdgpu 0000:03:00.0: amdgpu: SECUREDISPLAY: securedisplay ta
ucode is not available
[ 3.115740] amdgpu 0000:03:00.0: amdgpu: smu driver if version = 0x0000000f,
smu fw if version = 0x00000013, smu fw program = 0, version = 0x003b2b00
(59.43.0)
[ 3.115745] amdgpu 0000:03:00.0: amdgpu: SMU driver if version not matched
[ 3.115777] amdgpu 0000:03:00.0: amdgpu: use vbios provided pptable
[ 3.165133] amdgpu 0000:03:00.0: amdgpu: SMU is initialized successfully!
[ 3.268063] kfd kfd: amdgpu: Allocated 3969056 bytes on gart
[ 3.268478] amdgpu: sdma_bitmap: ffff
[ 3.302091] amdgpu: HMM registered 8176MB device memory
[ 3.302135] amdgpu: SRAT table not found
[ 3.302136] amdgpu: Virtual CRAT table created for GPU
[ 3.302599] amdgpu: Topology: Add dGPU node [0x73e3:0x1002]
[ 3.302601] kfd kfd: amdgpu: added device 1002:73e3
[ 3.302617] amdgpu 0000:03:00.0: amdgpu: SE 2, SH per SE 2, CU per SH 8,
active_cu_number 28
[ 3.302658] amdgpu 0000:03:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on
hub 0
[ 3.302659] amdgpu 0000:03:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1
on hub 0
[ 3.302659] amdgpu 0000:03:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4
on hub 0
[ 3.302660] amdgpu 0000:03:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5
on hub 0
[ 3.302660] amdgpu 0000:03:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6
on hub 0
[ 3.302661] amdgpu 0000:03:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7
on hub 0
[ 3.302661] amdgpu 0000:03:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8
on hub 0
[ 3.302662] amdgpu 0000:03:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9
on hub 0
[ 3.302662] amdgpu 0000:03:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10
on hub 0
[ 3.302663] amdgpu 0000:03:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11
on hub 0
[ 3.302663] amdgpu 0000:03:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on
hub 0
[ 3.302664] amdgpu 0000:03:00.0: amdgpu: ring sdma1 uses VM inv eng 13 on
hub 0
[ 3.302665] amdgpu 0000:03:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on
hub 1
[ 3.302665] amdgpu 0000:03:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1
on hub 1
[ 3.302666] amdgpu 0000:03:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4
on hub 1
[ 3.302666] amdgpu 0000:03:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on
hub 1
[ 3.303573] [drm] Initialized amdgpu 3.49.0 20150101 for 0000:03:00.0 on
minor 0
[ 3.308709] fbcon: amdgpudrmfb (fb0) is primary device
[ 3.505728] amdgpu 0000:03:00.0: amdgpu: [mmhub] page fault (src_id:0
ring:157 vmid:0 pasid:0, for process pid 0 thread pid 0)
[ 3.505731] amdgpu 0000:03:00.0: amdgpu: in page starting at address
0x0000000006004000 from client 0x12 (VMC)
[ 3.505733] amdgpu 0000:03:00.0: amdgpu:
MMVM_L2_PROTECTION_FAULT_STATUS:0x0000073A
[ 3.505733] amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: DCEDMC
(0x3)
[ 3.505734] amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x0
[ 3.505735] amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x5
[ 3.505735] amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x3
[ 3.505735] amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x1
[ 3.505736] amdgpu 0000:03:00.0: amdgpu: RW: 0x0
[ 3.524299] amdgpu 0000:03:00.0: [drm] fb0: amdgpudrmfb frame buffer device
[ 4.537456] snd_hda_intel 0000:03:00.1: bound 0000:03:00.0 (ops
amdgpu_dm_audio_component_bind_ops [amdgpu])
[ 5.416287] amdgpu 0000:03:00.0: amdgpu: [mmhub] page fault (src_id:0
ring:157 vmid:0 pasid:0, for process pid 0 thread pid 0)
[ 5.416312] amdgpu 0000:03:00.0: amdgpu: in page starting at address
0x0000000006004000 from client 0x12 (VMC)
[ 5.416319] amdgpu 0000:03:00.0: amdgpu:
MMVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 5.416324] amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID:
unknown (0x0)
[ 5.416329] amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x0
[ 5.416333] amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x0
[ 5.416336] amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ 5.416340] amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 5.416343] amdgpu 0000:03:00.0: amdgpu: RW: 0x0
[ 73.156519] amdgpu 0000:03:00.0: amdgpu: [mmhub] page fault (src_id:0
ring:157 vmid:0 pasid:0, for process pid 0 thread pid 0)
[ 73.156538] amdgpu 0000:03:00.0: amdgpu: in page starting at address
0x0000000006004000 from client 0x12 (VMC)
[ 73.156546] amdgpu 0000:03:00.0: amdgpu:
MMVM_L2_PROTECTION_FAULT_STATUS:0x0000073A
[ 73.156551] amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: DCEDMC
(0x3)
[ 73.156562] amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x0
[ 73.156566] amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x5
[ 73.156570] amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x3
[ 73.156578] amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x1
[ 73.156582] amdgpu 0000:03:00.0: amdgpu: RW: 0x0
uname -a
5.14.21-150500.55.44-default #1 SMP PREEMPT_DYNAMIC Mon Jan 15 10:03:40 UTC
2024 (cc7d8b6) x86_64 x86_64 x86_64 GNU/Linux
lspci
VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Navi 23
WKS-XL [Radeon PRO W6600]