Bug ID 1209294
Summary New Thinkpad T14s GPU reset/crash with openSUSE Tumbleweed
Classification openSUSE
Product openSUSE Tumbleweed
Version Current
Hardware Other
OS Other
Status NEW
Severity Normal
Priority P5 - None
Component Basesystem
Assignee screening-team-bugs@suse.de
Reporter martin.liska@suse.com
QA Contact qa-bugs@suse.de
Found By ---
Blocker ---

I'm using latest TW:
$ uname -a
Linux kettlebell 6.2.4-1-default #1 SMP PREEMPT_DYNAMIC Sat Mar 11 10:13:47 UTC
2023 (0532a55) x86_64 x86_64 x86_64 GNU/Linux

with Gnome and time to time my display freezes (in HexChat e.g.) due to GPU
crash:

[ 8864.772439] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout,
signaled seq=101963, emitted seq=101965
[ 8864.773012] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information:
process  pid 0 thread  pid 0
[ 8864.773500] amdgpu 0000:33:00.0: amdgpu: GPU reset begin!
[ 8865.523942] amdgpu 0000:33:00.0: amdgpu: MODE2 reset
[ 8865.532678] amdgpu 0000:33:00.0: amdgpu: GPU reset succeeded, trying to
resume
[ 8865.532808] [drm] PCIE GART of 1024M enabled (table at 0x000000F43FC00000).
[ 8865.532868] [drm] PSP is resuming...
[ 8865.555025] [drm] reserve 0xa00000 from 0xf43e000000 for PSP TMR
[ 8865.892352] amdgpu 0000:33:00.0: amdgpu: RAS: optional ras ta ucode is not
available
[ 8865.904685] amdgpu 0000:33:00.0: amdgpu: RAP: optional rap ta ucode is not
available
[ 8865.904686] amdgpu 0000:33:00.0: amdgpu: SECUREDISPLAY: securedisplay ta
ucode is not available
[ 8865.904688] amdgpu 0000:33:00.0: amdgpu: SMU is resuming...
[ 8865.906884] amdgpu 0000:33:00.0: amdgpu: SMU is resumed successfully!
[ 8865.908842] [drm] DMUB hardware initialized: version=0x0400002E
[ 8866.415536] [drm] kiq ring mec 2 pipe 1 q 0
[ 8866.422382] [drm] VCN decode and encode initialized successfully(under DPG
Mode).
[ 8866.422910] [drm] JPEG decode initialized successfully.
[ 8866.422916] amdgpu 0000:33:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on
hub 0
[ 8866.422922] amdgpu 0000:33:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1
on hub 0
[ 8866.422925] amdgpu 0000:33:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4
on hub 0
[ 8866.422926] amdgpu 0000:33:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5
on hub 0
[ 8866.422927] amdgpu 0000:33:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6
on hub 0
[ 8866.422928] amdgpu 0000:33:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7
on hub 0
[ 8866.422930] amdgpu 0000:33:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8
on hub 0
[ 8866.422931] amdgpu 0000:33:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9
on hub 0
[ 8866.422933] amdgpu 0000:33:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10
on hub 0
[ 8866.422934] amdgpu 0000:33:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11
on hub 0
[ 8866.422936] amdgpu 0000:33:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on
hub 0
[ 8866.422937] amdgpu 0000:33:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on
hub 1
[ 8866.422939] amdgpu 0000:33:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1
on hub 1
[ 8866.422940] amdgpu 0000:33:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4
on hub 1
[ 8866.422942] amdgpu 0000:33:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on
hub 1
[ 8866.430012] amdgpu 0000:33:00.0: amdgpu: recover vram bo from shadow start
[ 8866.430014] amdgpu 0000:33:00.0: amdgpu: recover vram bo from shadow done
[ 8866.430029] amdgpu 0000:33:00.0: amdgpu: GPU reset(1) succeeded!
[ 8866.430186] gmc_v10_0_process_interrupt: 58 callbacks suppressed
[ 8866.430190] amdgpu 0000:33:00.0: amdgpu: [gfxhub] page fault (src_id:0
ring:40 vmid:7 pasid:32769, for process Xwayland pid 4428 thread Xwayland:cs0
pid 4433)
[ 8866.430198] amdgpu 0000:33:00.0: amdgpu:   in page starting at address
0x00008001004c0000 from client 0x1b (UTCL2)
[ 8866.430203] amdgpu 0000:33:00.0: amdgpu:
GCVM_L2_PROTECTION_FAULT_STATUS:0x00741051
[ 8866.430206] amdgpu 0000:33:00.0: amdgpu:      Faulty UTCL2 client ID: TCP
(0x8)
[ 8866.430209] amdgpu 0000:33:00.0: amdgpu:      MORE_FAULTS: 0x1
[ 8866.430211] amdgpu 0000:33:00.0: amdgpu:      WALKER_ERROR: 0x0
[ 8866.430213] amdgpu 0000:33:00.0: amdgpu:      PERMISSION_FAULTS: 0x5
[ 8866.430215] amdgpu 0000:33:00.0: amdgpu:      MAPPING_ERROR: 0x0
[ 8866.430217] amdgpu 0000:33:00.0: amdgpu:      RW: 0x1
[ 8866.430222] amdgpu 0000:33:00.0: amdgpu: [gfxhub] page fault (src_id:0
ring:40 vmid:7 pasid:32769, for process Xwayland pid 4428 thread Xwayland:cs0
pid 4433)
[ 8866.430227] amdgpu 0000:33:00.0: amdgpu:   in page starting at address
0x00008001004c0000 from client 0x1b (UTCL2)
[ 8866.430229] amdgpu 0000:33:00.0: amdgpu:
GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 8866.430232] amdgpu 0000:33:00.0: amdgpu:      Faulty UTCL2 client ID: CB/DB
(0x0)
[ 8866.430234] amdgpu 0000:33:00.0: amdgpu:      MORE_FAULTS: 0x0
[ 8866.430236] amdgpu 0000:33:00.0: amdgpu:      WALKER_ERROR: 0x0
[ 8866.430238] amdgpu 0000:33:00.0: amdgpu:      PERMISSION_FAULTS: 0x0
[ 8866.430239] amdgpu 0000:33:00.0: amdgpu:      MAPPING_ERROR: 0x0
[ 8866.430241] amdgpu 0000:33:00.0: amdgpu:      RW: 0x0

sudo lspci | grep VGA
33:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI]
Rembrandt [Radeon 680M] (rev d1)


You are receiving this mail because: