nvme nvme0: frozen state error detected, reset controller

3 Dec 2020

Hi,

I have an annoying issue with all of the recent kernels that come with
Leap 15.1 and 15.2: my SSD becomes read-only after a while. I can
reproduce the behavior almost 100% of the time by pushing the same heavy
I/O workload onto it. I have one "golden" kernel under which the issue
_never_ shows up (uptime easily over 45 days while pushing the same
workload multiple times a day). This "golden" kernel is:
vmlinuz-4.12.14-lp151.28.10-default.

Every kernel I have tried since then, through the Leap 15.1 updates and
now the latest Leap 15.2 (vmlinuz-5.3.18-lp152.50-default), has the
issue described above: the SSD fails and the system locks up. The last
log output that reaches the screen is:

pcieport 000:00:1d.4: DPC: unmasked uncorrectable error detected
nvme nvme0: frozen state error detected, reset controller

I would just stay with the "golden" kernel if it were not for some other
issues (an HDMI problem) that I have with /it/. The latest 5.3 kernel
definitely has the HDMI issue fixed, and I'd love to move on, but cannot
because of the SSD issue.

The drive is an Intel SSD, model number HBRPEKNX0202AH.

It's been years since I last built my own custom kernels, and I was 
really hoping to not have to do that again. Please let me know if there 
is any additional information that would be useful to address this issue.
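
In case it helps with collecting more data, here is a minimal sketch of
how the two messages quoted above could be filtered out of a saved
kernel log. It assumes the log is available as plain text; the function
name find_error_lines is made up for illustration, and the patterns are
taken only from the two lines shown above.

```python
import re

# Patterns copied from the two console messages quoted in this report.
# The device address and controller number are matched loosely on purpose.
PATTERNS = [
    re.compile(r"pcieport .*DPC: unmasked uncorrectable error detected"),
    re.compile(r"nvme nvme\d+: frozen state error detected, reset controller"),
]


def find_error_lines(log_text):
    """Return the lines of log_text that match either message, in order."""
    return [
        line
        for line in log_text.splitlines()
        if any(pattern.search(line) for pattern in PATTERNS)
    ]
```

Calling find_error_lines() on the text of a saved log returns only the
matching lines, which makes it easier to see whether the DPC message
always precedes the nvme reset.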

Best,
-Gerhard
