[Bug 1228773] New: Kernel / full system freezes for two minutes due to lack of I/O on NVME drive
https://bugzilla.suse.com/show_bug.cgi?id=1228773 Bug ID: 1228773 Summary: Kernel / full system freezes for two minutes due to lack of I/O on NVME drive Classification: openSUSE Product: openSUSE Tumbleweed Version: Current Hardware: Other OS: openSUSE Tumbleweed Status: NEW Severity: Major Priority: P5 - None Component: Kernel:Storage Assignee: kernel-bugs@suse.de Reporter: lassi.vaatamoinen@gmail.com QA Contact: qa-bugs@suse.de Target Milestone: --- Found By: --- Blocker: --- Created attachment 876445 --> https://bugzilla.suse.com/attachment.cgi?id=876445&action=edit Dmesg for the recent NVME incident The issue is sporadic. Sometimes happens several times a day, but there can be 10 days period with no incidences. It looks like under quick full CPU loads and/or heavy I/O usage, the system freezes for two minutes. HDD light flashes in 1-2 Hz frequency, on/off. -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1228773 https://bugzilla.suse.com/show_bug.cgi?id=1228773#c1 --- Comment #1 from Lassi Väätämöinen <lassi.vaatamoinen@gmail.com> --- Additional information: NVME: Kingston A2000 NVME Firmware: S5Z42109 Motherboard: MSI X470 GAMING PRO MAX BIOS: BIOS M.H0 10/15/2023 -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1228773 https://bugzilla.suse.com/show_bug.cgi?id=1228773#c2 --- Comment #2 from Lassi Väätämöinen <lassi.vaatamoinen@gmail.com> --- BIOS self-test for the NVME drive: OK SMART report for the drive: smartctl 7.4 2023-08-01 r5530 [x86_64-linux-6.10.2-1-default] (SUSE RPM) Copyright (C) 2002-23, Bruce Allen, Christian Franke, www.smartmontools.org === START OF INFORMATION SECTION === Model Number: KINGSTON SA2000M81000G Serial Number: 50026B768398B539 Firmware Version: S5Z42109 PCI Vendor/Subsystem ID: 0x2646 IEEE OUI Identifier: 0x0026b7 Controller ID: 1 NVMe Version: 1.3 Number of Namespaces: 1 Namespace 1 Size/Capacity: 1 000 204 886 016 [1,00 TB] Namespace 1 Utilization: 454 782 439 424 [454 GB] Namespace 1 Formatted LBA Size: 512 Namespace 1 IEEE EUI-64: 0026b7 68398b5395 Local Time is: Fri Aug 2 01:06:40 2024 EEST Firmware Updates (0x14): 2 Slots, no Reset required Optional Admin Commands (0x0017): Security Format Frmw_DL Self_Test Optional NVM Commands (0x005f): Comp Wr_Unc DS_Mngmt Wr_Zero Sav/Sel_Feat Timestmp Log Page Attributes (0x0f): S/H_per_NS Cmd_Eff_Lg Ext_Get_Lg Telmtry_Lg Maximum Data Transfer Size: 32 Pages Warning Comp. Temp. Threshold: 75 Celsius Critical Comp. Temp. Threshold: 80 Celsius Supported Power States St Op Max Active Idle RL RT WL WT Ent_Lat Ex_Lat 0 + 9.00W - - 0 0 0 0 0 0 1 + 4.60W - - 1 1 1 1 0 0 2 + 3.80W - - 2 2 2 2 0 0 3 - 0.0450W - - 3 3 3 3 2000 2000 4 - 0.0040W - - 4 4 4 4 15000 15000 Supported LBA Sizes (NSID 0x1) Id Fmt Data Metadt Rel_Perf 0 + 512 0 0 === START OF SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED SMART/Health Information (NVMe Log 0x02) Critical Warning: 0x00 Temperature: 42 Celsius Available Spare: 100% Available Spare Threshold: 10% Percentage Used: 3% Data Units Read: 19 848 929 [10,1 TB] Data Units Written: 29 694 373 [15,2 TB] Host Read Commands: 227 241 862 Host Write Commands: 378 903 392 Controller Busy Time: 5 820 Power Cycles: 970 Power On Hours: 4 703 Unsafe Shutdowns: 22 Media and Data Integrity Errors: 0 Error Information Log Entries: 0 Warning Comp. Temperature Time: 0 Critical Comp. Temperature Time: 0 Thermal Temp. 1 Transition Count: 76 Thermal Temp. 1 Total Time: 185 Error Information (NVMe Log 0x01, 16 of 256 entries) No Errors Logged Self-test Log (NVMe Log 0x06) Self-test status: No self-test in progress Num Test_Description Status Power_on_Hours Failing_LBA NSID Seg SCT Code 0 Extended Completed without error 4703 - - - - - 1 Extended Completed without error 1775 - - - - - -- You are receiving this mail because: You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@suse.com