[Bug 1229039] New: Kernels 6.10.2-1 and 6.10.3-1 do not boot, hard reset after 1 minute of black screen
https://bugzilla.suse.com/show_bug.cgi?id=1229039 Bug ID: 1229039 Summary: Kernels 6.10.2-1 and 6.10.3-1 do not boot, hard reset after 1 minute of black screen Classification: openSUSE Product: openSUSE Tumbleweed Version: Current Hardware: x86-64 OS: openSUSE Tumbleweed Status: NEW Severity: Normal Priority: P5 - None Component: Kernel Assignee: kernel-bugs@opensuse.org Reporter: opensuse@erezsh.org QA Contact: qa-bugs@suse.de Target Milestone: --- Found By: --- Blocker: --- Redirected to open a bug report following: https://forums.opensuse.org/t/sometimes-linux-is-not-booting-on-latest-kerne... Problem: Last kernel I can boot on is 6.9.9-1. Grub2 EFI starts as expected. When selecting to boot with either 6.10 kernels available to me, I get a black screen. No logs at all. After around 1 minute I get a hard reset. It seems to be exactly 1 minute every time but I haven't timed it. Expected: To be able to boot. Additional information: I'm using Dell Latitude 7440 laptop. Secure boot is off. ``` ❯ inxi -GSaz System: Kernel: 6.9.9-1-default arch: x86_64 bits: 64 compiler: gcc v: 13.3.0 clocksource: tsc avail: acpi_pm parameters: BOOT_IMAGE=/boot/vmlinuz-6.9.9-1-default root=/dev/mapper/system-root mitigations=auto quiet security=apparmor xe.force_probe=a7a1 i915.force_probe=!a7a1 Desktop: KDE Plasma v: 6.1.3 tk: Qt v: N/A info: frameworks v: 6.4.0 wm: kwin_wayland tools: avail: xscreensaver vt: 3 dm: SDDM Distro: openSUSE Tumbleweed 20240807 Graphics: Device-1: Intel Raptor Lake-P [Iris Xe Graphics] vendor: Dell driver: xe v: kernel alternate: i915 arch: Gen-13 process: Intel 7 (10nm) built: 2022+ ports: active: DP-5,eDP-1 empty: DP-1, DP-2, DP-3, DP-4, HDMI-A-1 bus-ID: 0000:00:02.0 chip-ID: 8086:a7a1 class-ID: 0300 Display: wayland server: X.org v: 1.21.1.12 with: Xwayland v: 24.1.1 compositor: kwin_wayland driver: X: loaded: modesetting unloaded: fbdev,vesa alternate: intel dri: iris gpu: xe d-rect: 3840x1200 display-ID: 0 Monitor-1: DP-5 pos: right res: 1920x1080 size: N/A modes: N/A Monitor-2: eDP-1 pos: primary,left res: 1920x1200 size: N/A modes: N/A API: EGL v: 1.5 hw: drv: intel iris platforms: device: 0 drv: iris device: 1 drv: swrast gbm: drv: iris surfaceless: drv: iris wayland: drv: iris x11: drv: iris API: OpenGL v: 4.6 compat-v: 4.5 vendor: intel mesa v: 24.1.3 glx-v: 1.4 direct-render: yes renderer: Mesa Intel Graphics (RPL-U) device-ID: 8086:a7a1 memory: 15 GiB unified: yes display-ID: :1.0 API: Vulkan v: 1.3.290 layers: 1 device: 0 type: integrated-gpu name: Intel Graphics (RPL-U) driver: N/A device-ID: 8086:a7a1 surfaces: xcb,xlib,wayland ``` ``` ❯ ll /boot total 64162 lrwxrwxrwx 1 root root 42 Aug 2 19:23 config-6.10.2-1-default -> ../usr/lib/modules/6.10.2-1-default/config lrwxrwxrwx 1 root root 42 Aug 9 09:32 config-6.10.3-1-default -> ../usr/lib/modules/6.10.3-1-default/config lrwxrwxrwx 1 root root 41 Jul 16 20:15 config-6.9.9-1-default -> ../usr/lib/modules/6.9.9-1-default/config drwxr-xr-x 4 root root 2048 Jan 1 1970 efi drwxr-xr-x 1 root root 98 Aug 9 09:33 grub2 lrwxrwxrwx 1 root root 23 Aug 9 09:32 initrd -> initrd-6.10.3-1-default -rw------- 1 root root 65640212 Aug 9 09:33 initrd-6.9.9-1-default lrwxrwxrwx 1 root root 47 Aug 2 19:23 sysctl.conf-6.10.2-1-default -> ../usr/lib/modules/6.10.2-1-default/sysctl.conf lrwxrwxrwx 1 root root 47 Aug 9 09:32 sysctl.conf-6.10.3-1-default -> ../usr/lib/modules/6.10.3-1-default/sysctl.conf lrwxrwxrwx 1 root root 46 Jul 16 20:15 sysctl.conf-6.9.9-1-default -> ../usr/lib/modules/6.9.9-1-default/sysctl.conf lrwxrwxrwx 1 root root 46 Aug 2 19:23 System.map-6.10.2-1-default -> ../usr/lib/modules/6.10.2-1-default/System.map lrwxrwxrwx 1 root root 46 Aug 9 09:32 System.map-6.10.3-1-default -> ../usr/lib/modules/6.10.3-1-default/System.map lrwxrwxrwx 1 root root 45 Jul 16 20:15 System.map-6.9.9-1-default -> ../usr/lib/modules/6.9.9-1-default/System.map lrwxrwxrwx 1 root root 24 Aug 9 09:32 vmlinuz -> vmlinuz-6.10.3-1-default lrwxrwxrwx 1 root root 43 Aug 2 19:23 vmlinuz-6.10.2-1-default -> ../usr/lib/modules/6.10.2-1-default/vmlinuz lrwxrwxrwx 1 root root 43 Aug 9 09:32 vmlinuz-6.10.3-1-default -> ../usr/lib/modules/6.10.3-1-default/vmlinuz lrwxrwxrwx 1 root root 42 Jul 16 20:15 vmlinuz-6.9.9-1-default -> ../usr/lib/modules/6.9.9-1-default/vmlinuz ``` -- You are receiving this mail because: You are the assignee for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1229039 https://bugzilla.suse.com/show_bug.cgi?id=1229039#c1 Felix Miata <mrmazda@earthlink.net> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |mrmazda@earthlink.net --- Comment #1 from Felix Miata <mrmazda@earthlink.net> --- Looks like a duplicate of bug 1228929. -- You are receiving this mail because: You are the assignee for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1229039 https://bugzilla.suse.com/show_bug.cgi?id=1229039#c2 --- Comment #2 from Erez Shomron <opensuse@erezsh.org> --- I believe this is a duplicate of https://bugzilla.opensuse.org/show_bug.cgi?id=1226809 Because I had ZFS installed. There are two problems here: 1. The zfs package is not in sync with the official repository. 2. The kernel module is auto-installed when you run `zypper in zfs` ``` ❯ sudo zypper in --dry-run zfs [sudo] password for root: Loading repository data... Reading installed packages... Resolving package dependencies... The following recommended package was automatically selected: zfs-ueficert The following 9 NEW packages are going to be installed: libnvpair3 libuutil3 libzfs4 libzfs_core3 libzpool5 zfs zfs-bash-completion zfs-kmp-default zfs-ueficert ``` -- You are receiving this mail because: You are the assignee for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1229039 https://bugzilla.suse.com/show_bug.cgi?id=1229039#c3 --- Comment #3 from Erez Shomron <opensuse@erezsh.org> --- Sorry, for the additional comment, I meant duplicate of: https://bugzilla.opensuse.org/show_bug.cgi?id=1219799 -- You are receiving this mail because: You are the assignee for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1229039 https://bugzilla.suse.com/show_bug.cgi?id=1229039#c4 Egge <hpe@airpost.net> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |hpe@airpost.net --- Comment #4 from Egge <hpe@airpost.net> --- I am also having this issue, and I am not using zfs. Did a sudo zypper dup --no-allow-vendor-change and at the next boot the issue appeared. I have not done any config/boot changes myself. Booting without the quiet and splash kernel-parameters, the last line shown is [1.406198][T186] clocksource: Switched to clocksource tsc then it freezes and after a little while it reboots (possibly after 1 min as mentioned, have not timed it either). Booting with only the root kernel-parameter makes no difference. MB: Asrock x570 pro4 CPU: AMD Ryzen 7 3800X 8-Core Processor
inxi -GSaz System: Kernel: 6.9.9-1-default arch: x86_64 bits: 64 compiler: gcc v: 13.3.0 clocksource: tsc avail: hpet,acpi_pm parameters: BOOT_IMAGE=/vmlinuz-6.9.9-1-default root=UUID=bdf30380-f75e-4158-89c4-427325017708 quiet splash=silent rd.driver.pre=vfio-pci vfio-pci.ids=1002:6811,1002:aab0,1912:0014 mitigations=auto Desktop: KDE Plasma v: 6.1.4 tk: Qt v: N/A info: frameworks v: 6.4.0 wm: kwin_x11 tools: avail: xscreensaver vt: 2 dm: SDDM Distro: openSUSE Tumbleweed 20240808 Graphics: Device-1: AMD Curacao PRO [Radeon R7 370 / R9 270/370 OEM] vendor: Micro-Star MSI driver: vfio-pci v: N/A alternate: radeon,amdgpu arch: GCN-1 code: Southern Islands process: TSMC 28nm built: 2011-20 pcie: gen: 3 speed: 8 GT/s lanes: 4 link-max: lanes: 16 bus-ID: 05:00.0 chip-ID: 1002:6811 class-ID: 0300 Device-2: AMD Navi 23 [Radeon RX 6600/6600 XT/6600M] vendor: ASUSTeK driver: amdgpu v: kernel arch: RDNA-2 code: Navi-2x process: TSMC n7 (7nm) built: 2020-22 pcie: gen: 4 speed: 16 GT/s lanes: 16 ports: active: DP-1, DP-2, DP-3, HDMI-A-1 empty: Writeback-1 bus-ID: 0e:00.0 chip-ID: 1002:73ff class-ID: 0300 Display: x11 server: X.Org v: 21.1.12 with: Xwayland v: 24.1.1 compositor: kwin_x11 driver: X: loaded: amdgpu dri: radeonsi gpu: amdgpu display-ID: :0 screens: 1 Screen-1: 0 s-res: 5760x2104 s-dpi: 96 s-size: 1520x555mm (59.84x21.85") s-diag: 1618mm (63.71") Monitor-1: DP-1 mapped: DisplayPort-0 pos: primary,2-2 model: HP 25x serial: <filter> built: 2019 res: 1920x1080 dpi: 90 gamma: 1.2 size: 544x303mm (21.42x11.93") diag: 623mm (24.5") ratio: 16:9 modes: max: 1920x1080 min: 720x400 Monitor-2: DP-2 mapped: DisplayPort-1 pos: 2-4 model: VA24D serial: <filter> built: 2020 res: 1920x1080 dpi: 93 gamma: 1.2 size: 527x296mm (20.75x11.65") diag: 604mm (23.8") ratio: 16:9 modes: max: 1920x1080 min: 720x400 Monitor-3: DP-3 mapped: DisplayPort-2 pos: 2-1 model: VA24D serial: <filter> built: 2020 res: 1920x1080 dpi: 93 gamma: 1.2 size: 527x296mm (20.75x11.65") diag: 604mm (23.8") ratio: 16:9 modes: max: 1920x1080 min: 720x400 Monitor-4: HDMI-A-1 mapped: HDMI-A-0 pos: 1-3 model: Philips 170B serial: <filter> built: 2005 res: 1280x1024 dpi: 96 gamma: 1.2 size: 338x270mm (13.31x10.63") diag: 433mm (17") ratio: 5:4 modes: max: 1280x1024 min: 720x400 API: EGL v: 1.5 hw: drv: amd radeonsi platforms: device: 0 drv: radeonsi device: 1 drv: swrast gbm: drv: kms_swrast surfaceless: drv: radeonsi x11: drv: radeonsi inactive: wayland API: OpenGL v: 4.6 compat-v: 4.5 vendor: amd mesa v: 24.1.3 glx-v: 1.4 direct-render: yes renderer: AMD Radeon RX 6600 XT (radeonsi navi23 LLVM 18.1.8 DRM 3.57 6.9.9-1-default) device-ID: 1002:73ff memory: 7.81 GiB unified: no API: Vulkan v: 1.3.290 layers: 5 device: 0 type: discrete-gpu name: AMD Radeon RX 6600 XT (RADV NAVI23) driver: N/A device-ID: 1002:73ff surfaces: xcb,xlib -- You are receiving this mail because: You are the assignee for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1229039 https://bugzilla.suse.com/show_bug.cgi?id=1229039#c5 --- Comment #5 from Egge <hpe@airpost.net> --- I noticed initrd was missing for 6.10.3 and grub.cfg was missing the lines echo 'Loading initial ramdisk ...' initrd /initrd-6.10.3-1-default Recreating initrd using dracut initrd-6.10.3-1-default and adding the two missing lines during boot (edit entry) made the boot go somewhat further, but still not booting completely/as expected. The last on the screen now is [ OK ] Reached target Basic System. and the keyboard is unresponsive, as in no capslock or numlock led toggle or crtl+alt+del having any effect. -- You are receiving this mail because: You are the assignee for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1229039 https://bugzilla.suse.com/show_bug.cgi?id=1229039#c6 --- Comment #6 from Erez Shomron <opensuse@erezsh.org> --- (In reply to Egge from comment #4)
I am also having this issue, and I am not using zfs. Did a
sudo zypper dup --no-allow-vendor-change
and at the next boot the issue appeared. I have not done any config/boot changes myself. Booting without the quiet and splash kernel-parameters, the last line shown is
[1.406198][T186] clocksource: Switched to clocksource tsc
then it freezes and after a little while it reboots (possibly after 1 min as mentioned, have not timed it either). Booting with only the root kernel-parameter makes no difference.
MB: Asrock x570 pro4 CPU: AMD Ryzen 7 3800X 8-Core Processor
inxi -GSaz System: Kernel: 6.9.9-1-default arch: x86_64 bits: 64 compiler: gcc v: 13.3.0 clocksource: tsc avail: hpet,acpi_pm parameters: BOOT_IMAGE=/vmlinuz-6.9.9-1-default root=UUID=bdf30380-f75e-4158-89c4-427325017708 quiet splash=silent rd.driver.pre=vfio-pci vfio-pci.ids=1002:6811,1002:aab0,1912:0014 mitigations=auto Desktop: KDE Plasma v: 6.1.4 tk: Qt v: N/A info: frameworks v: 6.4.0 wm: kwin_x11 tools: avail: xscreensaver vt: 2 dm: SDDM Distro: openSUSE Tumbleweed 20240808 Graphics: Device-1: AMD Curacao PRO [Radeon R7 370 / R9 270/370 OEM] vendor: Micro-Star MSI driver: vfio-pci v: N/A alternate: radeon,amdgpu arch: GCN-1 code: Southern Islands process: TSMC 28nm built: 2011-20 pcie: gen: 3 speed: 8 GT/s lanes: 4 link-max: lanes: 16 bus-ID: 05:00.0 chip-ID: 1002:6811 class-ID: 0300 Device-2: AMD Navi 23 [Radeon RX 6600/6600 XT/6600M] vendor: ASUSTeK driver: amdgpu v: kernel arch: RDNA-2 code: Navi-2x process: TSMC n7 (7nm) built: 2020-22 pcie: gen: 4 speed: 16 GT/s lanes: 16 ports: active: DP-1, DP-2, DP-3, HDMI-A-1 empty: Writeback-1 bus-ID: 0e:00.0 chip-ID: 1002:73ff class-ID: 0300 Display: x11 server: X.Org v: 21.1.12 with: Xwayland v: 24.1.1 compositor: kwin_x11 driver: X: loaded: amdgpu dri: radeonsi gpu: amdgpu display-ID: :0 screens: 1 Screen-1: 0 s-res: 5760x2104 s-dpi: 96 s-size: 1520x555mm (59.84x21.85") s-diag: 1618mm (63.71") Monitor-1: DP-1 mapped: DisplayPort-0 pos: primary,2-2 model: HP 25x serial: <filter> built: 2019 res: 1920x1080 dpi: 90 gamma: 1.2 size: 544x303mm (21.42x11.93") diag: 623mm (24.5") ratio: 16:9 modes: max: 1920x1080 min: 720x400 Monitor-2: DP-2 mapped: DisplayPort-1 pos: 2-4 model: VA24D serial: <filter> built: 2020 res: 1920x1080 dpi: 93 gamma: 1.2 size: 527x296mm (20.75x11.65") diag: 604mm (23.8") ratio: 16:9 modes: max: 1920x1080 min: 720x400 Monitor-3: DP-3 mapped: DisplayPort-2 pos: 2-1 model: VA24D serial: <filter> built: 2020 res: 1920x1080 dpi: 93 gamma: 1.2 size: 527x296mm (20.75x11.65") diag: 604mm (23.8") ratio: 16:9 modes: max: 1920x1080 min: 720x400 Monitor-4: HDMI-A-1 mapped: HDMI-A-0 pos: 1-3 model: Philips 170B serial: <filter> built: 2005 res: 1280x1024 dpi: 96 gamma: 1.2 size: 338x270mm (13.31x10.63") diag: 433mm (17") ratio: 5:4 modes: max: 1280x1024 min: 720x400 API: EGL v: 1.5 hw: drv: amd radeonsi platforms: device: 0 drv: radeonsi device: 1 drv: swrast gbm: drv: kms_swrast surfaceless: drv: radeonsi x11: drv: radeonsi inactive: wayland API: OpenGL v: 4.6 compat-v: 4.5 vendor: amd mesa v: 24.1.3 glx-v: 1.4 direct-render: yes renderer: AMD Radeon RX 6600 XT (radeonsi navi23 LLVM 18.1.8 DRM 3.57 6.9.9-1-default) device-ID: 1002:73ff memory: 7.81 GiB unified: no API: Vulkan v: 1.3.290 layers: 5 device: 0 type: discrete-gpu name: AMD Radeon RX 6600 XT (RADV NAVI23) driver: N/A device-ID: 1002:73ff surfaces: xcb,xlib
Do you have the initrd for 6.10.3-1 in /boot? If not, I think it's better to share your info in https://bugzilla.opensuse.org/show_bug.cgi?id=1228929 My issue's root cause is for sure a duplicate of ZFS kernel module de-sync bug, which is not officially supported. -- You are receiving this mail because: You are the assignee for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1229039 https://bugzilla.suse.com/show_bug.cgi?id=1229039#c7 --- Comment #7 from Egge <hpe@airpost.net> --- I did not have a initrd for 6.10.3 in boot after "dup", but recreating it did not help. But what seems to have worked was to reinstall ("force update") the 6.10.3 kernel, and I am now running 6.10.3. So the cause seems to have been incomplete kernel installation during "dup". -- You are receiving this mail because: You are the assignee for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1229039 https://bugzilla.suse.com/show_bug.cgi?id=1229039#c9 --- Comment #9 from Erez Shomron <opensuse@erezsh.org> --- No, the latest ZFS kernel module provided by filesystems repo is compiled for the latest kernel provided by openSUSE for TW. But this issue will reproduce every time filesystems repo falls behind the latest snapshot's kernel. However as I've been told it's not officially supported. -- You are receiving this mail because: You are the assignee for the bug.
participants (1)
-
bugzilla_noreply@suse.com