Bug ID 1231813
Summary [riscv64][Unmatched][regression] Hard lockup on reboot/poweroff with 6.11
Classification openSUSE
Product openSUSE Tumbleweed
Version Current
Hardware RISC-V
OS openSUSE Tumbleweed
Status NEW
Severity Normal
Priority P5 - None
Component Kernel
Assignee kernel-bugs@opensuse.org
Reporter afaerber@suse.com
QA Contact qa-bugs@suse.de
CC ddavis@suse.com, jeffm@suse.com, schwab@suse.com
Target Milestone ---
Found By ---
Blocker ---

Reboot and poweroff on the HiFive Unmatched board no longer work as of
snapshot 20241016/17 (they worked with 20240828; using its 6.10.5 kernel with
20241017 does not help). Instead the system runs into a hard CPU lockup kernel
panic:

[   56.329837][ T1379] sifive-serial 10010000.serial: BREAK detection not supported
[   57.664396][    T1] reboot: Power down
[   72.030230][    C3] watchdog: Watchdog detected hard LOCKUP on cpu 0
[   96.026864][    C2] watchdog: Watchdog detected hard LOCKUP on cpu 3
[   96.026873][    C2] Modules linked in: af_packet rfkill nf_tables libcrc32c
snd_hda_codec_hdmi snd_hda_intel mscc snd_intel_dspcfg macsec snd_hda_codec
nls_iso8859_1 nls_cp437 snd_hda_core vfat macb fat snd_hwdep phylink snd_pcm
ofpart of_mdio snd_timer fixed_phy fwnode_mdio libphy snd spi_nor ptp at24
soundcore da9063_onkey pps_core mtd uio_pdrv_genirq uio fuse dm_mod
nvme_fabrics nvme_keyring loop efi_pstore configfs nfnetlink dmi_sysfs
ip_tables x_tables ext4 mbcache jbd2 xe hid_apple hid_generic usbhid
apple_mfi_fastcharge xhci_pci mmc_block xhci_pci_renesas xhci_hcd
drm_ttm_helper ttm i2c_algo_bit gpu_sched drm_buddy video drm_suballoc_helper
drm_gpuvm mmc_spi crc_itu_t crc7 drm_exec of_mmc_spi usbcore drm_display_helper
mmc_core rtc_da9063 da9063_regulator nvme cec lm90 da9063 mfd_core regmap_i2c
rc_core nvme_core usb_common nvme_auth i2c_ocores pwm_sifive spi_sifive
efivarfs
[   96.027165][    C2] Kernel panic - not syncing: Hard LOCKUP
[   96.027178][    C2] CPU: 2 UID: 0 PID: 0 Comm: swapper/2 Tainted: G     U             6.11.3-1-default #1 openSUSE Tumbleweed 1400000003000000474e55005c3864d57907faa4
[   96.027195][    C2] Tainted: [U]=USER
[   96.027198][    C2] Hardware name: sifive SiFive HiFive Unmatched A00/SiFive HiFive Unmatched A00, BIOS 2024.10 10/01/2024
[   96.027204][    C2] Call Trace:
[   96.027211][    C2] [<ffffffff80006b4a>] dump_backtrace+0x1c/0x24
[   96.027236][    C2] [<ffffffff80a00178>] show_stack+0x2c/0x38
[   96.027251][    C2] [<ffffffff80a0cd7c>] dump_stack_lvl+0x50/0x72
[   96.027270][    C2] [<ffffffff80a0cdb2>] dump_stack+0x14/0x1c
[   96.027279][    C2] [<ffffffff80a00410>] panic+0x106/0x312
[   96.027288][    C2] [<ffffffff80022c0a>] add_taint+0x0/0x8c
[   96.027302][    C2] [<ffffffff80123204>] watchdog_hardlockup_check+0x170/0x1b4
[   96.027322][    C2] [<ffffffff80123ca8>] watchdog_buddy_check_hardlockup+0x4c/0x5c
[   96.027334][    C2] [<ffffffff80122e98>] watchdog_timer_fn+0x70/0x238
[   96.027344][    C2] [<ffffffff800d0540>] __hrtimer_run_queues+0xfe/0x240
[   96.027364][    C2] [<ffffffff800d133a>] hrtimer_interrupt+0xca/0x1d6
[   96.027374][    C2] [<ffffffff807b5e58>] riscv_timer_interrupt+0x3a/0x52
[   96.027387][    C2] [<ffffffff800a1e0a>] handle_percpu_devid_irq+0x8a/0x1c0
[   96.027403][    C2] [<ffffffff8009c6b0>] generic_handle_domain_irq+0x1c/0x2a
[   96.027423][    C2] [<ffffffff8055d908>] riscv_intc_irq+0x26/0x60
[   96.027441][    C2] [<ffffffff80a0ce20>] handle_riscv_irq+0x4c/0x76
[   96.027451][    C2] [<ffffffff80a1b106>] call_on_irq_stack+0x32/0x40
[   96.788253][    C2] SMP: failed to stop secondary CPUs 0,2-3
[   96.788264][    C2] Kernel Offset: 0x0 from 0xffffffff80000000

After a few minutes the board then reboots, even when poweroff was requested.

I've manually updated the two U-Boot partitions to the latest Factory version,
2024.10, with no improvement.
The taint is from xe.force_probe (bug #1231756); the lockup happens without it
as well.
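
For comparing runs, the watchdog reports can be pulled out of a saved console
log with a short script. This is only a sketch: the function name and the
sample text are made up for illustration, and it assumes the log keeps the
"[timestamp][caller]" prefix shown in this report, where a caller of the form
"C<n>" is taken to be the CPU that printed the message.

```python
import re

# Matches lines such as:
# [   72.030230][    C3] watchdog: Watchdog detected hard LOCKUP on cpu 0
# Assumption: the "[    C3]" field is the printk caller id, i.e. the CPU
# that printed the message.
LOCKUP_RE = re.compile(
    r"\[\s*(?P<time>\d+\.\d+)\]\[\s*C(?P<reporter>\d+)\]\s+"
    r"watchdog: Watchdog detected hard LOCKUP on cpu (?P<stuck>\d+)"
)


def find_hard_lockups(log_text):
    """Return (timestamp, reporting_cpu, stuck_cpu) tuples found in log_text."""
    return [
        (float(m.group("time")), int(m.group("reporter")), int(m.group("stuck")))
        for m in LOCKUP_RE.finditer(log_text)
    ]


# Sample lines copied from the log in this report.
SAMPLE = """\
[   57.664396][    T1] reboot: Power down
[   72.030230][    C3] watchdog: Watchdog detected hard LOCKUP on cpu 0
[   96.026864][    C2] watchdog: Watchdog detected hard LOCKUP on cpu 3
"""

for t, reporter, stuck in find_hard_lockups(SAMPLE):
    print(f"{t:.6f}s: CPU {reporter} reported CPU {stuck} as locked up")
```

The script only works on log lines that are not hard-wrapped in the middle of
the watchdog message.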

