[kernel-bugs] [Bug 1175589] New: latest kernel update gives backtrace on boot on Huawei TaiShan 2280
https://bugzilla.suse.com/show_bug.cgi?id=1175589

Bug ID: 1175589
Summary: latest kernel update gives backtrace on boot on Huawei TaiShan 2280
Classification: openSUSE
Product: openSUSE Distribution
Version: Leap 15.2
Hardware: aarch64
OS: Other
Status: NEW
Severity: Major
Priority: P5 - None
Component: Kernel
Assignee: kernel-bugs@opensuse.org
Reporter: ro@suse.com
QA Contact: qa-bugs@suse.de
Found By: ---
Blocker: ---

System Information
Manufacturer: Huawei
Product Name: TaiShan 2280
Version: V100R001C00

Base Board Information
Manufacturer: Huawei
Product Name: BC11SPCD
Version: VER.A

last working kernel:
obs-arm-6:~ # uname -a
Linux obs-arm-6 5.3.18-lp152.33-default #1 SMP Wed Jul 22 06:32:33 UTC 2020 (e5a8383) aarch64 aarch64 aarch64 GNU/Linux

crashing kernel: 5.3.18-lp152.36-default

[ 198.659862] Unable to handle kernel NULL pointer dereference at virtual address 0000000000000010
[ 198.678721] Mem abort info:
[ 198.681504] ESR = 0x96000004
[ 198.681506] Exception class = DABT (current EL), IL = 32 bits
[ 198.681507] SET = 0, FnV = 0
[ 198.681508] EA = 0, S1PTW = 0
[ 198.681508] Data abort info:
[ 198.681511] ISV = 0, ISS = 0x00000004
[ 198.684652] audit: type=1400 audit(1598003387.680:9): apparmor="STATUS" operation="profile_load" profile="unconfined" name="/usr/lib/nagios/plugins/check_ntp_time" pid=6829 comm="apparmor_parser"
[ 198.690461] CM = 0, WnR = 0
[ 198.690463] user pgtable: 4k pages, 48-bit VAs, pgdp=0000002fb4588000
[ 198.690464] [0000000000000010] pgd=0000000000000000
[ 198.690468] Internal error: Oops: 96000004 [#1] SMP
[ 198.690470] Modules linked in: joydev sbsa_gwdt hns_roce_hw_v1(+) ib_uverbs ipmi_si(+) ib_core ipmi_devintf ipmi_msghandler efivarfs ext4 mbcache jbd2 fuse squashfs lz4_decompress loop brd af_packet ses enclosure sd_mod hid_generic usbhid marvell aes_ce_blk hibmc_drm crypto_simd drm_vram_helper ttm cryptd drm_kms_helper aes_ce_cipher syscopyarea crct10dif_ce ehci_platform ghash_ce sysfillrect ehci_hcd hisi_sas_v2_hw aes_arm64 sysimgblt hisi_sas_main fb_sys_fops sha2_ce libsas sha256_arm64 drm scsi_transport_sas sha1_ce usbcore libata hns_dsaf hns_enet_drv hns_mdio hnae sg nbd dm_multipath dm_mod scsi_dh_rdac scsi_dh_emc scsi_dh_alua scsi_mod
[ 198.796596] CPU: 20 PID: 6588 Comm: systemd-udevd Not tainted 5.3.18-lp152.36-default #1 openSUSE Leap 15.2 (unreleased)
[ 198.796597] Hardware name: Huawei TaiShan 2280 /BC11SPCD, BIOS 1.27 06/13/2017
[ 198.796600] pstate: 60000005 (nZCv daif -PAN -UAO)
[ 198.819443] pc : hns_roce_create_cq+0x2c/0x810 [hns_roce_hw_v1]
[ 198.821744] audit: type=1400 audit(1598003387.820:10): apparmor="STATUS" operation="profile_load" profile="unconfined" name="/usr/lib/nagios/plugins/check_procs" pid=6848 comm="apparmor_parser"
[ 198.825353] lr : hns_roce_v1_rsv_lp_qp+0xa8/0x4f8 [hns_roce_hw_v1]
[ 198.848701] sp : ffff0000239035c0
[ 198.852001] x29: ffff0000239035c0 x28: ffff802fbad92000
[ 198.857299] x27: 0000000000000010 x26: ffff802f98eee200
[ 198.862597] x25: ffff80203cbd2010 x24: ffff802fb3b71900
[ 198.867895] x23: ffff80203cbd2010 x22: ffff80203cbd2010
[ 198.872967] audit: type=1400 audit(1598003387.870:11): apparmor="STATUS" operation="profile_load" profile="unconfined" name="/usr/lib/nagios/plugins/check_swap" pid=6858 comm="apparmor_parser"
[ 198.873193] x21: ffff000011679000 x20: 0000000000000000
[ 198.895591] x19: ffff000023903708 x18: ffffffffffffffff
[ 198.900888] x17: 0000000000000000 x16: ffff802fb4b93d00
[ 198.906186] x15: 00000000000019b8 x14: 000000000000058a
[ 198.911483] x13: 0000000000000003 x12: ffffffffffffffff
[ 198.916781] x11: 0000000000000040 x10: ffffff7fff7fff7f
[ 198.922078] x9 : a6c8de5e8022adff x8 : 0000000000000000
[ 198.927376] x7 : ffffffffffffffff x6 : ffff802fb2b59eb8
[ 198.932673] x5 : ffff802fb2b59eb8 x4 : 0000000000000400
[ 198.937971] x3 : 0000000000000000 x2 : 0000000000000000
[ 198.943268] x1 : ffff000023903708 x0 : ffff0000092ec808
[ 198.948566] Call trace:
[ 198.951005] hns_roce_create_cq+0x2c/0x810 [hns_roce_hw_v1]
[ 198.956566] hns_roce_v1_rsv_lp_qp+0xa8/0x4f8 [hns_roce_hw_v1]
[ 198.962388] hns_roce_v1_init+0x584/0x8b8 [hns_roce_hw_v1]
[ 198.967863] hns_roce_init+0x2b4/0xb08 [hns_roce_hw_v1]
[ 198.973077] hns_roce_probe+0x328/0x478 [hns_roce_hw_v1]
[ 198.978382] platform_drv_probe+0x58/0xa8
[ 198.982378] really_probe+0xdc/0x448
[ 198.985940] driver_probe_device+0x12c/0x148
[ 198.990196] device_driver_attach+0x74/0x98
[ 198.994366] __driver_attach+0x6c/0x168
[ 198.998188] bus_for_each_dev+0x84/0xd8
[ 199.002010] driver_attach+0x30/0x40
[ 199.005572] bus_add_driver+0x170/0x258
[ 199.009394] driver_register+0x64/0x118
[ 199.013217] __platform_driver_register+0x54/0x60
[ 199.017911] hns_roce_driver_init+0x24/0x1000 [hns_roce_hw_v1]
[ 199.023731] do_one_initcall+0x54/0x240
[ 199.027554] do_init_module+0x60/0x1f0
[ 199.031289] load_module+0x1614/0x1718
[ 199.035024] __se_sys_finit_module+0xf8/0x110
[ 199.039367] __arm64_sys_finit_module+0x24/0x30
[ 199.043885] el0_svc_common.constprop.0+0xa0/0x1f8
[ 199.048662] el0_svc_handler+0x34/0x90
[ 199.052397] el0_svc+0x10/0x14
[ 199.055442] Code: aa0203f4 aa1e03e0 d503201f b0041c15 (f940037c)
[ 199.061522] ---[ end trace 6041bb51aa4731d0 ]---

--
You are receiving this mail because:
You are the assignee for the bug.
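
The "Mem abort info" lines in the oops can be cross-checked by decoding the ESR value by hand. The following is a minimal sketch, assuming the ESR_ELx field layout from the Arm architecture reference (EC in bits 31:26, IL in bit 25, ISS in bits 24:0, and for data aborts CM in bit 8, WnR in bit 6, DFSC in bits 5:0). The function name and the lookup tables are illustrative only and cover just the values relevant to this report.

```python
# Hypothetical helper for illustration; not part of any kernel tooling.
EC_NAMES = {
    0x24: "DABT (lower EL)",
    0x25: "DABT (current EL)",
}

# Only the translation-fault status codes are listed here.
DFSC_NAMES = {
    0x04: "translation fault, level 0",
    0x05: "translation fault, level 1",
    0x06: "translation fault, level 2",
    0x07: "translation fault, level 3",
}


def decode_esr(esr):
    """Split an AArch64 ESR value into the fields printed in the oops."""
    ec = (esr >> 26) & 0x3F       # exception class
    il = (esr >> 25) & 0x1        # 1 = 32-bit instruction
    iss = esr & 0x1FFFFFF         # instruction specific syndrome
    dfsc = iss & 0x3F             # data fault status code
    return {
        "ec": ec,
        "ec_name": EC_NAMES.get(ec, "unknown"),
        "il": il,
        "iss": iss,
        "isv": (iss >> 24) & 0x1,
        "cm": (iss >> 8) & 0x1,
        "wnr": (iss >> 6) & 0x1,  # 0 = read, 1 = write
        "dfsc": dfsc,
        "dfsc_name": DFSC_NAMES.get(dfsc, "unknown"),
    }


if __name__ == "__main__":
    for key, value in decode_esr(0x96000004).items():
        print(key, value)
```

For ESR = 0x96000004 this yields a data abort at the current exception level caused by a read (WnR = 0) that hit a level 0 translation fault, which agrees with the `pgd=0000000000000000` line in the log.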
https://bugzilla.suse.com/show_bug.cgi?id=1175589
https://bugzilla.suse.com/show_bug.cgi?id=1175589#c3

Takashi Iwai <tiwai@suse.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
              Flags|                            |needinfo?(ro@suse.com)

--- Comment #3 from Takashi Iwai <tiwai@suse.com> ---
Does the problem still exist with the latest Leap 15.2 kernel?
https://bugzilla.suse.com/show_bug.cgi?id=1175589
https://bugzilla.suse.com/show_bug.cgi?id=1175589#c4

Ruediger Oertel <ro@suse.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
              Flags|needinfo?(ro@suse.com)      |

--- Comment #4 from Ruediger Oertel <ro@suse.com> ---
still seen on 5.3.18-lp152.41-default
still seen on 5.3.18-lp152.50-default

[ 199.952356] audit: type=1400 audit(1606901355.480:10): apparmor="STATUS" operation="profile_load" profile="unconfined" name="/usr/lib/nagios/plugins/check_procs" pid=7027 comm="apparmor_parser"
[ 199.979132] Unable to handle kernel NULL pointer dereference at virtual address 0000000000000010
[ 199.987957] Mem abort info:
[ 199.990745] ESR = 0x96000004
[ 199.993793] Exception class = DABT (current EL), IL = 32 bits
[ 199.999711] SET = 0, FnV = 0
[ 199.999712] EA = 0, S1PTW = 0
[ 199.999714] Data abort info:
[ 199.999715] ISV = 0, ISS = 0x00000004
[ 199.999719] CM = 0, WnR = 0
[ 200.007767] audit: type=1400 audit(1606901355.540:11): apparmor="STATUS" operation="profile_load" profile="unconfined" name="/usr/lib/nagios/plugins/check_swap" pid=7044 comm="apparmor_parser"
[ 200.008764] user pgtable: 4k pages, 48-bit VAs, pgdp=0000001f35b4a000
[ 200.039073] [0000000000000010] pgd=0000000000000000
[ 200.039082] Internal error: Oops: 96000004 [#1] SMP
[ 200.048810] Modules linked in: cppc_cpufreq(-) hns_roce_hw_v1(+) joydev ib_uverbs sbsa_gwdt efi_pstore ib_core ipmi_si(+) ipmi_devintf ipmi_msghandler efivarfs ext4 mbcache jbd2 fuse squashfs lz4_decompress loop brd af_packet ses sd_mod enclosure hid_generic usbhid marvell aes_ce_blk crypto_simd hibmc_drm cryptd drm_vram_helper ttm aes_ce_cipher drm_kms_helper crct10dif_ce hisi_sas_v2_hw ehci_platform hisi_sas_main ghash_ce syscopyarea ehci_hcd aes_arm64 sysfillrect sha2_ce libsas sysimgblt fb_sys_fops sha256_arm64 sha1_ce scsi_transport_sas drm usbcore libata hns_dsaf hns_enet_drv hns_mdio hnae sg nbd dm_multipath dm_mod scsi_dh_rdac scsi_dh_emc scsi_dh_alua scsi_mod
[ 200.107551] ipmi_si hisi-lpc-ipmi.1.auto: IPMI message handler: Found new BMC (man_id: 0x0007db, prod_id: 0x0001, dev_id: 0x01)
[ 200.107967] CPU: 34 PID: 6693 Comm: systemd-udevd Not tainted 5.3.18-lp152.50-default #1 openSUSE Leap 15.2
[ 200.129151] Hardware name: Huawei TaiShan 2280 /BC11SPCD, BIOS 1.27 06/13/2017
[ 200.129153] pstate: 60000005 (nZCv daif -PAN -UAO)
[ 200.129166] pc : hns_roce_create_cq+0x2c/0x810 [hns_roce_hw_v1]
[ 200.129175] lr : hns_roce_v1_rsv_lp_qp+0xa8/0x4f8 [hns_roce_hw_v1]
[ 200.153223] sp : ffff000023b3b5c0
[ 200.156524] x29: ffff000023b3b5c0 x28: ffff841fb7b3c000
[ 200.161824] x27: 0000000000000010 x26: ffff841fb1b1d180
[ 200.167123] x25: ffff841fbb040c10 x24: ffff84103c8b3c00
[ 200.172422] x23: ffff841fbb040c10 x22: ffff841fbb040c10
[ 200.177720] x21: ffff000011699000 x20: 0000000000000000
[ 200.183018] x19: ffff000023b3b708 x18: ffffffffffffffff
[ 200.188317] x17: 0000000000000000 x16: ffff801fb4435b80
[ 200.193615] x15: 00000000000019b8 x14: ffff841fbb6236a0
[ 200.198914] x13: 0000000000000000 x12: ffffffffffffffff
[ 200.204212] x11: 0000000000000040 x10: ff7fffffffff7f7f
[ 200.209510] x9 : b74f94e2a9cd01ff x8 : 0000000000000000
[ 200.214808] x7 : ffffffffffffffff x6 : ffff841faf9cbd38
[ 200.220106] x5 : ffff841faf9cbd38 x4 : 0000000020000000
[ 200.225404] x3 : 0000000000000000 x2 : 0000000000000000
[ 200.230702] x1 : ffff000023b3b708 x0 : ffff000009333808
[ 200.236002] Call trace:
[ 200.238441] hns_roce_create_cq+0x2c/0x810 [hns_roce_hw_v1]
[ 200.244004] hns_roce_v1_rsv_lp_qp+0xa8/0x4f8 [hns_roce_hw_v1]
[ 200.249827] hns_roce_v1_init+0x584/0x8b8 [hns_roce_hw_v1]
[ 200.255302] hns_roce_init+0x2b4/0xb08 [hns_roce_hw_v1]
[ 200.260517] hns_roce_probe+0x328/0x478 [hns_roce_hw_v1]
[ 200.265827] platform_drv_probe+0x58/0xa8
[ 200.269824] really_probe+0xdc/0x448
[ 200.273387] driver_probe_device+0x12c/0x148
[ 200.277644] device_driver_attach+0x74/0x98
[ 200.281813] __driver_attach+0x6c/0x168
[ 200.285636] bus_for_each_dev+0x84/0xd8
[ 200.289459] driver_attach+0x30/0x40
[ 200.293021] bus_add_driver+0x170/0x258
[ 200.296844] driver_register+0x64/0x118
[ 200.300666] __platform_driver_register+0x54/0x60
[ 200.305360] hns_roce_driver_init+0x24/0x1000 [hns_roce_hw_v1]
[ 200.311183] do_one_initcall+0x54/0x240
[ 200.315007] do_init_module+0x60/0x1f8
[ 200.318743] load_module+0x1614/0x1718
[ 200.322478] __se_sys_finit_module+0xf8/0x110
[ 200.326822] __arm64_sys_finit_module+0x24/0x30
[ 200.331341] el0_svc_common.constprop.0+0xa0/0x1f8
[ 200.336118] el0_svc_handler+0x34/0x90
[ 200.339854] el0_svc+0x10/0x14
[ 200.342898] Code: aa0203f4 aa1e03e0 d503201f d0041ad5 (f940037c)
[ 200.348980] ---[ end trace df719e0eb72ffc81 ]---
https://bugzilla.suse.com/show_bug.cgi?id=1175589
https://bugzilla.suse.com/show_bug.cgi?id=1175589#c5

Takashi Iwai <tiwai@suse.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Assignee|kernel-bugs@opensuse.org    |tbogendoerfer@suse.com

--- Comment #5 from Takashi Iwai <tiwai@suse.com> ---
OK, thanks. Reassigned to Thomas, as this looks like a NULL dereference in IB hns stuff.
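
The NULL-dereference reading can be checked against the `Code:` line of both oopses, where the word in parentheses, `f940037c`, is the instruction at the faulting pc. Below is a minimal sketch that decodes it, assuming the A64 encoding of `LDR Xt, [Xn, #imm]` (64-bit load, unsigned scaled offset: top ten bits `1111100101`, then imm12, Rn, Rt). The function name is hypothetical, and the decoder handles only this one instruction form.

```python
# Hypothetical helper for illustration; handles only the 64-bit
# "LDR Xt, [Xn, #imm]" unsigned-offset form.
def decode_ldr_x_uimm(insn):
    """Return (rt, rn, byte_offset) for LDR Xt, [Xn, #imm]."""
    if (insn >> 22) != 0b1111100101:
        raise ValueError("not a 64-bit LDR with unsigned offset")
    imm12 = (insn >> 10) & 0xFFF  # offset in units of 8 bytes
    rn = (insn >> 5) & 0x1F       # base register (31 would mean SP)
    rt = insn & 0x1F              # destination register
    return rt, rn, imm12 * 8


if __name__ == "__main__":
    rt, rn, offset = decode_ldr_x_uimm(0xF940037C)
    print(f"ldr x{rt}, [x{rn}, #{offset}]")
```

The result is a load through x27 with offset 0, and both register dumps show x27 = 0000000000000010, the same value as the reported fault address. One plausible reading, not confirmed anywhere in this thread, is that x27 was computed earlier as a NULL base pointer plus 0x10.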