[kernel-bugs] [Bug 1173860] New: kernel-5.3.18-lp152.20.7.1 crash at boot with permissions violation
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860 Bug ID: 1173860 Summary: kernel-5.3.18-lp152.20.7.1 crash at boot with permissions violation Classification: openSUSE Product: openSUSE Distribution Version: Leap 15.2 Hardware: x86-64 OS: Other Status: NEW Severity: Major Priority: P5 - None Component: Kernel Assignee: kernel-bugs@opensuse.org Reporter: martin.tessun@die-tessuns.de QA Contact: qa-bugs@suse.de Found By: --- Blocker: --- After Updating to the new kernel (5.3.18-lp152.20.7.1) my system does no longer boot. The kernel "hangs" while flooding the console with page fault issues. kernel-5.3.18-lp152.19.2 works perfectly fine. As the issue happens in early stages, I do not have a dump or any logs - just some "tries" of reading the screen. System details: - UEFI - Secure Boot disabled - Manufacturer: ASRock - Product Name: J3455-ITX BIOS Information Vendor: American Megatrends Inc. Version: P1.60 Release Date: 01/16/2018 Address: 0xF0000 Runtime Size: 64 kB ROM Size: 8192 kB Characteristics: PCI is supported BIOS is upgradeable BIOS shadowing is allowed Boot from CD is supported Selectable boot is supported BIOS ROM is socketed EDD is supported 5.25"/1.2 MB floppy services are supported (int 13h) 3.5"/720 kB floppy services are supported (int 13h) 3.5"/2.88 MB floppy services are supported (int 13h) Print screen service is supported (int 5h) 8042 keyboard services are supported (int 9h) Serial services are supported (int 14h) Printer services are supported (int 17h) ACPI is supported USB legacy is supported BIOS boot specification is supported Targeted content distribution is supported UEFI is supported BIOS Revision: 5.12 Unfortunately the system is completely unresponsible so some fragments for the recurring kernel OOPS: Fixing recursive fault, but reboot is needed! BUG: Unable to handle page fault for address: 0x.... #PF: supervisot write access in kernel mode #PF: error_code(0x0003) - permissions violation Oops: 0003 [#8635] SMP NOPTI CPU 0 PID: 495 Comm: systemd-udevd Tainted: ... Hardware name: To be filled by O.E.M RIP: 0010:blk_flush_plug_list+0x6c/0xf0 Some dmesg from the working kernel (5.3.18-lp152.19.2): [ 0.000000] microcode: microcode updated early to revision 0x38, date = 2019-01-15 [ 0.000000] Linux version 5.3.18-lp152.19-default (geeko@buildhost) (gcc version 7.5.0 (SUSE Linux)) #1 SMP Tue Jun 9 20:59:24 UTC 2020 (960cb00) [ 0.000000] Command line: BOOT_IMAGE=/vmlinuz-5.3.18-lp152.19-default root=/dev/mapper/system-root splash=silent resume=/dev/mapper/system-swap quiet mitigations=auto [ 0.000000] x86/fpu: Supporting XSAVE feature 0x001: 'x87 floating point registers' [ 0.000000] x86/fpu: Supporting XSAVE feature 0x002: 'SSE registers' [ 0.000000] x86/fpu: Supporting XSAVE feature 0x008: 'MPX bounds registers' [ 0.000000] x86/fpu: Supporting XSAVE feature 0x010: 'MPX CSR' [ 0.000000] x86/fpu: xstate_offset[3]: 576, xstate_sizes[3]: 64 [ 0.000000] x86/fpu: xstate_offset[4]: 640, xstate_sizes[4]: 64 [ 0.000000] x86/fpu: Enabled xstate features 0x1b, context size is 704 bytes, using 'compacted' format. [...] [ 0.000000] NX (Execute Disable) protection: active [ 0.000000] e820: update [mem 0x643e2018-0x643f2057] usable ==> usable [ 0.000000] e820: update [mem 0x643e2018-0x643f2057] usable ==> usable [ 0.000000] e820: update [mem 0x643d3018-0x643e1057] usable ==> usable [ 0.000000] e820: update [mem 0x643d3018-0x643e1057] usable ==> usable [ 0.000000] extended physical RAM map: [...] [ 0.000000] efi: EFI v2.50 by American Megatrends [ 0.000000] efi: TPMFinalLog=0x6d9bd000 ESRT=0x6dd82918 ACPI=0x6d99e000 ACPI 2.0=0x6d99e000 SMBIOS=0x6dc4f000 SMBIOS 3.0=0x6dc4e000 TPMEventLog=0x643f3018 [ 0.000000] secureboot: Secure boot disabled [ 0.000000] SMBIOS 3.0.0 present. [ 0.000000] DMI: To Be Filled By O.E.M. To Be Filled By O.E.M./J3455-ITX, BIOS P1.60 01/16/2018 [ 0.000000] tsc: Detected 1497.600 MHz processor [ 0.000029] e820: update [mem 0x00000000-0x00000fff] usable ==> reserved [ 0.000033] e820: remove [mem 0x000a0000-0x000fffff] usable [ 0.000050] last_pfn = 0x480000 max_arch_pfn = 0x400000000 [ 0.000056] MTRR default type: uncachable [ 0.000058] MTRR fixed ranges enabled: [ 0.000060] 00000-6FFFF write-back [ 0.000062] 70000-7FFFF uncachable [ 0.000064] 80000-9FFFF write-back [ 0.000065] A0000-BFFFF uncachable [ 0.000067] C0000-FFFFF write-protect [ 0.000069] MTRR variable ranges enabled: [...] [ 0.000179] x86/PAT: Configuration [0-7]: WB WC UC- UC WB WP UC- WT [ 0.000255] last_pfn = 0x6f000 max_arch_pfn = 0x400000000 [ 0.007627] esrt: Reserving ESRT space from 0x000000006dd82918 to 0x000000006dd82950. [ 0.007646] check: Scanning 1 areas for low memory corruption [ 0.007655] Using GB pages for direct mapping [ 0.007659] BRK [0x2fe001000, 0x2fe001fff] PGTABLE [ 0.007663] BRK [0x2fe002000, 0x2fe002fff] PGTABLE [ 0.007666] BRK [0x2fe003000, 0x2fe003fff] PGTABLE [ 0.007785] BRK [0x2fe004000, 0x2fe004fff] PGTABLE [ 0.008108] BRK [0x2fe005000, 0x2fe005fff] PGTABLE [ 0.008208] BRK [0x2fe006000, 0x2fe006fff] PGTABLE [ 0.008334] BRK [0x2fe007000, 0x2fe007fff] PGTABLE [ 0.008584] BRK [0x2fe008000, 0x2fe008fff] PGTABLE [ 0.008726] BRK [0x2fe009000, 0x2fe009fff] PGTABLE [ 0.008935] BRK [0x2fe00a000, 0x2fe00afff] PGTABLE [ 0.009292] BRK [0x2fe00b000, 0x2fe00bfff] PGTABLE [ 0.009716] secureboot: Secure boot disabled [ 0.009718] RAMDISK: [mem 0x3efc7000-0x3fffafff] [ 0.009734] ACPI: Early table checksum verification disabled [...] [ 0.010163] No NUMA configuration found [ 0.010165] Faking a node at [mem 0x0000000000000000-0x000000047fffffff] [ 0.010182] NODE_DATA(0) allocated [mem 0x47ffea000-0x47fffffff] [ 0.010286] Zone ranges: [ 0.010288] DMA [mem 0x0000000000001000-0x0000000000ffffff] [ 0.010291] DMA32 [mem 0x0000000001000000-0x00000000ffffffff] [ 0.010293] Normal [mem 0x0000000100000000-0x000000047fffffff] [ 0.010295] Device empty [ 0.010297] Movable zone start for each node [ 0.010300] Early memory node ranges [...] [ 0.010816] Zeroed struct page in unavailable ranges: 22339 pages [ 0.010820] Initmem setup node 0 [mem 0x0000000000001000-0x000000047fffffff] [ 0.010824] On node 0 totalpages: 4106429 [ 0.010827] DMA zone: 64 pages used for memmap [ 0.010828] DMA zone: 21 pages reserved [ 0.010830] DMA zone: 3996 pages, LIFO batch:0 [ 0.011007] DMA32 zone: 6757 pages used for memmap [ 0.011008] DMA32 zone: 432417 pages, LIFO batch:63 [ 0.039794] Normal zone: 57344 pages used for memmap [ 0.039795] Normal zone: 3670016 pages, LIFO batch:63 [ 0.041225] Reserving Intel graphics memory at [mem 0x70000000-0x7fffffff] [ 0.041456] ACPI: PM-Timer IO Port: 0x408 [ 0.041460] ACPI: Local APIC address 0xfee00000 [ 0.041474] ACPI: LAPIC_NMI (acpi_id[0x01] high level lint[0x1]) [ 0.041476] ACPI: LAPIC_NMI (acpi_id[0x02] high level lint[0x1]) [ 0.041478] ACPI: LAPIC_NMI (acpi_id[0x03] high level lint[0x1]) [ 0.041479] ACPI: LAPIC_NMI (acpi_id[0x04] high level lint[0x1]) [ 0.041518] IOAPIC[0]: apic_id 1, version 32, address 0xfec00000, GSI 0-119 [ 0.041522] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl) [ 0.041525] ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 low level) [ 0.041528] ACPI: IRQ0 used by override. [ 0.041530] ACPI: IRQ9 used by override. [ 0.041533] Using ACPI (MADT) for SMP configuration information [ 0.041536] ACPI: HPET id: 0x8086a701 base: 0xfed00000 [ 0.041558] smpboot: Allowing 4 CPUs, 0 hotplug CPUs [...] [ 0.041696] Booting paravirtualized kernel on bare hardware [ 0.041702] clocksource: refined-jiffies: mask: 0xffffffff max_cycles: 0xffffffff, max_idle_ns: 7645519600211568 ns [ 0.300440] setup_percpu: NR_CPUS:512 nr_cpumask_bits:512 nr_cpu_ids:4 nr_node_ids:1 [ 0.301012] percpu: Embedded 56 pages/cpu s192512 r8192 d28672 u524288 [ 0.301031] pcpu-alloc: s192512 r8192 d28672 u524288 alloc=1*2097152 [ 0.301033] pcpu-alloc: [0] 0 1 2 3 [ 0.301096] Built 1 zonelists, mobility grouping on. Total pages: 4042243 [ 0.301098] Policy zone: Normal [ 0.301102] Kernel command line: BOOT_IMAGE=/vmlinuz-5.3.18-lp152.19-default root=/dev/mapper/system-root splash=silent resume=/dev/mapper/system-swap quiet mitigations=auto [...] [ 0.412628] Last level iTLB entries: 4KB 48, 2MB 0, 4MB 0 [ 0.412630] Last level dTLB entries: 4KB 0, 2MB 0, 4MB 0, 1GB 0 [ 0.412637] Spectre V1 : Mitigation: usercopy/swapgs barriers and __user pointer sanitization [ 0.412640] Spectre V2 : Mitigation: Full generic retpoline [ 0.412641] Spectre V2 : Spectre v2 / SpectreRSB mitigation: Filling RSB on context switch [ 0.412642] Spectre V2 : Enabling Restricted Speculation for firmware calls [ 0.412645] Spectre V2 : mitigation: Enabling conditional Indirect Branch Prediction Barrier [ 0.413137] Freeing SMP alternatives memory: 36K [ 0.416087] TSC deadline timer enabled [ 0.416096] smpboot: CPU0: Intel(R) Celeron(R) CPU J3455 @ 1.50GHz (family: 0x6, model: 0x5c, stepping: 0x9) [...] [ 0.417097] smp: Bringing up secondary CPUs ... [ 0.417097] x86: Booting SMP configuration: [ 0.417097] .... node #0, CPUs: #1 #2 #3 [ 0.417793] smp: Brought up 1 node, 4 CPUs [ 0.417793] smpboot: Max logical packages: 1 [ 0.417793] smpboot: Total of 4 processors activated (11980.80 BogoMIPS) [ 0.533128] node 0 initialised, 3555943 pages in 116ms [ 0.537100] devtmpfs: initialized [ 0.537239] x86/mm: Memory block size: 128MB [ 0.537808] PM: Registering ACPI NVS region [mem 0x6d99e000-0x6d9d4fff] (225280 bytes) [ 0.537808] PM: Registering ACPI NVS region [mem 0x6e155000-0x6e155fff] (4096 bytes) [ 0.537808] clocksource: jiffies: mask: 0xffffffff max_cycles: 0xffffffff, max_idle_ns: 7645041785100000 ns [ 0.537808] futex hash table entries: 1024 (order: 4, 65536 bytes, linear) [ 0.537808] pinctrl core: initialized pinctrl subsystem [ 0.537808] PM: RTC time: 17:35:41, date: 2020-07-07 [ 0.541102] NET: Registered protocol family 16 -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c1
--- Comment #1 from Martin Tessun
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c3
Matwey Kornilov
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c5
--- Comment #5 from Takashi Iwai
I have the same issue. It was a great surprise to me this morning.
Could you get Oops message including the full stack trace? Also testing KOTD would be helpful. -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c6
--- Comment #6 from Matwey Kornilov
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c7
--- Comment #7 from Martin Tessun
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c8
--- Comment #8 from Takashi Iwai
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c9
--- Comment #9 from Matwey Kornilov
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c10
--- Comment #10 from Takashi Iwai
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c11
Martin Tessun
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c12
--- Comment #12 from Matwey Kornilov
You can try to pass nomodeset boot option. This will disable the native graphics. Also, try to remove quiet and splash=silent boot options.
Well, I'll try netconsole, since nomodeset is still to fast for my camera :-) -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c13
--- Comment #13 from Takashi Iwai
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c14
--- Comment #14 from Martin Tessun
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c15
--- Comment #15 from Martin Tessun
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c16
--- Comment #16 from Martin Tessun
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c17
--- Comment #17 from Matwey Kornilov
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c18
--- Comment #18 from Martin Tessun
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c19
--- Comment #19 from Matwey Kornilov
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c20
--- Comment #20 from Takashi Iwai
I've just triggered this in VM: try to create brand new LVM-based raid1 and it crashes at this point.
A great finding! Could you try to boot VM with serial console enabled and catch the crash traces? -- You are receiving this mail because: You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c21
--- Comment #21 from Martin Tessun
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c22
--- Comment #22 from Takashi Iwai
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c23
--- Comment #23 from Martin Tessun
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860
http://bugzilla.opensuse.org/show_bug.cgi?id=1173860#c24
Takashi Iwai
participants (1)
-
bugzilla_noreply@suse.com