[Bug 1202581] New: kernel 5.14.21-150400.24.18-default emits "soft lockup" errors; sometimes won't even boot

http://bugzilla.opensuse.org/show_bug.cgi?id=1202581

            Bug ID: 1202581
           Summary: kernel 5.14.21-150400.24.18-default emits "soft lockup" errors; sometimes won't even boot
    Classification: openSUSE
           Product: openSUSE Distribution
           Version: Leap 15.4
          Hardware: x86-64
                OS: Other
            Status: NEW
          Severity: Critical
          Priority: P5 - None
         Component: Kernel
          Assignee: kernel-bugs@opensuse.org
          Reporter: psychonaut@nothingisreal.com
        QA Contact: qa-bugs@suse.de
          Found By: ---
           Blocker: ---

Created attachment 860955
  --> http://bugzilla.opensuse.org/attachment.cgi?id=860955&action=edit
Output of hwinfo

Upon upgrading from Leap 15.3 to Leap 15.4, which uses 5.14.21-150400.24.18-default, I'm now getting a bunch of kernel log messages of the following form:

    watchdog: BUG: soft lockup - CPU#1 stuck for 212s! [swapper/1:0]
    rcu: INFO: rcu_preempt self-detected stall on CPU

The time value and the name of the process ("swapper" above) may vary.

Sometimes these messages appear on the console while the machine is starting up, and the machine never completes the bootup sequence -- I just see a continuous stream of "soft lockup" errors, a new one appearing every 30 seconds or so. (The messages don't even appear in the system log, apparently because they occur before all the disks get mounted.) Sometimes bootup completes normally and the messages appear later on.

Attached is the output of hwinfo, as well as a kernel log that shows one occurrence of the error message.

--
You are receiving this mail because:
You are on the CC list for the bug.
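For anyone triaging a log like the one attached, the varying fields in the watchdog message quoted above (CPU number, stuck time, task name) can be pulled out with a short script. The following is only a minimal sketch: the regular expression is derived from the single sample line in this report, and the function name `find_soft_lockups` is a hypothetical helper, not part of any kernel or distribution tooling.

```python
import re

# Pattern built from the one sample line quoted in the report (an assumption:
# other kernels or configurations may format the message differently).
SOFT_LOCKUP_RE = re.compile(
    r"watchdog: BUG: soft lockup - CPU#(?P<cpu>\d+) stuck for (?P<secs>\d+)s! "
    r"\[(?P<task>[^\]]*):(?P<pid>\d+)\]"
)


def find_soft_lockups(lines):
    """Return one dict per soft lockup message found in an iterable of log lines."""
    events = []
    for line in lines:
        match = SOFT_LOCKUP_RE.search(line)
        if match:
            events.append(
                {
                    "cpu": int(match["cpu"]),
                    "seconds": int(match["secs"]),
                    "task": match["task"],
                    "pid": int(match["pid"]),
                }
            )
    return events


if __name__ == "__main__":
    sample = [
        "watchdog: BUG: soft lockup - CPU#1 stuck for 212s! [swapper/1:0]",
        "rcu: INFO: rcu_preempt self-detected stall on CPU",
    ]
    for event in find_soft_lockups(sample):
        print(event)
```

Feeding it the lines of a saved kernel log (for example, a file opened with `open(path)`) gives a quick view of which CPUs and tasks are affected and how the stuck time grows.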

http://bugzilla.opensuse.org/show_bug.cgi?id=1202581
http://bugzilla.opensuse.org/show_bug.cgi?id=1202581#c1

--- Comment #1 from Tristan Miller <psychonaut@nothingisreal.com> ---
Created attachment 860956
  --> http://bugzilla.opensuse.org/attachment.cgi?id=860956&action=edit
Kernel log including a soft lockup error

http://bugzilla.opensuse.org/show_bug.cgi?id=1202581
http://bugzilla.opensuse.org/show_bug.cgi?id=1202581#c9

Tristan Miller <psychonaut@nothingisreal.com> changed:

           What    |Removed                                   |Added
----------------------------------------------------------------------------
              Flags|needinfo?(psychonaut@nothingisreal.com)   |

--- Comment #9 from Tristan Miller <psychonaut@nothingisreal.com> ---
Created attachment 861076
  --> http://bugzilla.opensuse.org/attachment.cgi?id=861076&action=edit
Log from kernel-default-5.14.21-150400.1.1.g42bb3f2

No, I'm afraid that kernel also produces soft lockup errors. Attached is a log.

http://bugzilla.opensuse.org/show_bug.cgi?id=1202581
http://bugzilla.opensuse.org/show_bug.cgi?id=1202581#c10

--- Comment #10 from Takashi Iwai <tiwai@suse.com> ---
Hm, then the i915 stack trace was a red herring, and something was already broken before that point. The new stack trace doesn't make sense, either.

Could you try the older Leap 15.4 kernel? There are at least two previous Leap releases (5.14.21-150400.22.1 and 5.14.21-150400.24.11.1).

http://bugzilla.opensuse.org/show_bug.cgi?id=1202581
http://bugzilla.opensuse.org/show_bug.cgi?id=1202581#c11

--- Comment #11 from Tristan Miller <psychonaut@nothingisreal.com> ---
I get soft lockup errors with 5.14.21-150400.24.11.1. I assume the same is true also of 5.14.21-150400.22.1, since the machine completely froze during boot with that kernel, before I could even press Escape to see the console messages.

http://bugzilla.opensuse.org/show_bug.cgi?id=1202581

Oliver Kurz <okurz@suse.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |okurz@suse.com