[Bug 1166664] New: Boot stops at "Loading initial ramdisk ..."
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664 Bug ID: 1166664 Summary: Boot stops at "Loading initial ramdisk ..." Classification: openSUSE Product: openSUSE Distribution Version: Leap 15.2 Hardware: x86-64 OS: Other Status: NEW Severity: Normal Priority: P5 - None Component: Kernel Assignee: kernel-maintainers@forge.provo.novell.com Reporter: nwr10cst-oslnx@yahoo.com QA Contact: qa-bugs@suse.de Found By: --- Blocker: --- User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Firefox/68.0 Build Identifier: I updated to Build603.1 Upon rebooting, I see: Loading Linux-5.3.18-lp152.5-default ... Loading initial ramdisk ... And, after that, nothing happens. If I wait 2-3 minutes, the system resets and this repeats. Booting with the previous kernel (5.3.18-lp152.4-default) is fine. After booting to the good kernel, I tried running "mkinitrd". But that did not fix the problem. I'll note that I have Leap 15.2Beta on several other computers, and they all boot fine with this kernel. But that one computer will not boot this kernel. Reproducible: Always -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c1
Frank Kruger
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c2
--- Comment #2 from Frank Kruger
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c3
--- Comment #3 from Frank Kruger
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c4
--- Comment #4 from Frank Kruger
Created attachment 832797 [details] hwinfo output
FYI: Since I do not have access to Leap 15.2 beta right now, I have choosen the TW installation, but the machine is the same. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c5
Takashi Iwai
Booting with the previous kernel (5.3.18-lp152.4-default) is fine. After booting to the good kernel, I tried running "mkinitrd". But that did not fix the problem.
Could you boot with the previous running kernel, and get hwinfo, attach the output? Also give the dmesg output from the working kernel, too. Also, at best, if you get a screenshot of the kernel panic, take it, too. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c6
--- Comment #6 from Neil Rickert
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c7
Neil Rickert
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c8
--- Comment #8 from Neil Rickert
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c9
Takashi Iwai
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c10
--- Comment #10 from Neil Rickert
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c12
Neil Rickert
Could you test openSUSE-15.2 KOTD kernel?
I installed "vmlinuz-5.3.18-lp152.43.gb955dad-default" from that repo. Unfortunately, it does not boot -- exactly the same problem as reported in this bug. Likewise "vmlinuz-5.3.18-lp152.7-default" from today's update does not boot. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c13
Takashi Iwai
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
Frank Kruger
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c19
--- Comment #19 from Neil Rickert
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c21
--- Comment #21 from Neil Rickert
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c22
--- Comment #22 from Frank Kruger
Thanks.
Now I'm building two kernels, and I'd like to ask you flavors to test them.
First off, increase the number of installable kernels for keeping the more old kernels. Edit /etc/zypp/zypp.conf, and just add more entries in the line multiversions.kernel = xxx line. Currently only two old and one running kernel can be kept.
Then test the first kernel. It's being built in OBS home:tiwai:bsc1166664-p1 repo. This should appear later at
http://download.opensuse.org/repositories/home:/tiwai:/bsc1166664-p1/ standard/ once after the build finishes. This repo contains a kernel that should be equivalent with 5.3.18-lp152.5.x (aka SLE15-SP2 beta2). Please test this kernel at first, and check whether the boot fails (expected).
Does not boot at all.
If the boot with the kernel above fails, it's good, go step 2. The second kernel is in OBS home:tiwai:bsc1166664-p2 repo, will appear at
http://download.opensuse.org/repositories/home:/tiwai:/bsc1166664-p2/ standard/ Please give it a try and check the boot.
Boots fine! -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c23
--- Comment #23 from Neil Rickert
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c24
--- Comment #24 from Takashi Iwai
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c25
--- Comment #25 from Frank Kruger
Thanks for quick testing! This is a good step forward. It points that the regression is likely caused by the cpufreq patchset.
I'm building another test kernel in OBS home:tiwai:bsc1166664-p3 repo. This is based on the very latest openSUSE-15.2 git branch with the revert of suspicious patches. Please check whether this works later.
Your most recent kernel-default-5.3.18-lp152.1.1.g477312f.x86_64 works fine and solves the issue for me. Thx. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c26
Takashi Iwai
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c27
Neil Rickert
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c29
--- Comment #29 from Frank Kruger
(In reply to Frank Kruger from comment #14)
(In reply to Takashi Iwai from comment #13)
Thanks. So Neil's problem looks different from the known regression that was already fixed. Since it hadn't shown any kernel panic message, it might be a different issue from the bug in comment 2.
I still suspect that the issue is fixed for Frank. Frank, could you check openSUSE-15.2 KOTD?
Unfortunately, Leap 15.2 KOTD does not work for me either.
Which kernel version-release numbers did you test? The fix landed very recently, available since yesterday's build.
I am not sure whether this helps, but let me mention that
i) starting the KOTD kernel from Leap's grub leads to the observation described in comment #0;
ii) using Tumbleweed's grub I experience the kernel panic mentioned in comment #2;
iii) Leap 15.2 KOTD worked up to kernel-default-5.3.18-lp152.20.*
Could you give the full version-release numbers? It's not clear whether this is even newer or older than other release packages, unfortunately.
For the sake of completeness, there you are: kernel-default-5.3.18-lp152.20.1.gda3afd4.x86_64 -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
Takashi Iwai
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c30
--- Comment #30 from Giovanni Gherdovich
Thanks again for a quick test.
So, Giovanni, it's a regression by your patch set. The positive result was by the revert of the whole patches: patches.suse/x86-intel_pstate-Handle-runtime-turbo-disablement-en.patch
patches.suse/x86-sched-Add-support-for-frequency-invariance-on-ATOM.patch
patches.suse/x86-sched-Add-support-for-frequency-invariance-on-ATOM_GOLDMONT. patch patches.suse/x86-sched-Add-support-for-frequency-invariance-on-SK.patch patches.suse/x86-sched-Add-support-for-frequency-invariance-on-XE.patch patches.suse/x86-sched-Add-support-for-frequency-invariance.patch
Could you investigate?
Yes sorry, this slipped under my radar and got confused with other emails. It's definitely a problem with that patchset, I'm looking at it now and will update this thread as soon as possible. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c31
Giovanni Gherdovich
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c32
Giovanni Gherdovich
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c33
Neil Rickert
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c34
Frank Kruger
asking to Frank if he can share the "turbostat" output, setting the "need-info" flag. There your are, attached the output of turbostat --interval 1 sleep 0.
-- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c35
--- Comment #35 from Giovanni Gherdovich
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c36
Stakanov Schufter
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c37
--- Comment #37 from Giovanni Gherdovich
Created attachment 834900 [details] Turbostat output of a Thinkpad X201 Lenovo
As I was hit by this too, I join the output of turbostat --interval 1 sleep 0
for a Lenovo Thinkpad X201
Thank you very much Stakanov for adding info to this bug. Your report confirms that the root cause is correctly identified, i.e. this kernel is broken on machines with less than 4 physical cores. From your turbostat output I learn that your machine has two physical cores: cpu0: MSR_TURBO_RATIO_LIMIT: 0x00001416 20 * 133.3 = 2666.6 MHz max turbo 2 active cores 22 * 133.3 = 2933.3 MHz max turbo 1 active cores (that makes four threads with hyper-threading). Let me reiterate that I'm writing a fix for this and a test kernel will soon be available for you to verify that the problem is resolved. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c38
--- Comment #38 from Frank Kruger
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c39
Giovanni Gherdovich
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c40
Giovanni Gherdovich
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c41
--- Comment #41 from Frank Kruger
see comment 39, need-info from Frank as well.
I prefer not to be mentioned. By the way, why is this bug restricted to 5.3.18 and does not show up in the current TW/Kernel:stable version? -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c42
--- Comment #42 from Frank Kruger
Hello Neil and Frank,
the fix is ready and committed in the openSUSE 15.2 kernel tree. I'll let you know when a test build is available.
Do you mean kernel-default-5.3.18-lp152.54.1.ga52aaa2.x86_64 from Kernel:openSUSE-15.2? This does not boot either. After "Loading intial ramdisk" it hangs with a black screen (no messages at all). -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c43
Giovanni Gherdovich
(In reply to Giovanni Gherdovich from comment #39)
Hello Neil and Frank,
the fix is ready and committed in the openSUSE 15.2 kernel tree. I'll let you know when a test build is available.
Do you mean kernel-default-5.3.18-lp152.54.1.ga52aaa2.x86_64 from Kernel:openSUSE-15.2? This does not boot either. After "Loading intial ramdisk" it hangs with a black screen (no messages at all).
Yeah if I read that version tag correctly, it's from commit a52aaa2bcea0dca65a0459972c11ab86a800550b which should have the fix (the fix itself is 949f5c145e98b217570829dcff769d7dbecd9343 from https://github.com/SUSE/kernel-source/tree/openSUSE-15.2 ). I'm looking at it, thanks for testing. (In reply to Frank Kruger from comment #41)
(In reply to Giovanni Gherdovich from comment #40)
see comment 39, need-info from Frank as well.
I prefer not to be mentioned.
Noted. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c44
--- Comment #44 from Frank Kruger
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c45
--- Comment #45 from Frank Kruger
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c46
Neil Rickert
are you OK with being credited as ...
Yes, I'm okay with that. I'm used to spam. Actually the email address that I use is a Yahoo disposable address intended for dealing with spam. My normal email provider blocks all opensuse mailing lists as suspected spam. Using a Yahoo disposable address, at worst it will go to the spam folder. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c47
--- Comment #47 from Neil Rickert
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c48
--- Comment #48 from Frank Kruger
Let me summarize the current situation (at least for me): Kernel from Kernel:openSUSE-15.1 und TW/Kernel:stable work fine, the KOTD from Kernel:openSUSE-15.2 including your fix does not. The kernel from comment 24 without your patches works fine.
The aforementioned results were obtained by using Graphic Mode=Discrete. Changing it to "switchable" the KOTD (kernel-default-5.3.18-lp152.54.1.ga52aaa2.x86_64) boots fine. Any idea? -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c49
--- Comment #49 from Giovanni Gherdovich
(In reply to Frank Kruger from comment #44)
Let me summarize the current situation (at least for me): Kernel from Kernel:openSUSE-15.1 und TW/Kernel:stable work fine, the KOTD from Kernel:openSUSE-15.2 including your fix does not. The kernel from comment 24 without your patches works fine.
The aforementioned results were obtained by using Graphic Mode=Discrete. Changing it to "switchable" the KOTD (kernel-default-5.3.18-lp152.54.1.ga52aaa2.x86_64) boots fine. Any idea?
Uhm no, it doesn't ring any bell. It could be a separate issue. I'm adding Patrik Jakobsson and Thomas Zimmermann for the graphics part. Patrick, Thomas: Frank from above comment can't boot our latest Leap 15.2 KOTD on his 2010 "Acer Aspire 3820T" which has an Intel "Core i5-430M" cpu (Westmere) if he selects "Graphic Mode=Discrete" from the BIOS config menu. See the full hwinfo for that machine in comment 3. According to hwinfo, the graphics card shows up as an "ATI Park [Mobility Radeon HD 5430/5450/5470]". Would you have any idea as of why? For context: this bug started from a kernel panic in the function arch_scale_freq_tick from the kernel source file arch/x86/kernel/smpboot.c , and this new issue came up when verifying whether the fix works. It may or may not be a different problem. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
Giovanni Gherdovich
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c50
--- Comment #50 from Frank Kruger
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c51
--- Comment #51 from Takashi Iwai
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c52
--- Comment #52 from Frank Kruger
Yes, there is another known recent regression on SLE15-SP2 / Leap 15.2 kernel due to the bad padding in pci_fixup table. Some of you might have hit this. The bad commit was introduced recently, and the fix is on its way (but not merged / released yet).
So, please wait for a bit, hopefully it'll get merged after Easter holidays.
KOTD from https://download.opensuse.org/repositories/Kernel:/openSUSE-15.2/standard/ (kernel-default-5.3.18-lp152.55.1.gd43ae1f.x86_64) works for me. Thx. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c53
Giovanni Gherdovich
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c55
Andras FEHER
*** Bug 1169608 has been marked as a duplicate of this bug. ***
openSUSE-Leap-15.2-DVD-x86_64-Build632.2-Media.iso still Install Kernel Panic Created attachment 835885 [details] scr picture Asus X52F laptop Boot from USB -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c56
--- Comment #56 from Neil Rickert
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664
http://bugzilla.opensuse.org/show_bug.cgi?id=1166664#c57
--- Comment #57 from Neil Rickert
participants (1)
-
bugzilla_noreply@novell.com