http://bugzilla.opensuse.org/show_bug.cgi?id=1172541
Bug ID: 1172541
Summary: Mysterious crashes with kernel 5.6.14
Classification: openSUSE
Product: openSUSE Tumbleweed
Version: Current
Hardware: x86-64
OS: openSUSE Factory
Status: NEW
Severity: Normal
Priority: P5 - None
Component: Kernel
Assignee: kernel-bugs(a)opensuse.org
Reporter: jimc(a)jfcarter.net
QA Contact: qa-bugs(a)suse.de
Found By: ---
Blocker: ---
Version: kernel-default-5.6.14-1.1.x86_64 and aarch64 for Tumbleweed
I did a dist-upgrade on 2020-06-02. After I waited for
download.opensuse.org to recover from its illness, the upgrade went
smoothly. But after I rebooted all the machines, 5 to 10 minutes
later two of them went catatonic, and a third (my laptop) froze up
a few minutes after being booted up two days later. After I power cycled
the gateway machine, it again froze about five minutes after booting.
On those hosts I reverted (by the grub menu) to
kernel-default-5.6.12-1.3.x86_64 and aarch64. There were no further
catatonic incidents. The other hosts continued (on 5.6.14) for up to
36 hours and did not go catatonic (but I eventually reverted them also).
Here are the symptoms: On two machines I was working on the console
(XFCE) when it happened. It seemed perfectly normal, but suddenly
keystrokes had no effect and continuously updating apps (load meter
etc.) ceased updating. On the gateway, network thru traffic was no
longer forwarded. After I rebooted it I checked syslog (debug level).
After the last normal message (e.g. DHCP lease renewed) there was a
burst of \0's, and the dmesg dump appeared immediately after, starting
with "microcode updated early to revision..." and "Jun 4 08:47:03
jacinth kernel: [ 0.000000] Linux version 5.6.12-1-default
(geeko@buildhost) (gcc version 9.3.1 20200406 [revision
6db837a5288ee3ca5ec504fbd5a765817e556ac2] (SUSE Linux)) #1 SMP Tue May
12 17:44:12 UTC 2020 (9bff61b)", i.e. the very first expected messages
from dmesg. (I carefully booted it into 5.6.12 not 5.6.14.) I have not
been able to find any useful symptoms that might shed light on what is
killing the machines.
Machines running 5.6.12 did not crash, both before the dist-upgrade and
after I reverted.
For what it's worth, here is some data about the various machines.
These went catatonic:
jacinth Gateway, directory server, never sleeps. Intel NUC6CAYH,
Celeron(R) CPU J3455 @ 1.50GHz, Intel HD Graphics 500
(i915). It runs hostapd as a wireless access point, driver
8812au from
rtl8812au-kmp-default-5.6.4.2+git20200318.49e98ff_k5.6.12_1-1.7.x86_64
xena Laptop, wireless, powered off when the human is sleeping.
Acer Aspire A515-54-51D1, Intel(R) Core(TM) i5-8265U
CPU @ 1.60GHz, Intel UHD Graphics 620 (Whiskey Lake) (i915)
diamond Normal desktop. When it crashed nobody was using it. At
night it hibernates. Intel NUC7i5BNH, Core(TM) i5-7260U CPU
@ 2.20GHz, Intel Iris Plus Graphics 640 (i915)
These survived up to 36 hours on 5.6.14 (called "non-catatonic" below):
claude Webserver (1.5 hits/min). VM (KVM) on Jacinth, x86_64,
video=cirrus
holly Desktop replacement. Raspberry Pi-3B, Broadcom BCM2837
@1.2GHz and VideoCore IV (vc4). aarch64. RPi's can't sleep.
iris Audio-video player (17kbyte/sec), hibernates when unused.
Intel NUC6CAYH, Celeron(R) CPU J3455 @ 1.50GHz (like Jacinth),
Intel HD Graphics 500 (i915)
oso Development VM (KVM) on Diamond, x86_64, video=cirrus
petra Development VM (KVM) on Xena, x86_64, video=cirrus
surya Cloud server, VM (KVM) at Linode, never sleeps :-), Intel(R)
Xeon(R) CPU E5-2680 v3 @ 2.50GHz, no GPU at all.
Thinking that X-Windows activity might be correlated with catatonia,
for about 6 hours I set the non-catatonic machines (except Surya, has no
console) to show screensaver eye candy continuously. I was wrong; none
of them crashed.
Iris hardware is identical to Jacinth, yet it did not go catatonic while
Jacinth failed more times than any other host.
I don't really know what effective action could be taken, beyond trying
to guess which kernel patch may have been the culprit. Anyway, thank
you for whatever you can do with this info.
--
You are receiving this mail because:
You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1173402
Bug ID: 1173402
Summary: Kernel delays boot by 12s if ip= option given
Classification: openSUSE
Product: openSUSE Tumbleweed
Version: Current
Hardware: Other
OS: Other
Status: NEW
Severity: Normal
Priority: P5 - None
Component: Kernel
Assignee: kernel-bugs(a)opensuse.org
Reporter: fvogt(a)suse.com
QA Contact: qa-bugs(a)suse.de
CC: dracut-maintainers(a)suse.de, iforster(a)suse.com
Found By: ---
Blocker: ---
For networking in the initrd, the "ip=dhcp" (or similiar) option can be set in
the kernel cmdline. This is parsed by dracut and used to configure wicked.
However, this option is also parsed by the kernel in its ip autoconfig code,
which then ends up trying to bring up the network itself. This bringup includes
a 12s delay to wait for interfaces to appear:
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/net…
At this point the initrd isn't even mounted, so there are no kernel modules
available and no interfaces appear.
@kernel maintainers: Is it maybe possible to not do autoconfig in this
circumstance? It might be possible to check whether there are any network
drivers at all (the default kernel seems to have those as modules only) or
whether it's booting an initrd. Though in the latter case I guess it's possible
that some initrd out there relies on the kernel's successful autoconfig?
@dracut maintainers: It seems like "rd.neednet=1" is enough to get dracut to
acquire an address over DHCP, so a workaround is to just drop "ip=dhcp". I'd
like to have confirmation that this is intended behaviour and won't break
without notice in the future. This doesn't help for cases without DHCP though.
If it's not possible to avoid the "ip=foo" induced kernel delay, it might be a
good idea to provide "rd.ip=foo" or similar as alias, which is ignored by the
kernel.
--
You are receiving this mail because:
You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1173158
Bug ID: 1173158
Summary: CONFIG_MODULE_SIG=y
Classification: openSUSE
Product: openSUSE Distribution
Version: Leap 15.2
Hardware: Other
OS: Other
Status: NEW
Severity: Normal
Priority: P5 - None
Component: Kernel
Assignee: kernel-bugs(a)opensuse.org
Reporter: lnussel(a)suse.com
QA Contact: qa-bugs(a)suse.de
CC: opensuse-releaseteam(a)suse.de
Found By: ---
Blocker: ---
The 15.2 kernel has CONFIG_MODULE_SIG=y enabled. Correct me if I'm wrong but
that means the NVidia driver won't work with secure boot enabled, right? Users
would have to go to the BIOS and disable secure boot. A step that was not
required in any openSUSE before, right?
Do we really want that? If so, requires explicit documentation IMO.
--
You are receiving this mail because:
You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1173546
Bug ID: 1173546
Summary: Provide some way to circumvent "Disabling f2fs" though
it's unsupported
Classification: openSUSE
Product: openSUSE Tumbleweed
Version: Current
Hardware: All
OS: openSUSE Factory
Status: NEW
Severity: Enhancement
Priority: P5 - None
Component: Kernel
Assignee: kernel-bugs(a)opensuse.org
Reporter: bertocheraphael(a)gmail.com
QA Contact: qa-bugs(a)suse.de
CC: afaerber(a)suse.com, bugs(a)tmarques.com,
dleuenberger(a)suse.com, jeffm(a)suse.com,
lnussel(a)suse.com, nborisov(a)suse.com, rbrown(a)suse.com
Depends on: 1109665
Found By: ---
Blocker: ---
In the discussions about this as the thread linked below it was mentioned that
enabling f2fs would be one line commentary away. But after removing the
commentary I found out I have no kernel module able to mount f2fs.
https://lists.opensuse.org/opensuse-kernel/2018-10/msg00001.html
It's worth mentioning f2fs is currently actively used - especially on mobile
platforms and other SoCs.
Is it possible to provide a better way to mount an f2fs disk, even though you
have dropped it's support, that doesn't involve recompiling the whole kernel
every once in a while?
--
You are receiving this mail because:
You are the assignee for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1109665
Jeff Mahoney <jeffm(a)suse.com> changed:
What |Removed |Added
----------------------------------------------------------------------------
Assignee|kernel-maintainers(a)forge.pr |kernel-bugs(a)opensuse.org
|ovo.novell.com |
--
You are receiving this mail because:
You are the assignee for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1165790https://bugzilla.suse.com/show_bug.cgi?id=1165790#c10
Miroslav Beneš <mbenes(a)suse.com> changed:
What |Removed |Added
----------------------------------------------------------------------------
CC| |mbenes(a)suse.com
Assignee|kernel-maintainers(a)forge.pr |kernel-bugs(a)opensuse.org
|ovo.novell.com |
--- Comment #10 from Miroslav Beneš <mbenes(a)suse.com> ---
If the issue still happens with the latest kernel in Kernel:stable (that is
5.7.x), it is time to report it upstream.
--
You are receiving this mail because:
You are the assignee for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1107525https://bugzilla.suse.com/show_bug.cgi?id=1107525#c7
--- Comment #7 from Daniel Noga <noga.dany(a)gmail.com> ---
I am not an expert, but according to Valve documentation and Phoronix blog site
it looks like it can be changed both on kernel and on systemd. Systemd change
can be maybe better, because it is only backport from upstream systemd 240
--
You are receiving this mail because:
You are the assignee for the bug.