[Bug 1160604] New: BTRFS related kernel NULL pointer dereference
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604 Bug ID: 1160604 Summary: BTRFS related kernel NULL pointer dereference Classification: openSUSE Product: openSUSE Tumbleweed Version: Current Hardware: aarch64 OS: Other Status: NEW Severity: Normal Priority: P5 - None Component: Kernel Assignee: kernel-maintainers@forge.provo.novell.com Reporter: guillaume.gardet@arm.com QA Contact: qa-bugs@suse.de Found By: --- Blocker: --- Created attachment 827267 --> http://bugzilla.opensuse.org/attachment.cgi?id=827267&action=edit kernel 5.4 traces On OBS while building JeOS images based on BTRFS filesystem, kernel crashes with a kernel NULL pointer dereference. There was no problem kernel 5.3, only with 5.4. Traces are in attachment. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
Guillaume GARDET
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604#c1
--- Comment #1 from Guillaume GARDET
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
Matthias Brugger
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
Anthony Iliopoulos
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604#c4
--- Comment #4 from Guillaume GARDET
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604#c5
--- Comment #5 from Guillaume GARDET
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604#c6
Matthias Brugger
Fix is merged in 5.5-rc6. Could we get it for 5.4 (stable branch)?
that's upstream commit: 46bcff2bfc5e ("btrfs: fix compressed write bio blkcg attribution") and maybe as well: 7b62e66cbbfb ("btrfs: punt all bios created in btrfs_submit_compressed_write()") Wenruo can you take care of this? This issue blocks tumbleweed update on aarch64/arm64. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604#c7
David Sterba
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604#c8
--- Comment #8 from David Sterba
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604#c11
Guillaume GARDET
It would need more tests, but it seems to be better with kernel 5.5.
It still happens with kernel 5.5.11 in current Tumbleweed on ThunderX1 machines in OBS while building aarch64 kiwi images. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604#c12
--- Comment #12 from Guillaume GARDET
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
Dominique Leuenberger
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604#c13
--- Comment #13 from Guillaume GARDET
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604#c14
--- Comment #14 from Guillaume GARDET
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604#c15
Matthias Brugger
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604#c16
Guillaume GARDET
Can you get the memory dump from the crashed kernel. Maybe that would help to analyze the problem.
AFAIK, OBS does not produce it and I have no ThunderX1 to reproduce locally. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
Matthias Brugger
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604#c17
--- Comment #17 from Matthias Brugger
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
Matthias Brugger
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604#c18
--- Comment #18 from Guillaume GARDET
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604#c19
Wenruo Qu
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604#c20
Guillaume GARDET
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604#c21
Guillaume GARDET
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604#c22
Wenruo Qu
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604#c23
--- Comment #23 from Guillaume GARDET
I guess Nik has pinned down the root cause, and the upstream fix is here:
https://patchwork.kernel.org/project/linux-btrfs/patch/20211102124916.433836... 1-nborisov@suse.com/
Mind to test if this patch solves the problem?
This patch is in kernel 5.15.5 which is currently tested in staging, and thus, should land in :Factory:ARM in the coming days. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604
http://bugzilla.opensuse.org/show_bug.cgi?id=1160604#c24
Guillaume GARDET
participants (2)
-
bugzilla_noreply@novell.com
-
bugzilla_noreply@suse.com