[Bug 1188896] "BUG: Bad rss-counter state mm:00000000e555b579 type:MM_FILEPAGES val:-256" during build of cross-epiphany-gcc11-bootstrap
https://bugzilla.suse.com/show_bug.cgi?id=1188896 https://bugzilla.suse.com/show_bug.cgi?id=1188896#c49 --- Comment #49 from LTC BugProxy <bugproxy@us.ibm.com> --- ------- Comment From geraldsc@de.ibm.com 2022-02-08 14:08 EDT------- (In reply to comment #46)
no, the link is purely internal and the dump was only created using gdb and the dump-guest-memory script in the qemu repo on a hanging vm that was stuck for longer.
will look at the details later for upload if you are interested in that one
In the best case, such a dump could at least show the kernel messages, while the rest of it would be useless. However, we previously already had such a dump from a long stuck system, where the kernel log buffer was completely filled with repeating rcu_sched or similar errors, so that it still could not reveal the messages at the time when it all started. It still would be better than nothing, and at least serve as an exercise how to collect and submit a dump :-) The best chance would be the "panic_on_taint=5" kernel parameter approach, although I cannot 100% guarantee that it will work as expected. It was suggested by Fabian in an earlier comment here: "The "bad page state" warning causes a taint, so panic_on_taint=5 should help there. Needs an OBS admin to add that option and get a dump though, or alternatively someone who can reproduce it (somewhat reliably) with osc build --vm-type=kvm." I frankly wasn't aware of that feature myself, but a quick check in the kernel source code seems to confirm that this kernel parameter should trigger a panic on a "bad page state" event (and no other possible kernel taints), at least since kernel 5.8. After that, it should be possible to collect a dump in the same way that you did already. -- You are receiving this mail because: You are the assignee for the bug.
participants (1)
-
bugzilla_noreply@suse.com