[Bug 1021260] New: bcache garbage collection thread crashes on blocking operation
http://bugzilla.opensuse.org/show_bug.cgi?id=1021260 Bug ID: 1021260 Summary: bcache garbage collection thread crashes on blocking operation Classification: openSUSE Product: openSUSE Distribution Version: Leap 42.2 Hardware: x86-64 OS: Other Status: NEW Severity: Major Priority: P5 - None Component: Kernel Assignee: kernel-maintainers@forge.provo.novell.com Reporter: seroton10@gmail.com QA Contact: qa-bugs@suse.de Found By: --- Blocker: --- I have starting seeing errors like the one quoted below in the system log. It occurs infrequently, but quite regularly, about 1-3 times a week, on a server running 24x7. Around the time it began, I started running a beta version of Leap 42.2, upgraded from 42.1. Also, I enabled the "discard" option (SSD TRIM) on the bcache cache about 3-6 months ago. I believe one of those two events caused the bug to appear. Not sure what other info is useful, please ask for whatever you need. Oct 10 00:00:02 blackbox kernel: ------------[ cut here ]------------ Oct 10 00:00:02 blackbox kernel: WARNING: CPU: 4 PID: 1269 at ../kernel/sched/core.c:7891 __might_sleep+0x76/0x80() Oct 10 00:00:02 blackbox kernel: do not call blocking ops when !TASK_RUNNING; state=1 set at [<ffffffffa09e2325>] bch_gc_thread+0x25/0x100 [ Oct 10 00:00:02 blackbox kernel: Modules linked in: vhost_net vhost macvtap macvlan fuse ebt_arp ebt_ip ebtable_nat ebtable_filter ebtables Oct 10 00:00:02 blackbox kernel: mxm_wmi Oct 10 00:00:02 blackbox kernel: bcache aesni_intel raid1 snd_hda_codec_realtek aes_x86_64 lrw snd_hda_codec_generic gf128mul md_mod glue_h Oct 10 00:00:02 blackbox kernel: Oct 10 00:00:02 blackbox kernel: CPU: 4 PID: 1269 Comm: bcache_gc Not tainted 4.4.21-2-default #1 Oct 10 00:00:02 blackbox kernel: Hardware name: To be filled by O.E.M. To be filled by O.E.M./M5A99X EVO R2.0, BIOS 2301 01/06/2014 Oct 10 00:00:02 blackbox kernel: 0000000000000000 ffffffff81326967 ffff8800b605be10 ffffffff81a5e431 Oct 10 00:00:02 blackbox kernel: ffffffff8107e7d1 ffffffff81a5f54f ffff8800b605be60 0000000000000061 Oct 10 00:00:02 blackbox kernel: 0000000000000000 Oct 10 00:00:02 blackbox kernel: 0000000000000000 ffffffff8107e84c ffffffff81a4ef88 Oct 10 00:00:02 blackbox kernel: Call Trace: Oct 10 00:00:02 blackbox kernel: [<ffffffff81019e69>] dump_trace+0x59/0x320 Oct 10 00:00:02 blackbox kernel: [<ffffffff8101a22a>] show_stack_log_lvl+0xfa/0x180 Oct 10 00:00:02 blackbox kernel: [<ffffffff8101afd1>] show_stack+0x21/0x40 Oct 10 00:00:02 blackbox kernel: [<ffffffff81326967>] dump_stack+0x5c/0x85 Oct 10 00:00:02 blackbox kernel: [<ffffffff8107e7d1>] warn_slowpath_common+0x81/0xb0 Oct 10 00:00:02 blackbox kernel: [<ffffffff8107e84c>] warn_slowpath_fmt+0x4c/0x50 Oct 10 00:00:02 blackbox kernel: [<ffffffff810a3026>] __might_sleep+0x76/0x80 Oct 10 00:00:02 blackbox kernel: [<ffffffff81605cac>] mutex_lock+0x1c/0x38 Oct 10 00:00:02 blackbox kernel: [<ffffffffa09e2365>] bch_gc_thread+0x65/0x100 [bcache] Oct 10 00:00:02 blackbox kernel: [<ffffffff8109d268>] kthread+0xc8/0xe0 Oct 10 00:00:02 blackbox kernel: [<ffffffff8160828f>] ret_from_fork+0x3f/0x70 Oct 10 00:00:02 blackbox kernel: DWARF2 unwinder stuck at ret_from_fork+0x3f/0x70 Oct 10 00:00:02 blackbox kernel: Oct 10 00:00:02 blackbox kernel: Leftover inexact backtrace: Oct 10 00:00:02 blackbox kernel: [<ffffffff8109d1a0>] ? kthread_park+0x50/0x50 Oct 10 00:00:02 blackbox kernel: ---[ end trace c63abcb6c473e79b ]--- # journalctl|grep "blocking ops" Oct 10 00:00:02 blackbox kernel: do not call blocking ops when !TASK_RUNNING; state=1 set at [<ffffffffa09e2325>] bch_gc_thread+0x25/0x100 [bcache] Oct 15 20:45:04 blackbox kernel: do not call blocking ops when !TASK_RUNNING; state=1 set at [<ffffffffa0883325>] bch_gc_thread+0x25/0x100 [bcache] Oct 22 20:44:19 blackbox kernel: do not call blocking ops when !TASK_RUNNING; state=1 set at [<ffffffffa09b0325>] bch_gc_thread+0x25/0x100 [bcache] Oct 25 23:01:02 blackbox kernel: do not call blocking ops when !TASK_RUNNING; state=1 set at [<ffffffffa0e2f325>] bch_gc_thread+0x25/0x100 [bcache] Oct 29 13:46:09 blackbox kernel: do not call blocking ops when !TASK_RUNNING; state=1 set at [<ffffffffa0a26325>] bch_gc_thread+0x25/0x100 [bcache] Nov 03 21:59:51 blackbox kernel: do not call blocking ops when !TASK_RUNNING; state=1 set at [<ffffffffa07f0325>] bch_gc_thread+0x25/0x100 [bcache] Nov 07 00:00:01 blackbox kernel: do not call blocking ops when !TASK_RUNNING; state=1 set at [<ffffffffa087c325>] bch_gc_thread+0x25/0x100 [bcache] Nov 08 22:29:33 blackbox kernel: do not call blocking ops when !TASK_RUNNING; state=1 set at [<ffffffffa0835325>] bch_gc_thread+0x25/0x100 [bcache] Nov 17 02:29:37 blackbox kernel: do not call blocking ops when !TASK_RUNNING; state=1 set at [<ffffffffa09a9325>] bch_gc_thread+0x25/0x100 [bcache] Dec 03 01:44:06 blackbox kernel: do not call blocking ops when !TASK_RUNNING; state=1 set at [<ffffffffa0999325>] bch_gc_thread+0x25/0x100 [bcache] Dec 03 21:51:51 blackbox kernel: do not call blocking ops when !TASK_RUNNING; state=1 set at [<ffffffffa077b325>] bch_gc_thread+0x25/0x100 [bcache] Dec 10 14:29:28 blackbox kernel: do not call blocking ops when !TASK_RUNNING; state=1 set at [<ffffffffa09e0325>] bch_gc_thread+0x25/0x100 [bcache] Dec 12 00:00:01 blackbox kernel: do not call blocking ops when !TASK_RUNNING; state=1 set at [<ffffffffa07c1325>] bch_gc_thread+0x25/0x100 [bcache] Dec 16 12:07:41 blackbox kernel: do not call blocking ops when !TASK_RUNNING; state=1 set at [<ffffffffa07f2325>] bch_gc_thread+0x25/0x100 [bcache] Dec 30 22:19:00 blackbox kernel: do not call blocking ops when !TASK_RUNNING; state=1 set at [<ffffffffa08ca325>] bch_gc_thread+0x25/0x100 [bcache] Jan 04 02:22:08 blackbox kernel: do not call blocking ops when !TASK_RUNNING; state=1 set at [<ffffffffa09df325>] bch_gc_thread+0x25/0x100 [bcache] Jan 07 20:57:24 blackbox kernel: do not call blocking ops when !TASK_RUNNING; state=1 set at [<ffffffffa09c8325>] bch_gc_thread+0x25/0x100 [bcache] Jan 09 00:00:02 blackbox kernel: do not call blocking ops when !TASK_RUNNING; state=1 set at [<ffffffffa0844325>] bch_gc_thread+0x25/0x100 [bcache] -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1021260
http://bugzilla.opensuse.org/show_bug.cgi?id=1021260#c2
--- Comment #2 from Olav Reinert
http://bugzilla.opensuse.org/show_bug.cgi?id=1021260
Coly Li
participants (1)
-
bugzilla_noreply@novell.com