[Bug 1051686] New: XFS regression: metadata I/O error: block 0x575b6d70,("xfs_trans_read_buf_map") error 4 numblks 16
http://bugzilla.opensuse.org/show_bug.cgi?id=1051686 Bug ID: 1051686 Summary: XFS regression: metadata I/O error: block 0x575b6d70,("xfs_trans_read_buf_map") error 4 numblks 16 Classification: openSUSE Product: openSUSE Distribution Version: Leap 42.3 Hardware: All OS: All Status: NEW Severity: Critical Priority: P5 - None Component: Kernel Assignee: kernel-maintainers@forge.provo.novell.com Reporter: s.priebe@profihost.ag QA Contact: qa-bugs@suse.de Found By: --- Blocker: --- Hello, after an upgrade from 42.2 based kernel to 42.3 current git master i'm getting XFS in memory data corruptions on more than 10 different servers under high load. complete dmesg output is: XFS (bcache1): metadata I/O error: block 0x575b6d70 ("xfs_trans_read_buf_map") error 4 numblks 16 XFS (bcache1): xfs_imap_to_bp: xfs_trans_read_buf() returned error -4. XFS (bcache1): xfs_do_force_shutdown(0x8) called from line 3519 of file fs/xfs/xfs_inode.c. Return address = 0xffffffff993441ff XFS (bcache1): Corruption of in-memory data detected. Shutting down filesystem XFS (bcache1): Please umount the filesystem and rectify the problem(s) XFS (bcache1): xfs_log_force: error -5 returned. XFS (bcache1): xfs_log_force: error -5 returned. XFS (bcache1): xfs_log_force: error -5 returned. Greets, Stefan -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1051686 Stefan Priebe <s.priebe@profihost.ag> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |s.priebe@profihost.ag -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1051686 http://bugzilla.opensuse.org/show_bug.cgi?id=1051686#c1 --- Comment #1 from Jeff Mahoney <jeffm@suse.com> --- Are all the hosts you're seeing this on using bcache? -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1051686 http://bugzilla.opensuse.org/show_bug.cgi?id=1051686#c2 --- Comment #2 from Stefan Priebe <s.priebe@profihost.ag> --- Yes. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1051686 http://bugzilla.opensuse.org/show_bug.cgi?id=1051686#c3 --- Comment #3 from Stefan Priebe <s.priebe@profihost.ag> --- But they were working fine under 42.2 -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1051686 Jeff Mahoney <jeffm@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |colyli@suse.com, | |jeffm@suse.com, | |nborisov@suse.com -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1051686 Jeff Mahoney <jeffm@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |rgoldwyn@suse.com -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1051686 http://bugzilla.opensuse.org/show_bug.cgi?id=1051686#c6 Jeff Mahoney <jeffm@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Assignee|kernel-maintainers@forge.pr |colyli@suse.com |ovo.novell.com | --- Comment #6 from Jeff Mahoney <jeffm@suse.com> --- linux-4.4-openSUSE-42.3/drivers/md/bcache> grep -r EINTR . ./request.c: s->iop.error = -EINTR; ./request.c: ret = miss == bio ? MAP_DONE : -EINTR; ./btree.c: * going to have to split), set op->lock and return -EINTR; btree_root() will ./btree.c: int _r = -EINTR; \ ./btree.c: if (_r == -EINTR) \ ./btree.c: } while (_r == -EINTR); \ ./btree.c: return -EINTR; ./btree.c: return ERR_PTR(-EINTR); ./btree.c: return -EINTR; ./btree.c: r[nodes - 1].b = ERR_PTR(-EINTR); ./btree.c: return -EINTR; ./btree.c: return -EINTR; ./btree.c: i->b = ERR_PTR(-EINTR); ./btree.c: return -EINTR; ./btree.c: return -EINTR; ./btree.c: return -EINTR; ./btree.c: return -EINTR; ./btree.c: int ret = -EINTR; ./btree.c: w->private = ERR_PTR(-EINTR); ./btree.h: * the lock field and returns -EINTR, which causes the btree_root() macro to ./super.c: return -EINTR; ./movinggc.c: io->op.error = -EINTR; That looks more promising, though. Coly, could you take a look here? -- You are receiving this mail because: You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@novell.com