[Bug 795937] New: DRBD disconnecting on heavy load: "BAD! BarrierAck #XXX received, expected #XXX"
https://bugzilla.novell.com/show_bug.cgi?id=795937 https://bugzilla.novell.com/show_bug.cgi?id=795937#c0 Summary: DRBD disconnecting on heavy load: "BAD! BarrierAck #XXX received, expected #XXX" Classification: openSUSE Product: openSUSE 12.2 Version: Final Platform: x86-64 OS/Version: openSUSE 12.2 Status: NEW Severity: Critical Priority: P5 - None Component: Kernel AssignedTo: kernel-maintainers@forge.provo.novell.com ReportedBy: servizioclienti@livenet.it QAContact: qa-bugs@suse.de Found By: --- Blocker: --- User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.11 (KHTML, like Gecko) Chrome/23.0.1271.97 Safari/537.11 I'm experiencing the same problem as in bug 714510 I reported a year ago, closed WONTFIX. This is a freshly installed 12.2 with updates. On heavy load (rsync over gigabit, disks are hw raid on both sides), the primary prints on logs Dec 26 22:21:29 maria kernel: [ 4406.648798] block drbd1: BAD! BarrierAck #221567884 received, expected #221567882! Dec 26 22:21:29 maria kernel: [ 4406.648808] block drbd1: peer( Secondary -> Unknown ) conn( Connected -> ProtocolError ) pdsk( UpToDate -> DUnknown ) Dec 26 22:21:29 maria kernel: [ 4406.648840] block drbd1: new current UUID EBAFCE24637DB1EB:B7914897B687272B:DD55206DD0404A9D:DD54206DD0404A9D Dec 26 22:21:29 maria kernel: [ 4406.648858] block drbd1: asender terminated Dec 26 22:21:29 maria kernel: [ 4406.648861] block drbd1: Terminating drbd1_asender Dec 26 22:21:29 maria kernel: [ 4406.649372] block drbd1: Connection closed Dec 26 22:21:29 maria kernel: [ 4406.649378] block drbd1: conn( ProtocolError -> Unconnected ) Dec 26 22:21:29 maria kernel: [ 4406.649386] block drbd1: receiver terminated The secondary writes Dec 26 22:21:29 annunziata kernel: [ 341.664099] block drbd1: meta connection shut down by peer. Dec 26 22:21:29 annunziata kernel: [ 341.664109] block drbd1: peer( Primary -> Unknown ) conn( Connected -> NetworkFailure ) pdsk( UpToDate -> DUnknown ) Dec 26 22:21:29 annunziata kernel: [ 341.664145] block drbd1: error receiving Data, l: 45080! Dec 26 22:21:29 annunziata kernel: [ 341.714355] block drbd1: asender terminated Dec 26 22:21:29 annunziata kernel: [ 341.714362] block drbd1: Terminating drbd1_asender Dec 26 22:21:29 annunziata kernel: [ 341.714560] block drbd1: Connection closed Dec 26 22:21:29 annunziata kernel: [ 341.714565] block drbd1: conn( NetworkFailure -> Unconnected ) Dec 26 22:21:29 annunziata kernel: [ 341.714570] block drbd1: receiver terminated (and then the connection restarts) While the connection restarts (two-three seconds) disks freeze. Sometimes connection stucks in 'syncing 100%' state and I need to reboot both sides (drbd stop and start doesn't work). Only once I've seen the primary to reboot. Building the module from source tarball (http://oss.linbit.com/drbd/8.3/drbd-8.3.11.tar.gz) the problem disappears, so I think you need to check differences between original sources and your sources. Does openSUSE team think that drbd module is not important? Valerio Reproducible: Always Steps to Reproduce: 1. install two opensuse 12.2 2. configure a drbd disk pri/sec and do strong traffic 3. check logs Actual Results: disconnections and locks Expected Results: none of the above -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=795937 https://bugzilla.novell.com/show_bug.cgi?id=795937#c Jeff Mahoney <jeffm@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- AssignedTo|kernel-maintainers@forge.pr |lmb@suse.com |ovo.novell.com | -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=795937 https://bugzilla.novell.com/show_bug.cgi?id=795937#c1 Lars Marowsky-Bree <lmb@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Priority|P5 - None |P3 - Medium CC| |lmb@suse.com AssignedTo|lmb@suse.com |xli@suse.com --- Comment #1 from Lars Marowsky-Bree <lmb@suse.com> 2013-07-16 04:30:11 UTC --- Needs fixing in 12.3 or 13.1, latest. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=795937 https://bugzilla.novell.com/show_bug.cgi?id=795937#c2 Jeff Mahoney <jeffm@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |RESOLVED Resolution| |WONTFIX --- Comment #2 from Jeff Mahoney <jeffm@suse.com> 2014-08-08 16:29:00 EDT --- This report is against openSUSE 12.2 which is no longer under maintenance. If you are able to reproduce it with openSUSE 13.1 or openSUSE Factory, please re-open and reset the the "Product" field to the appropriate release. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@novell.com