https://bugzilla.novell.com/show_bug.cgi?id=795937 https://bugzilla.novell.com/show_bug.cgi?id=795937#c0 Summary: DRBD disconnecting on heavy load: "BAD! BarrierAck #XXX received, expected #XXX" Classification: openSUSE Product: openSUSE 12.2 Version: Final Platform: x86-64 OS/Version: openSUSE 12.2 Status: NEW Severity: Critical Priority: P5 - None Component: Kernel AssignedTo: kernel-maintainers@forge.provo.novell.com ReportedBy: servizioclienti@livenet.it QAContact: qa-bugs@suse.de Found By: --- Blocker: --- User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.11 (KHTML, like Gecko) Chrome/23.0.1271.97 Safari/537.11 I'm experiencing the same problem as in bug 714510 I reported a year ago, closed WONTFIX. This is a freshly installed 12.2 with updates. On heavy load (rsync over gigabit, disks are hw raid on both sides), the primary prints on logs Dec 26 22:21:29 maria kernel: [ 4406.648798] block drbd1: BAD! BarrierAck #221567884 received, expected #221567882! Dec 26 22:21:29 maria kernel: [ 4406.648808] block drbd1: peer( Secondary -> Unknown ) conn( Connected -> ProtocolError ) pdsk( UpToDate -> DUnknown ) Dec 26 22:21:29 maria kernel: [ 4406.648840] block drbd1: new current UUID EBAFCE24637DB1EB:B7914897B687272B:DD55206DD0404A9D:DD54206DD0404A9D Dec 26 22:21:29 maria kernel: [ 4406.648858] block drbd1: asender terminated Dec 26 22:21:29 maria kernel: [ 4406.648861] block drbd1: Terminating drbd1_asender Dec 26 22:21:29 maria kernel: [ 4406.649372] block drbd1: Connection closed Dec 26 22:21:29 maria kernel: [ 4406.649378] block drbd1: conn( ProtocolError -> Unconnected ) Dec 26 22:21:29 maria kernel: [ 4406.649386] block drbd1: receiver terminated The secondary writes Dec 26 22:21:29 annunziata kernel: [ 341.664099] block drbd1: meta connection shut down by peer. Dec 26 22:21:29 annunziata kernel: [ 341.664109] block drbd1: peer( Primary -> Unknown ) conn( Connected -> NetworkFailure ) pdsk( UpToDate -> DUnknown ) Dec 26 22:21:29 annunziata kernel: [ 341.664145] block drbd1: error receiving Data, l: 45080! Dec 26 22:21:29 annunziata kernel: [ 341.714355] block drbd1: asender terminated Dec 26 22:21:29 annunziata kernel: [ 341.714362] block drbd1: Terminating drbd1_asender Dec 26 22:21:29 annunziata kernel: [ 341.714560] block drbd1: Connection closed Dec 26 22:21:29 annunziata kernel: [ 341.714565] block drbd1: conn( NetworkFailure -> Unconnected ) Dec 26 22:21:29 annunziata kernel: [ 341.714570] block drbd1: receiver terminated (and then the connection restarts) While the connection restarts (two-three seconds) disks freeze. Sometimes connection stucks in 'syncing 100%' state and I need to reboot both sides (drbd stop and start doesn't work). Only once I've seen the primary to reboot. Building the module from source tarball (http://oss.linbit.com/drbd/8.3/drbd-8.3.11.tar.gz) the problem disappears, so I think you need to check differences between original sources and your sources. Does openSUSE team think that drbd module is not important? Valerio Reproducible: Always Steps to Reproduce: 1. install two opensuse 12.2 2. configure a drbd disk pri/sec and do strong traffic 3. check logs Actual Results: disconnections and locks Expected Results: none of the above -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.