[Bug 1045935] New: can not create DLM resource with hawk
http://bugzilla.suse.com/show_bug.cgi?id=1045935

Bug ID: 1045935
Summary: can not create DLM resource with hawk
Classification: openSUSE
Product: openSUSE Distribution
Version: Leap 42.2
Hardware: Other
OS: Other
Status: NEW
Severity: Normal
Priority: P5 - None
Component: High Availability
Assignee: kgronlund@suse.com
Reporter: lszhu@suse.com
QA Contact: qa-bugs@suse.de
Found By: ---
Blocker: ---

In Leap 42.2, updated to the latest release, I tried to create a DLM resource with Hawk. It failed with the message:

2017-06-26 16:53: Operation start failed for resource dlm_scst on node scst_node2: call-id=32, rc-code=configuration error (6), exit-reason=

I configured fencing, but even with the parameter allow_stonith_disable I still see the error.

Then I noticed that dlm_controld is not running, and occasionally the dlm kernel module is not even loaded.

--
You are receiving this mail because:
You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1045935#c1
--- Comment #1 from Kristoffer Gronlund
> In Leap 42.2, updated to the latest release, I tried to create a DLM resource with Hawk. It failed with the message:
> 2017-06-26 16:53: Operation start failed for resource dlm_scst on node scst_node2: call-id=32, rc-code=configuration error (6), exit-reason=
> I configured fencing, but even with the parameter allow_stonith_disable I still see the error.
> Then I noticed that dlm_controld is not running, and occasionally the dlm kernel module is not even loaded.

Hi Lingshan, could you create an hb_report and attach it to this issue? Thank you.
http://bugzilla.suse.com/show_bug.cgi?id=1045935#c2
--- Comment #2 from Lingshan Zhu
> (In reply to Lingshan Zhu from comment #0)
> > In Leap 42.2, updated to the latest release, I tried to create a DLM resource with Hawk. It failed with the message:
> > 2017-06-26 16:53: Operation start failed for resource dlm_scst on node scst_node2: call-id=32, rc-code=configuration error (6), exit-reason=
> > I configured fencing, but even with the parameter allow_stonith_disable I still see the error.
> > Then I noticed that dlm_controld is not running, and occasionally the dlm kernel module is not even loaded.
> Hi Lingshan, could you create an hb_report and attach it to this issue? Thank you.

Hi Kristoffer,

I resolved this issue "manually" and found some causes for it:
(1) Stonith is disabled by default, even after I configured it.
(2) The parameter allow_stonith_disable does not seem to work.

I hope this helps us narrow down the issue. I will keep you updated when I find more.
http://bugzilla.suse.com/show_bug.cgi?id=1045935#c3
--- Comment #3 from Kristoffer Gronlund
http://bugzilla.suse.com/show_bug.cgi?id=1045935#c4
--- Comment #4 from Lingshan Zhu
> Hi Lingshan, did you have time to research this issue any further? It sounds like maybe there is a bug with the dlm resource agent.

Let's see whether there is any input from our DLM maintainer, Eric.
http://bugzilla.suse.com/show_bug.cgi?id=1045935#c5
--- Comment #5 from zhen ren
http://bugzilla.suse.com/show_bug.cgi?id=1045935#c6
--- Comment #6 from zhen ren
> Hi Kristoffer,
> I resolved this issue "manually", I found some causes for it:

How did you manually get rid of this problem?
http://bugzilla.suse.com/show_bug.cgi?id=1045935#c7
--- Comment #7 from zhen ren
> Hi Lingshan, could you create a hb_report and attach to this issue? Thank you

Is it still possible to make a hb_report on that system?
http://bugzilla.suse.com/show_bug.cgi?id=1045935#c8
--- Comment #8 from Lingshan Zhu
> (In reply to Kristoffer Gronlund from comment #1)
> > [....]
> > Hi Lingshan, could you create a hb_report and attach to this issue? Thank you
> Is it still possible to make a hb_report on that system?

I resolved it manually by adding a VMware fence. The problem is easy to reproduce, so please collect the information from a freshly set up environment.
http://bugzilla.suse.com/show_bug.cgi?id=1045935#c9
--- Comment #9 from zhen ren
http://bugzilla.suse.com/show_bug.cgi?id=1045935#c10
--- Comment #10 from zhen ren
http://bugzilla.suse.com/show_bug.cgi?id=1045935#c11
--- Comment #11 from zhen ren
http://bugzilla.suse.com/show_bug.cgi?id=1045935#c12
--- Comment #12 from zhen ren
With Hawk, I removed every resource except libvirt_stonith. Then I tried to add the DLM resource back, which failed with the errors below, as shown in Hawk:
====
2017-07-25 10:16: Operation monitor failed for resource dlm on node clvm2: call-id=44, rc-code=installation error (5), exit-reason=Config FILE /etc/conntrackd/conntrackd.conf does not exist
2017-07-25 10:16: Operation monitor failed for resource dlm on node clvm2: call-id=44, rc-code=installation error (5), exit-reason=Config FILE /etc/conntrackd/conntrackd.conf does not exist
2017-07-25 10:16: Operation monitor failed for resource dlm on node clvm1: call-id=41, rc-code=installation error (5), exit-reason=Config FILE /etc/conntrackd/conntrackd.conf does not exist
2017-07-25 10:16: Operation monitor failed for resource dlm on node clvm1: call-id=41, rc-code=installation error (5), exit-reason=Config FILE /etc/conntrackd/conntrackd.conf does not exist
====
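The failure lines quoted in this thread all follow the same shape, so they can be picked apart mechanically when triaging. The following is a minimal Python sketch, not part of Hawk or Pacemaker: the regular expression is inferred only from the messages shown here, and the names `FailedOp` and `parse_failed_op` are hypothetical. The numeric codes are the standard OCF resource agent return codes (5 is OCF_ERR_INSTALLED, 6 is OCF_ERR_CONFIGURED).

```python
import re
from typing import NamedTuple, Optional

# Standard OCF resource agent return codes (subset).
OCF_RETURN_CODES = {
    0: "OCF_SUCCESS",
    1: "OCF_ERR_GENERIC",
    2: "OCF_ERR_ARGS",
    3: "OCF_ERR_UNIMPLEMENTED",
    4: "OCF_ERR_PERM",
    5: "OCF_ERR_INSTALLED",
    6: "OCF_ERR_CONFIGURED",
    7: "OCF_NOT_RUNNING",
}


class FailedOp(NamedTuple):
    """Hypothetical container for one parsed failure line."""
    operation: str
    resource: str
    node: str
    call_id: int
    rc: int
    rc_name: str
    exit_reason: str


# Assumption: the message layout is exactly the one quoted in this thread.
_PATTERN = re.compile(
    r"Operation (?P<op>\S+) failed for resource (?P<res>\S+) "
    r"on node (?P<node>\S+): call-id=(?P<call>\d+), "
    r"rc-code=[^()]*\((?P<rc>\d+)\), exit-reason=(?P<reason>.*)$"
)


def parse_failed_op(line: str) -> Optional[FailedOp]:
    """Return the parsed fields, or None if the line has a different shape."""
    match = _PATTERN.search(line.strip())
    if match is None:
        return None
    rc = int(match.group("rc"))
    return FailedOp(
        operation=match.group("op"),
        resource=match.group("res"),
        node=match.group("node"),
        call_id=int(match.group("call")),
        rc=rc,
        rc_name=OCF_RETURN_CODES.get(rc, "unknown"),
        exit_reason=match.group("reason").strip(),
    )
```

Applied to the lines above, the exit reason mentions /etc/conntrackd/conntrackd.conf, which points at a conntrackd agent rather than at DLM itself.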
http://bugzilla.suse.com/show_bug.cgi?id=1045935#c13
--- Comment #13 from zhen ren
http://bugzilla.suse.com/show_bug.cgi?id=1045935#c14
--- Comment #14 from zhen ren
Oops, I picked the wrong RA for DLM:
ocf::heartbeat:conntrackd
The right one should be:
ocf::pacemaker:controld
To be clear, after choosing the right RA, it works fine.
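Since the two agent names are easy to confuse in a selection list, a small guard can catch the mix-up before the resource is created. This is an illustrative Python sketch only: `AgentSpec`, `parse_agent` and `is_dlm_agent` are hypothetical helpers, not part of Hawk or crmsh. It accepts the double-colon form used in this thread and, as an assumption, the single-colon form as well.

```python
from typing import NamedTuple


class AgentSpec(NamedTuple):
    """Resource agent identifier split into its three parts."""
    standard: str
    provider: str
    agent: str


# The agent named in this thread as the right one for DLM.
DLM_AGENT = AgentSpec("ocf", "pacemaker", "controld")


def parse_agent(spec: str) -> AgentSpec:
    """Parse 'ocf::pacemaker:controld' or 'ocf:pacemaker:controld'."""
    # Dropping empty pieces makes '::' and ':' equivalent separators.
    parts = [part for part in spec.strip().split(":") if part]
    if len(parts) != 3:
        raise ValueError(f"expected standard:provider:agent, got {spec!r}")
    return AgentSpec(*parts)


def is_dlm_agent(spec: str) -> bool:
    """True only if the spec names the controld agent from the pacemaker provider."""
    return parse_agent(spec) == DLM_AGENT
```

With these helpers, `is_dlm_agent("ocf::heartbeat:conntrackd")` is false and `is_dlm_agent("ocf::pacemaker:controld")` is true, matching the correction above.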
http://bugzilla.suse.com/show_bug.cgi?id=1045935#c15
--- Comment #15 from Kristoffer Gronlund
> So, I cannot reproduce this problem with Hawk. It would surprise me a lot if adding a DLM resource with Hawk could cause trouble this easily.
> @Kristoffer, have you ever reproduced this problem?

No, I haven't tried to reproduce it myself.
http://bugzilla.suse.com/show_bug.cgi?id=1045935#c16
--- Comment #16 from Kristoffer Gronlund