<div dir="ltr"><div dir="ltr"><br></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Tue, 12 Dec 2023 at 16:17, Andrei Borzenkov <<a href="mailto:arvidjaar@gmail.com">arvidjaar@gmail.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">On Fri, Dec 8, 2023 at 5:44 PM Artem <<a href="mailto:tyomikh@gmail.com" target="_blank">tyomikh@gmail.com</a>> wrote:<br>> pcs constraint location FAKE3 rule score=0 pingd lt 1 or not_defined pingd<br>

> pcs constraint location FAKE4 rule score=0 pingd lt 1 or not_defined pingd<br>

> pcs constraint location FAKE3 rule score=125 pingd gt 0 or defined pingd<br>

> pcs constraint location FAKE4 rule score=125 pingd gt 0 or defined pingd<br>

><br>

<br>

These rules are contradicting. You set the score to 125 if pingd is<br>

defined and at the same time set it to 0 if the score is less than 1.<br>

To be "less than 1" it must be defined to start with so both rules<br>

will always apply. I do not know how the rules are ordered. Either you<br>

get random behavior, or one pair of these rules is effectively<br>

ignored.<br></blockquote><div><br></div><div>"pingd lt 1 or not_defined pingd" means to me ==0 or not_defined, that is ping fails to ping GW or fails to report to corosync/pacemaker. Am I wrong?<br></div><div>"pingd gt 0 or defined pingd" means to me that ping gets reply from GW and reports it to cluster.</div><div>Are they really contradicting?<br></div><div>I read this article and tried to do in a similar way: <a href="https://habr.com/ru/articles/118925/">https://habr.com/ru/articles/118925/</a></div><div><br></div><div><br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">

><br>

> Question #1) Why I cannot see accumulated score from pingd in crm_simulate output? Only location score and stickiness.<br>

> pcmk__primitive_assign: FAKE3 allocation score on lustre3: 210<br>

> pcmk__primitive_assign: FAKE3 allocation score on lustre4: 90<br>

> pcmk__primitive_assign: FAKE4 allocation score on lustre3: 90<br>

> pcmk__primitive_assign: FAKE4 allocation score on lustre4: 210<br>

> Either when all is OK or when VM is down - score from pingd not added to total score of RA<br>

><br>

><br>

> Question #2) I shut lustre3 VM down and leave it like that. pcs status:<br>

>   * FAKE3       (ocf::pacemaker:Dummy):  Stopped<br>

>   * FAKE4       (ocf::pacemaker:Dummy):  Started lustre4<br>

>   * Clone Set: ping-clone [ping]:<br>

>     * Started: [ lustre-mds1 lustre-mds2 lustre-mgs lustre1 lustre2 lustre4 ] << lustre3 missing<br>

> OK for now<br>

> VM boots up. pcs status:<br>

>   * FAKE3       (ocf::pacemaker:Dummy):  FAILED (blocked) [ lustre3 lustre4 ]  << what is it?<br>

>   * Clone Set: ping-clone [ping]:<br>

>     * ping      (ocf::pacemaker:ping):   FAILED lustre3 (blocked)    << why not started?<br>

>     * Started: [ lustre-mds1 lustre-mds2 lustre-mgs lustre1 lustre2 lustre4 ]<br>

<br>

If this is full pcs status output, I miss stonith resource.<br>

<br></blockquote><div>I have "pcs property set stonith-enabled=false" and don't plan to use it. I want simple active-passive cluster, like Veritas or ServiceGuard with most duties automated. And our production servers have their iBMC in a locked network segment<br></div><div> </div></div></div>