<div dir="ltr">I suppose the failure is because I do not have a DC yet.<div><br></div><div><div>[root@airv_cu xml]# pcs cluster status</div><div>Cluster Status:</div><div> Stack: corosync</div><div> Current DC: NONE</div></div><div><br></div><div>Can I bring it up when I have just 1 node?</div></div><div class="gmail_extra"><br><div class="gmail_quote">On Fri, May 6, 2016 at 4:59 PM, Nikhil Utane <span dir="ltr">&lt;<a href="mailto:nikhil.subscribed@gmail.com" target="_blank">nikhil.subscribed@gmail.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr"><div><div>The command failed. </div><div>[root@airv_cu pacemaker]# cibadmin --upgrade --force</div><div>Call cib_upgrade failed (-62): Timer expired</div></div><div><br></div>I did not do any tooling. (Not even aware how to)<div><br></div><div>As I mentioned, I am cross-compiling and copying the relevant files on target platform.</div><div>In one of the earlier run pacemaker cribbed out not finding <span style="font-family:Calibri,sans-serif;font-size:11pt">/usr/share/pacemaker/</span>pacemaker-1.0.rng.</div><div><br></div><div>I found this file under xml folder in the build folder, so I copied all the files under xml folder onto the target.</div><div>Did that screw it up?</div><div><br></div><div>This is the content of the folder:</div><div><div>[root@airv_cu pacemaker]# ls /usr/share/pacemaker/</div><div>Makefile              constraints-2.1.rng   nodes-1.0.rng         pacemaker-2.1.rng     rule.rng</div><div>Makefile.am           constraints-2.2.rng   nodes-1.2.rng         pacemaker-2.2.rng     score.rng</div><div>Makefile.in           constraints-2.3.rng   nodes-1.3.rng         pacemaker-2.3.rng     status-1.0.rng</div><div>Readme.md             constraints-next.rng  nvset-1.3.rng         pacemaker-2.4.rng     tags-1.3.rng</div><div>acls-1.2.rng          context-of.xsl        nvset.rng             pacemaker-next.rng    upgrade-1.3.xsl</div><div>acls-2.0.rng          crm-transitional.dtd  ocf-meta2man.xsl      pacemaker.rng         upgrade06.xsl</div><div>best-match.sh         crm.dtd               options-1.0.rng       <a href="http://regression.core.sh" target="_blank">regression.core.sh</a>    versions.rng</div><div>cib-1.0.rng           crm.xsl               pacemaker-1.0.rng     regression.sh</div><div>cib-1.2.rng           crm_mon.rng           pacemaker-1.2.rng     resources-1.0.rng</div><div>constraints-1.0.rng   fencing-1.2.rng       pacemaker-1.3.rng     resources-1.2.rng</div><div>constraints-1.2.rng   fencing-2.4.rng       pacemaker-2.0.rng     resources-1.3.rng</div></div><div><br></div><div>-Regards</div><span class="HOEnZb"><font color="#888888"><div>Nikhil</div></font></span></div><div class="HOEnZb"><div class="h5"><div class="gmail_extra"><br><div class="gmail_quote">On Fri, May 6, 2016 at 4:41 PM, Klaus Wenninger <span dir="ltr">&lt;<a href="mailto:kwenning@redhat.com" target="_blank">kwenning@redhat.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div><div>On 05/06/2016 12:40 PM, Nikhil Utane wrote:<br>
&gt; Hi,<br>
&gt;<br>
&gt; I used the blackbox feature which showed the reason for failure.<br>
&gt; As I am cross-compiling pacemaker on a build machine and later moving<br>
&gt; the binaries to the target, few binaries were missing. After fixing<br>
&gt; that and bunch of other errors/warning, I am able to get pacemaker<br>
&gt; started though not completely running fine.<br>
&gt;<br>
&gt; The node is not getting added:<br>
&gt; airv_cu        cib:    error: xml_log:Element node failed to validate<br>
&gt; attributes<br>
&gt;<br>
&gt; I suppose it is because of this error:<br>
&gt; crmd:    error: node_list_update_callback:Node update 4 failed: Update<br>
&gt; does not conform to the configured schema (-203)<br>
&gt;<br>
&gt; I am suspecting this is caused because of<br>
&gt; validate-with=&quot;pacemaker-0.7&quot; in the cib. In another installation this<br>
&gt; is being set to &#39;&quot;pacemaker-2.0&quot;&#39;<br>
&gt;<br>
&gt; [root@airv_cu pacemaker]# pcs cluster cib<br>
&gt; &lt;cib crm_feature_set=&quot;3.0.10&quot; validate-with=&quot;pacemaker-0.7&quot; epoch=&quot;3&quot;<br>
&gt; num_updates=&quot;0&quot; admin_epoch=&quot;0&quot; cib-last-written=&quot;Fri May  6 09:28:10<br>
&gt; 2016&quot; have-quorum=&quot;1&quot;&gt;<br>
&gt;   &lt;configuration&gt;<br>
&gt;     &lt;crm_config&gt;<br>
&gt;       &lt;cluster_property_set id=&quot;cib-bootstrap-options&quot;&gt;<br>
&gt;         &lt;nvpair id=&quot;cib-bootstrap-options-have-watchdog&quot;<br>
&gt; name=&quot;have-watchdog&quot; value=&quot;true&quot;/&gt;<br>
&gt;         &lt;nvpair id=&quot;cib-bootstrap-options-dc-version&quot;<br>
&gt; name=&quot;dc-version&quot; value=&quot;1.1.14-5a6cdd1&quot;/&gt;<br>
&gt;         &lt;nvpair id=&quot;cib-bootstrap-options-cluster-infrastructure&quot;<br>
&gt; name=&quot;cluster-infrastructure&quot; value=&quot;corosync&quot;/&gt;<br>
&gt;       &lt;/cluster_property_set&gt;<br>
&gt;     &lt;/crm_config&gt;<br>
&gt;     &lt;nodes/&gt;<br>
&gt;     &lt;resources/&gt;<br>
&gt;     &lt;constraints/&gt;<br>
&gt;   &lt;/configuration&gt;<br>
&gt;   &lt;status/&gt;<br>
&gt; &lt;/cib&gt;<br>
&gt;<br>
&gt; Any idea why/where this is being set to 0.7. I am using latest<br>
&gt; pacemaker from GitHub.<br>
<br>
</div></div>What kind of tooling did you use to create the cib?<br>
Try &#39;cibadmin --upgrade&#39;. That should set the cib-version to what your<br>
pacemaker-version supports.<br>
<span><br>
&gt;<br>
&gt; [root@airv_cu pacemaker]# pacemakerd --version<br>
&gt; Pacemaker 1.1.14<br>
&gt; Written by Andrew Beekhof<br>
&gt;<br>
&gt; Attaching the corosync.log and corosync.conf file.<br>
&gt;<br>
&gt; -Thanks<br>
&gt; Nikhil<br>
&gt;<br>
&gt;<br>
&gt; On Thu, May 5, 2016 at 10:21 PM, Ken Gaillot &lt;<a href="mailto:kgaillot@redhat.com" target="_blank">kgaillot@redhat.com</a><br>
</span><div><div>&gt; &lt;mailto:<a href="mailto:kgaillot@redhat.com" target="_blank">kgaillot@redhat.com</a>&gt;&gt; wrote:<br>
&gt;<br>
&gt;     On 05/05/2016 11:25 AM, Nikhil Utane wrote:<br>
&gt;     &gt; Thanks Ken for your quick response as always.<br>
&gt;     &gt;<br>
&gt;     &gt; But what if I don&#39;t want to use quorum? I just want to bring up<br>
&gt;     &gt; pacemaker + corosync on 1 node to check that it all comes up fine.<br>
&gt;     &gt; I added corosync_votequorum as you suggested. Additionally I<br>
&gt;     also added<br>
&gt;     &gt; these 2 lines:<br>
&gt;     &gt;<br>
&gt;     &gt; expected_votes: 2<br>
&gt;     &gt; two_node: 1<br>
&gt;<br>
&gt;     There&#39;s actually nothing wrong with configuring a single-node cluster.<br>
&gt;     You can list just one node in corosync.conf and leave off the above.<br>
&gt;<br>
&gt;     &gt; However still pacemaker is not able to run.<br>
&gt;<br>
&gt;     There must be other issues involved. Even if pacemaker doesn&#39;t have<br>
&gt;     quorum, it will still run, it just won&#39;t start resources.<br>
&gt;<br>
&gt;     &gt; [root@airv_cu root]# pcs cluster start<br>
&gt;     &gt; Starting Cluster...<br>
&gt;     &gt; Starting Pacemaker Cluster Manager[FAILED]<br>
&gt;     &gt;<br>
&gt;     &gt; Error: unable to start pacemaker<br>
&gt;     &gt;<br>
&gt;     &gt; Corosync.log:<br>
&gt;     &gt; *May 05 16:15:20 [16294] airv_cu pacemakerd:     info:<br>
&gt;     &gt; pcmk_quorum_notification: Membership 240: quorum still lost (1)*<br>
&gt;     &gt; May 05 16:15:20 [16259] airv_cu corosync debug   [QB    ] Free&#39;ing<br>
&gt;     &gt; ringbuffer: /dev/shm/qb-cmap-request-16259-16294-21-header<br>
&gt;     &gt; May 05 16:15:20 [16294] airv_cu pacemakerd:   notice:<br>
&gt;     &gt; crm_update_peer_state_iter:       pcmk_quorum_notification: Node<br>
&gt;     &gt; airv_cu[181344357] - state is now member (was (null))<br>
&gt;     &gt; May 05 16:15:20 [16294] airv_cu pacemakerd:     info:<br>
&gt;     &gt; pcmk_cpg_membership:      Node 181344357 joined group pacemakerd<br>
&gt;     &gt; (counter=0.0)<br>
&gt;     &gt; May 05 16:15:20 [16294] airv_cu pacemakerd:     info:<br>
&gt;     &gt; pcmk_cpg_membership:      Node 181344357 still member of group<br>
&gt;     &gt; pacemakerd (peer=airv_cu, counter=0.0)<br>
&gt;     &gt; May 05 16:15:20 [16294] airv_cu pacemakerd:  warning:<br>
&gt;     pcmk_child_exit:<br>
&gt;     &gt;  The cib process (16353) can no longer be respawned, shutting the<br>
&gt;     &gt; cluster down.<br>
&gt;     &gt; May 05 16:15:20 [16294] airv_cu pacemakerd:   notice:<br>
&gt;     &gt; pcmk_shutdown_worker:     Shutting down Pacemaker<br>
&gt;     &gt;<br>
&gt;     &gt; The log and conf file is attached.<br>
&gt;     &gt;<br>
&gt;     &gt; -Regards<br>
&gt;     &gt; Nikhil<br>
&gt;     &gt;<br>
&gt;     &gt; On Thu, May 5, 2016 at 8:04 PM, Ken Gaillot &lt;<a href="mailto:kgaillot@redhat.com" target="_blank">kgaillot@redhat.com</a><br>
&gt;     &lt;mailto:<a href="mailto:kgaillot@redhat.com" target="_blank">kgaillot@redhat.com</a>&gt;<br>
</div></div><div><div>&gt;     &gt; &lt;mailto:<a href="mailto:kgaillot@redhat.com" target="_blank">kgaillot@redhat.com</a> &lt;mailto:<a href="mailto:kgaillot@redhat.com" target="_blank">kgaillot@redhat.com</a>&gt;&gt;&gt; wrote:<br>
&gt;     &gt;<br>
&gt;     &gt;     On 05/05/2016 08:36 AM, Nikhil Utane wrote:<br>
&gt;     &gt;     &gt; Hi,<br>
&gt;     &gt;     &gt;<br>
&gt;     &gt;     &gt; Continuing with my adventure to run Pacemaker &amp; Corosync<br>
&gt;     on our<br>
&gt;     &gt;     &gt; big-endian system, I managed to get past the corosync<br>
&gt;     issue for now. But<br>
&gt;     &gt;     &gt; facing an issue in running Pacemaker.<br>
&gt;     &gt;     &gt;<br>
&gt;     &gt;     &gt; Seeing following messages in corosync.log.<br>
&gt;     &gt;     &gt;  pacemakerd:  warning: pcmk_child_exit:  The cib process<br>
&gt;     (20000) can no<br>
&gt;     &gt;     &gt; longer be respawned, shutting the cluster down.<br>
&gt;     &gt;     &gt;  pacemakerd:  warning: pcmk_child_exit:  The stonith-ng<br>
&gt;     process (20001)<br>
&gt;     &gt;     &gt; can no longer be respawned, shutting the cluster down.<br>
&gt;     &gt;     &gt;  pacemakerd:  warning: pcmk_child_exit:  The lrmd process<br>
&gt;     (20002) can no<br>
&gt;     &gt;     &gt; longer be respawned, shutting the cluster down.<br>
&gt;     &gt;     &gt;  pacemakerd:  warning: pcmk_child_exit:  The attrd process<br>
&gt;     (20003) can<br>
&gt;     &gt;     &gt; no longer be respawned, shutting the cluster down.<br>
&gt;     &gt;     &gt;  pacemakerd:  warning: pcmk_child_exit:  The pengine<br>
&gt;     process (20004) can<br>
&gt;     &gt;     &gt; no longer be respawned, shutting the cluster down.<br>
&gt;     &gt;     &gt;  pacemakerd:  warning: pcmk_child_exit:  The crmd process<br>
&gt;     (20005) can no<br>
&gt;     &gt;     &gt; longer be respawned, shutting the cluster down.<br>
&gt;     &gt;     &gt;<br>
&gt;     &gt;     &gt; I see following error before these messages. Not sure if<br>
&gt;     this is the cause.<br>
&gt;     &gt;     &gt; May 05 11:26:24 [19998] airv_cu pacemakerd:    error:<br>
&gt;     &gt;     &gt; cluster_connect_quorum:   Corosync quorum is not configured<br>
&gt;     &gt;     &gt;<br>
&gt;     &gt;     &gt; I tried removing the quorum block (which is anyways blank)<br>
&gt;     from the conf<br>
&gt;     &gt;     &gt; file but still had the same error.<br>
&gt;     &gt;<br>
&gt;     &gt;     Yes, that is the issue. Pacemaker can&#39;t do anything if it<br>
&gt;     can&#39;t ask<br>
&gt;     &gt;     corosync about quorum. I don&#39;t know what the issue is at the<br>
&gt;     corosync<br>
&gt;     &gt;     level, but your corosync.conf should have:<br>
&gt;     &gt;<br>
&gt;     &gt;     quorum {<br>
&gt;     &gt;         provider: corosync_votequorum<br>
&gt;     &gt;     }<br>
&gt;     &gt;<br>
&gt;     &gt;<br>
&gt;     &gt;     &gt; Attaching the log and conf files. Please let me know if<br>
&gt;     there is any<br>
&gt;     &gt;     &gt; obvious mistake or how to investigate it further.<br>
&gt;     &gt;     &gt;<br>
&gt;     &gt;     &gt; I am using pcs cluster start command to start the cluster<br>
&gt;     &gt;     &gt;<br>
&gt;     &gt;     &gt; -Thanks<br>
&gt;     &gt;     &gt; Nikhil<br>
&gt;<br>
&gt;<br>
&gt;<br>
&gt;<br>
</div></div><div><div>&gt; _______________________________________________<br>
&gt; Users mailing list: <a href="mailto:Users@clusterlabs.org" target="_blank">Users@clusterlabs.org</a><br>
&gt; <a href="http://clusterlabs.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://clusterlabs.org/mailman/listinfo/users</a><br>
&gt;<br>
&gt; Project Home: <a href="http://www.clusterlabs.org" rel="noreferrer" target="_blank">http://www.clusterlabs.org</a><br>
&gt; Getting started: <a href="http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf" rel="noreferrer" target="_blank">http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf</a><br>
&gt; Bugs: <a href="http://bugs.clusterlabs.org" rel="noreferrer" target="_blank">http://bugs.clusterlabs.org</a><br>
<br>
<br>
_______________________________________________<br>
Users mailing list: <a href="mailto:Users@clusterlabs.org" target="_blank">Users@clusterlabs.org</a><br>
<a href="http://clusterlabs.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://clusterlabs.org/mailman/listinfo/users</a><br>
<br>
Project Home: <a href="http://www.clusterlabs.org" rel="noreferrer" target="_blank">http://www.clusterlabs.org</a><br>
Getting started: <a href="http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf" rel="noreferrer" target="_blank">http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf</a><br>
Bugs: <a href="http://bugs.clusterlabs.org" rel="noreferrer" target="_blank">http://bugs.clusterlabs.org</a><br>
</div></div></blockquote></div><br></div>
</div></div></blockquote></div><br></div>