<div dir="ltr"><div class="gmail_extra"><div><div class="gmail_signature"><div dir="ltr"><div><br></div></div></div></div>
<br><div class="gmail_quote">2015-10-01 9:30 GMT-04:00 Dejan Muhamedagic <span dir="ltr">&lt;<a href="mailto:dejanmm@fastmail.fm" target="_blank">dejanmm@fastmail.fm</a>&gt;</span>:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">Hi,<br>
<span class=""><br>
On Wed, Sep 30, 2015 at 02:24:32PM -0400, Luc Paulin wrote:<br>
&gt; Hi Everyone,<br>
&gt; I have experience a weird issue last night where our cluster try to<br>
&gt; failover due to an &quot;Unkown interface&quot;<br>
&gt;<br>
&gt; Look like when the IPaddr2 monitor try to perform a status on eth0, it<br>
&gt; didn&#39;t find the device. Both node are VM. I haven&#39;t found any reason as why<br>
&gt; eth0 would have &quot;disapear&quot;<br>
&gt;<br>
&gt; &lt;LOG NODE1&gt;<br>
</span>&gt; [...]<br>
<span class="">&gt; Sep 29 21:25:06 node-02 pengine[3240]:    error: unpack_rsc_op: Preventing<br>
&gt; vip_v207_174 from re-starting anywhere: operation monitor failed &#39;not<br>
&gt; configured&#39; (6)<br>
<br>
</span>The RA exits with the error code which says that the resource<br>
configuration is invalid. Hence PE won&#39;t try to start that<br>
resource again. Normally, we don&#39;t expect network interfaces to<br>
disappear, but this should probably be the &quot;not installed&quot; error,<br>
so that the resource can be started on another node. Or even the<br>
&quot;generic&quot; error in case it may be expected that interfaces can<br>
come and go. Did you figure why the interface disappeared?<br>
<br>
</blockquote><div><br></div><div>No we haven&#39;t been able to figure out why the interface disappeared. Actually it doesn&#39;t seem to have disappeared as we have no evidence that interface was gone from kernel log.  As you say this should probably have be in the &quot;not intstalled&quot; or &quot;generic&quot; error so it tries to start it on another node, but obviously, network interface that disapear is not something that we expect to see. </div><div><br></div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">Thanks,<br>
<br>
Dejan<br>
<div><div class="h5"><br>
&gt; I know that I found some post that say to run sysctl -w<br>
&gt; net.ipv4.conf.all.promote_secondaries=1 to avoid secondary nic to be remove<br>
&gt; when primary is gone, but in this case the eth0 has a single nic that is<br>
&gt; manage through IPaddr2 within crm configuration<br>
&gt;<br>
&gt; Here&#39;s the configuration or node:<br>
&gt;<br>
&gt; &lt;CONFIGURATION&gt;<br>
&gt; Cluster Name: nodecluster1<br>
&gt; Corosync Nodes:<br>
&gt;  node-01 node-02<br>
&gt; Pacemaker Nodes:<br>
&gt;  node-01 node-02<br>
&gt;<br>
&gt; Resources:<br>
&gt;  Group: lbpcivip<br>
&gt;   Resource: vip_v207_174 (class=ocf provider=heartbeat type=IPaddr2)<br>
&gt;    Attributes: ip=x.x.x.174 cidr_netmask=27 broadcast=x.x.x.191 nic=eth0<br>
&gt;    Operations: monitor interval=10s (vip_v207_174-monitor-interval-10s)<br>
&gt;   Resource: vip_v26_1 (class=ocf provider=heartbeat type=IPaddr2)<br>
&gt;    Attributes: ip=x.x.26.1<br>
&gt;    Operations: monitor interval=10s (vip_v26_1-monitor-interval-10s)<br>
&gt;   Resource: vip_v27_1 (class=ocf provider=heartbeat type=IPaddr2)<br>
&gt;    Attributes: ip=x.x.27.1<br>
&gt;    Operations: monitor interval=10s (vip_v27_1-monitor-interval-10s)<br>
&gt;   Resource: vip_v254_230 (class=ocf provider=heartbeat type=IPaddr2)<br>
&gt;    Attributes: ip=x.x.254.230<br>
&gt;    Operations: monitor interval=10s (vip_v254_230-monitor-interval-10s)<br>
&gt;   Resource: change-default-fw (class=lsb type=fwdefaultgw)<br>
&gt;    Operations: monitor interval=60s (change-default-fw-monitor-interval-60s)<br>
&gt;   Resource: fwcorp-mailto-sysadmin (class=ocf provider=heartbeat<br>
&gt; type=MailTo)<br>
&gt;    Attributes: email=<a href="mailto:its@touchtunes.com">its@touchtunes.com</a> subject=&quot;[node - Clustered<br>
&gt; services]&quot;<br>
&gt;    Operations: monitor interval=60s<br>
&gt; (fwcorp-mailto-sysadmin-monitor-interval-60s)<br>
&gt;<br>
&gt; Stonith Devices:<br>
&gt; Fencing Levels:<br>
&gt;<br>
&gt; Location Constraints:<br>
&gt; Ordering Constraints:<br>
&gt; Colocation Constraints:<br>
&gt;<br>
&gt; Cluster Properties:<br>
&gt;  cluster-infrastructure: cman<br>
&gt;  dc-version: 1.1.11-97629de<br>
&gt;  last-lrm-refresh: 1412269491<br>
&gt;  no-quorum-policy: ignore<br>
&gt;  stonith-enabled: false<br>
&gt; &lt;/CONFIGURATION&gt;<br>
&gt;<br>
&gt; Has anyone have suggestion on how I can solve this issue? Why did the<br>
&gt; failover from node1 to node2 didn&#39;t work ?<br>
&gt;<br>
&gt; If more information is require let me know, any suggestion would be<br>
&gt; appreciated!<br>
&gt;<br>
&gt; Thanx!<br>
&gt;<br>
&gt;<br>
&gt; --<br>
&gt;                          !!!!!<br>
&gt;                        ( o o )<br>
&gt;  --------------oOO----(_)----OOo--------------<br>
&gt;    Luc Paulin<br>
&gt;    email: paulinster(at)<a href="http://gmail.com" rel="noreferrer" target="_blank">gmail.com</a><br>
&gt;    Skype: paulinster<br>
<br>
</div></div>&gt; _______________________________________________<br>
&gt; Users mailing list: <a href="mailto:Users@clusterlabs.org">Users@clusterlabs.org</a><br>
&gt; <a href="http://clusterlabs.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://clusterlabs.org/mailman/listinfo/users</a><br>
&gt;<br>
&gt; Project Home: <a href="http://www.clusterlabs.org" rel="noreferrer" target="_blank">http://www.clusterlabs.org</a><br>
&gt; Getting started: <a href="http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf" rel="noreferrer" target="_blank">http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf</a><br>
&gt; Bugs: <a href="http://bugs.clusterlabs.org" rel="noreferrer" target="_blank">http://bugs.clusterlabs.org</a><br>
<br>
<br>
_______________________________________________<br>
Users mailing list: <a href="mailto:Users@clusterlabs.org">Users@clusterlabs.org</a><br>
<a href="http://clusterlabs.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://clusterlabs.org/mailman/listinfo/users</a><br>
<br>
Project Home: <a href="http://www.clusterlabs.org" rel="noreferrer" target="_blank">http://www.clusterlabs.org</a><br>
Getting started: <a href="http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf" rel="noreferrer" target="_blank">http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf</a><br>
Bugs: <a href="http://bugs.clusterlabs.org" rel="noreferrer" target="_blank">http://bugs.clusterlabs.org</a><br>
</blockquote></div><br></div></div>