[ClusterLabs] Pacemaker fatal shutdown

Priyanka Balotra priyanka.14balotra at gmail.com
Thu Jul 20 03:13:56 EDT 2023


What I mainly want to understand is that:
- why "fatal failure" is coming
- why does pacemaker not start on the node after a node boots followed by
"pacemaker fatal failure" .
- How can this be handled?

Thanks
Priyanka

On Thu, Jul 20, 2023 at 12:41 PM Priyanka Balotra <
priyanka.14balotra at gmail.com> wrote:

> Hi,
>
> Here are FILE-6 logs:
>
> 65710:Jul 17 14:16:51.517 FILE-6 pacemaker-controld  [19415]
> (throttle_mode)    debug: Current load is 0.760000 across 10 core(s)
> 65711:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (throttle_update)  debug: Node FILE-2 has negligible load and supports at
> most 20 jobs; new job limit 20
> 65712:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (handle_request)   debug: The throttle changed. Trigger a graph.
> 65713:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x00020000 (new_actions)
> for controller set by s_crmd_fsa:198
> 65714:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415] (s_crmd_fsa)
>       debug: Processing I_JOIN_REQUEST: [ state=S_INTEGRATION
> cause=C_HA_MESSAGE origin=route_message ]
> 65715:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x00020000 (an_action)
> for controller cleared by do_fsa_action:108
> 65716:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (do_dc_join_filter_offer)  debug: Accepting join-1 request from FILE-2 |
> ref=join_request-crmd-1689603392-8
> 65717:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (pcmk__update_peer_expected)       info: do_dc_join_filter_offer: Node
> FILE-2[2] - expected state is now member (was (null))
> 65718:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (do_dc_join_filter_offer)  debug: 2 nodes currently integrated in join-1
> 65719:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (check_join_state)         debug: join-1: Integration of 2 peers complete |
> state=S_INTEGRATION for=do_dc_join_filter_offer
> 65720:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x00040000 (new_actions)
> for controller set by s_crmd_fsa:198
> 65721:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415] (s_crmd_fsa)
>       debug: Processing I_INTEGRATED: [ state=S_INTEGRATION
> cause=C_FSA_INTERNAL origin=check_join_state ]
> 65722:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (do_state_transition)      info: State transition S_INTEGRATION ->
> S_FINALIZE_JOIN | input=I_INTEGRATED cause=C_FSA_INTERNAL
> origin=check_join_state
> 65723:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x00000020
> (A_INTEGRATE_TIMER_STOP) for controller set by do_state_transition:559
> 65724:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x00000040
> (A_FINALIZE_TIMER_START) for controller set by do_state_transition:563
> 65725:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x00000200
> (A_DC_TIMER_STOP) for controller set by do_state_transition:569
> 65726:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (do_state_transition)      debug: All cluster nodes (2) responded to join
> offer
> 65727:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x00000200 (an_action)
> for controller cleared by do_fsa_action:108
> 65728:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x00000020 (an_action)
> for controller cleared by do_fsa_action:108
> 65729:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x00000040 (an_action)
> for controller cleared by do_fsa_action:108
> 65730:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (controld_start_timer)     debug: Started Finalization Timer (inject
> I_ELECTION if pops after 1800000ms, source=119)
> 65731:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x00040000 (an_action)
> for controller cleared by do_fsa_action:108
> 65732:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (do_dc_join_finalize)      debug: Finalizing join-1 for 2 nodes (sync'ing
> from local CIB)
> 65733:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (do_dc_join_finalize)      debug: Requested CIB version   <generation_tuple
> crm_feature_set="3.11.0" validate-with="pacemaker-3.7" epoch="24"
> num_updates="72" admin_epoch="0" cib-last-written="Thu Jul 13 13:11:46
> 2023" update-origin="FILE-1" update-client="cibadmin" update-user="root"
> have-quorum="1" dc-uuid="6"/>
> 65734:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (crmd_join_phase_log)      debug: join-1: FILE-6=integrated
> 65735:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (crmd_join_phase_log)      debug: join-1: FILE-2=integrated
> 65736:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (crmd_join_phase_log)      debug: join-1: FILE-3=confirmed
> 65737:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (crmd_join_phase_log)      debug: join-1: FILE-1=none
> 65738:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (crmd_join_phase_log)      debug: join-1: FILE-5=confirmed
> 65739:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (crmd_join_phase_log)      debug: join-1: FILE-4=confirmed
> 65740:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x1000000000000
> (fsa_data->actions) for controller set by s_crmd_fsa:193
> 65741:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x1000000000000000
> (new_actions) for controller set by s_crmd_fsa:198
> 65742:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415] (s_crmd_fsa)
>       debug: Processing I_WAIT_FOR_EVENT: [ state=S_FINALIZE_JOIN
> cause=C_HA_MESSAGE origin=do_te_invoke ]
> 65743:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x1000000000000000
> (an_action) for controller cleared by do_fsa_action:108
> 65744:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415] (do_log)
> info: Input I_WAIT_FOR_EVENT received in state S_FINALIZE_JOIN from
> do_te_invoke
> 65745:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415] (do_log)
> debug: do_log   <create_request_adv origin="do_cl_join_query" t="crmd"
> version="3.11.0" subt="request" reference="join_announce-crmd-1689603376-2"
> crm_task="join_announce" crm_sys_to="dc" crm_sys_from="crmd" src="FILE-1"/>
> 65746:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x1000000000000
> (an_action) for controller cleared by do_fsa_action:108
> 65747:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (abort_transition_graph)   info: Transition 0 aborted: Peer Halt |
> source=do_te_invoke:135 complete=false
> 65748:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (register_fsa_input_adv)   debug: Stalling the FSA pending further input:
> source=do_te_invoke cause=C_HA_MESSAGE data=0x55c6194ed4c0 queue=0
> 65749:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415] (s_crmd_fsa)
>       debug: Exiting the FSA: queue=1, fsa_actions=0x0, stalled=true
> 65750:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (fsa_dump_queue)   debug: queue[0.72]: input I_WAIT_FOR_EVENT raised by
> do_te_invoke(0x55c619869580.1)   (cause=C_HA_MESSAGE)
> 65751:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x1000000000000
> (fsa_data->actions) for controller set by s_crmd_fsa:193
> 65752:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x1000000000000000
> (new_actions) for controller set by s_crmd_fsa:198
> 65753:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415] (s_crmd_fsa)
>       debug: Processing I_WAIT_FOR_EVENT: [ state=S_FINALIZE_JOIN
> cause=C_HA_MESSAGE origin=do_te_invoke ]
> 65754:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x1000000000000000
> (an_action) for controller cleared by do_fsa_action:108
> 65755:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415] (do_log)
> info: Input I_WAIT_FOR_EVENT received in state S_FINALIZE_JOIN from
> do_te_invoke
> 65756:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415] (do_log)
> debug: do_log   <create_request_adv origin="do_cl_join_query" t="crmd"
> version="3.11.0" subt="request" reference="join_announce-crmd-1689603376-2"
> crm_task="join_announce" crm_sys_to="dc" crm_sys_from="crmd" src="FILE-1"/>
> 65757:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x1000000000000
> (an_action) for controller cleared by do_fsa_action:108
> 65758:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (abort_transition_graph)   info: Transition 0 aborted: Peer Halt |
> source=do_te_invoke:135 complete=false
> 65759:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (register_fsa_input_adv)   debug: Stalling the FSA pending further input:
> source=do_te_invoke cause=C_HA_MESSAGE data=0x55c619869580 queue=0
> 65760:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415] (s_crmd_fsa)
>       debug: Exiting the FSA: queue=1, fsa_actions=0x0, stalled=true
> 65761:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (fsa_dump_queue)   debug: queue[0.73]: input I_WAIT_FOR_EVENT raised by
> do_te_invoke(0x55c6194ed4c0.1)   (cause=C_HA_MESSAGE)
> 65762:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (pcmk__execute_graph)      debug: Transition 0 (Complete=33, Pending=2,
> Fired=0, Skipped=0, Incomplete=24,
> Source=/var/lib/pacemaker/pengine/pe-warn-0.bz2): In progress
> 65764:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (check_join_state)         debug: join-1: Still waiting on 2 integrated
> nodes | state=S_FINALIZE_JOIN for=finalize_sync_callback
> 65765:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (crmd_join_phase_log)      debug: join-1: FILE-6=integrated
> 65766:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (crmd_join_phase_log)      debug: join-1: FILE-2=integrated
> 65767:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (crmd_join_phase_log)      debug: join-1: FILE-3=confirmed
> 65768:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (crmd_join_phase_log)      debug: join-1: FILE-1=none
> 65769:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (crmd_join_phase_log)      debug: join-1: FILE-5=confirmed
> 65770:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (crmd_join_phase_log)      debug: join-1: FILE-4=confirmed
> 65771:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (finalize_sync_callback)   debug: Notifying 2 nodes of join-1 results
> 65772:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (finalize_join_for)        debug: Acknowledging join-1 request from FILE-6
> 65773:Jul 17 14:16:55.085 FILE-6 pacemaker-controld  [19415]
> (finalize_join_for)        debug: Acknowledging join-1 request from FILE-2
> 65776:Jul 17 14:16:55.093 FILE-6 pacemaker-controld  [19415]
> (handle_request)   debug: Raising I_JOIN_RESULT: join-1
> 65777:Jul 17 14:16:55.093 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x1000000000000
> (fsa_data->actions) for controller set by s_crmd_fsa:193
> 65778:Jul 17 14:16:55.093 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x1000000000000000
> (new_actions) for controller set by s_crmd_fsa:198
> 65779:Jul 17 14:16:55.093 FILE-6 pacemaker-controld  [19415] (s_crmd_fsa)
>       debug: Processing I_WAIT_FOR_EVENT: [ state=S_FINALIZE_JOIN
> cause=C_HA_MESSAGE origin=do_te_invoke ]
> 65780:Jul 17 14:16:55.093 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x1000000000000000
> (an_action) for controller cleared by do_fsa_action:108
> 65781:Jul 17 14:16:55.093 FILE-6 pacemaker-controld  [19415] (do_log)
> info: Input I_WAIT_FOR_EVENT received in state S_FINALIZE_JOIN from
> do_te_invoke
> 65782:Jul 17 14:16:55.093 FILE-6 pacemaker-controld  [19415] (do_log)
> debug: do_log   <create_request_adv origin="do_cl_join_query" t="crmd"
> version="3.11.0" subt="request" reference="join_announce-crmd-1689603376-2"
> crm_task="join_announce" crm_sys_to="dc" crm_sys_from="crmd" src="FILE-1"/>
> 65783:Jul 17 14:16:55.093 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x1000000000000
> (an_action) for controller cleared by do_fsa_action:108
> 65784:Jul 17 14:16:55.093 FILE-6 pacemaker-controld  [19415]
> (abort_transition_graph)   info: Transition 0 aborted: Peer Halt |
> source=do_te_invoke:135 complete=false
> 65785:Jul 17 14:16:55.093 FILE-6 pacemaker-controld  [19415]
> (register_fsa_input_adv)   debug: Stalling the FSA pending further input:
> source=do_te_invoke cause=C_HA_MESSAGE data=0x55c6194ed4c0 queue=1
> 65786:Jul 17 14:16:55.093 FILE-6 pacemaker-controld  [19415] (s_crmd_fsa)
>       debug: Exiting the FSA: queue=2, fsa_actions=0x0, stalled=true
> 65787:Jul 17 14:16:55.093 FILE-6 pacemaker-controld  [19415]
> (fsa_dump_queue)   debug: queue[0.74]: input I_JOIN_RESULT raised by
> route_message(0x55c619861a90.1)     (cause=C_HA_MESSAGE)
> 65788:Jul 17 14:16:55.093 FILE-6 pacemaker-controld  [19415]
> (fsa_dump_queue)   debug: queue[1.75]: input I_WAIT_FOR_EVENT raised by
> do_te_invoke(0x55c61986ed80.1)   (cause=C_HA_MESSAGE)
> 65789:Jul 17 14:16:55.093 FILE-6 pacemaker-controld  [19415]
> (pcmk__execute_graph)      debug: Transition 0 (Complete=33, Pending=2,
> Fired=0, Skipped=0, Incomplete=24,
> Source=/var/lib/pacemaker/pengine/pe-warn-0.bz2): In progress
> 65792:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x00880000 (new_actions)
> for controller set by s_crmd_fsa:198
> 65793:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415] (s_crmd_fsa)
>       debug: Processing I_JOIN_RESULT: [ state=S_FINALIZE_JOIN
> cause=C_HA_MESSAGE origin=route_message ]
> 65794:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x00800000 (an_action)
> for controller cleared by do_fsa_action:108
> 65795:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__create_history_xml)         debug: build_active_RAs: Updating
> resource stonith-sbd after monitor op complete (interval=0)
> 65796:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__create_history_xml)         debug: build_active_RAs: Updating
> resource FILE_Filesystem after monitor op complete (interval=0)
> 65797:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__create_history_xml)         debug: build_active_RAs: Updating
> resource Service_pfile after monitor op complete (interval=0)
> 65798:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__create_history_xml)         debug: build_active_RAs: Updating
> resource Service_Postgresql after monitor op complete (interval=0)
> 65799:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__create_history_xml)         debug: build_active_RAs: Updating
> resource Service_esm_primary after monitor op complete (interval=0)
> 65800:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__create_history_xml)         debug: build_active_RAs: Updating
> resource Service_Postgrest after monitor op complete (interval=0)
> 65801:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__create_history_xml)         debug: build_active_RAs: Updating
> resource IP_Floating after monitor op complete (interval=0)
> 65802:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__create_history_xml)         debug: build_active_RAs: Updating
> resource Shared_Cluster_Backup after monitor op complete (interval=0)
> 65803:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (do_cl_join_finalize_respond)      debug: Confirming join-1: sending local
> operation history to FILE-6
> 65804:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x00080000 (an_action)
> for controller cleared by do_fsa_action:108
> 65805:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (do_dc_join_ack)   debug: Ignoring 'join_ack_nack' message from FILE-6
> while waiting for 'join_confirm'
> 65806:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x1000000000000
> (fsa_data->actions) for controller set by s_crmd_fsa:193
> 65807:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x1000000000000000
> (new_actions) for controller set by s_crmd_fsa:198
> 65808:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415] (s_crmd_fsa)
>       debug: Processing I_WAIT_FOR_EVENT: [ state=S_FINALIZE_JOIN
> cause=C_HA_MESSAGE origin=do_te_invoke ]
> 65809:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x1000000000000000
> (an_action) for controller cleared by do_fsa_action:108
> 65810:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415] (do_log)
> info: Input I_WAIT_FOR_EVENT received in state S_FINALIZE_JOIN from
> do_te_invoke
> 65811:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415] (do_log)
> debug: do_log   <create_request_adv origin="do_cl_join_query" t="crmd"
> version="3.11.0" subt="request" reference="join_announce-crmd-1689603376-2"
> crm_task="join_announce" crm_sys_to="dc" crm_sys_from="crmd" src="FILE-1"/>
> 65812:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x1000000000000
> (an_action) for controller cleared by do_fsa_action:108
> 65813:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (abort_transition_graph)   info: Transition 0 aborted: Peer Halt |
> source=do_te_invoke:135 complete=false
> 65814:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (register_fsa_input_adv)   debug: Stalling the FSA pending further input:
> source=do_te_invoke cause=C_HA_MESSAGE data=0x55c61986ed80 queue=1
> 65815:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415] (s_crmd_fsa)
>       debug: Exiting the FSA: queue=2, fsa_actions=0x0, stalled=true
> 65816:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (fsa_dump_queue)   debug: queue[0.76]: input I_JOIN_RESULT raised by
> route_message(0x55c619871630.1)     (cause=C_HA_MESSAGE)
> 65817:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (fsa_dump_queue)   debug: queue[1.77]: input I_WAIT_FOR_EVENT raised by
> do_te_invoke(0x55c619861a90.1)   (cause=C_HA_MESSAGE)
> 65818:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__execute_graph)      debug: Transition 0 (Complete=33, Pending=2,
> Fired=0, Skipped=0, Incomplete=24,
> Source=/var/lib/pacemaker/pengine/pe-warn-0.bz2): In progress
> 65821:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x00880000 (new_actions)
> for controller set by s_crmd_fsa:198
> 65822:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415] (s_crmd_fsa)
>       debug: Processing I_JOIN_RESULT: [ state=S_FINALIZE_JOIN
> cause=C_HA_MESSAGE origin=route_message ]
> 65823:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x00800000 (an_action)
> for controller cleared by do_fsa_action:108
> 65824:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x00080000 (an_action)
> for controller cleared by do_fsa_action:108
> 65825:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (controld_delete_node_state)       info: Deleting resource history for node
> FILE-2 (via CIB call 71) | xpath=//node_state[@uname='FILE-2']/lrm
> 65826:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (do_dc_join_ack)   debug: Updating node history for FILE-2 from join-1
> confirmation (via CIB call 72)
> 65827:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x1000000000000
> (fsa_data->actions) for controller set by s_crmd_fsa:193
> 65828:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x1000000000000000
> (new_actions) for controller set by s_crmd_fsa:198
> 65829:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415] (s_crmd_fsa)
>       debug: Processing I_WAIT_FOR_EVENT: [ state=S_FINALIZE_JOIN
> cause=C_HA_MESSAGE origin=do_te_invoke ]
> 65830:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x1000000000000000
> (an_action) for controller cleared by do_fsa_action:108
> 65831:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415] (do_log)
> info: Input I_WAIT_FOR_EVENT received in state S_FINALIZE_JOIN from
> do_te_invoke
> 65832:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415] (do_log)
> debug: do_log   <create_request_adv origin="do_cl_join_query" t="crmd"
> version="3.11.0" subt="request" reference="join_announce-crmd-1689603376-2"
> crm_task="join_announce" crm_sys_to="dc" crm_sys_from="crmd" src="FILE-1"/>
> 65833:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x1000000000000
> (an_action) for controller cleared by do_fsa_action:108
> 65834:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (abort_transition_graph)   info: Transition 0 aborted: Peer Halt |
> source=do_te_invoke:135 complete=false
> 65835:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (register_fsa_input_adv)   debug: Stalling the FSA pending further input:
> source=do_te_invoke cause=C_HA_MESSAGE data=0x55c619861a90 queue=1
> 65836:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415] (s_crmd_fsa)
>       debug: Exiting the FSA: queue=2, fsa_actions=0x0, stalled=true
> 65837:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (fsa_dump_queue)   debug: queue[0.78]: input I_JOIN_RESULT raised by
> route_message(0x55c6198798d0.1)     (cause=C_HA_MESSAGE)
> 65838:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (fsa_dump_queue)   debug: queue[1.79]: input I_WAIT_FOR_EVENT raised by
> do_te_invoke(0x55c619871630.1)   (cause=C_HA_MESSAGE)
> 65839:Jul 17 14:16:55.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__execute_graph)      debug: Transition 0 (Complete=33, Pending=2,
> Fired=0, Skipped=0, Incomplete=24,
> Source=/var/lib/pacemaker/pengine/pe-warn-0.bz2): In progress
> 65851:Jul 17 14:16:55.109 FILE-6 pacemaker-controld  [19415]
> (cib_delete_callback)      debug: Deletion of resource history for node
> FILE-2 (via CIB call 71) succeeded
> 65861:Jul 17 14:16:55.109 FILE-6 pacemaker-controld  [19415]
> (te_update_diff)   debug: Processing (cib_modify) diff: 0.24.72 -> 0.24.73
> (S_FINALIZE_JOIN)
> 65862:Jul 17 14:16:55.109 FILE-6 pacemaker-controld  [19415]
> (join_update_complete_callback)    debug: join-1 node history update (via
> CIB call 72) complete
> 65863:Jul 17 14:16:55.109 FILE-6 pacemaker-controld  [19415]
> (check_join_state)         debug: join-1: Still waiting on 1 finalized node
> | state=S_FINALIZE_JOIN for=join_update_complete_callback
> 65864:Jul 17 14:16:55.109 FILE-6 pacemaker-controld  [19415]
> (crmd_join_phase_log)      debug: join-1: FILE-6=finalized
> 65865:Jul 17 14:16:55.109 FILE-6 pacemaker-controld  [19415]
> (crmd_join_phase_log)      debug: join-1: FILE-2=confirmed
> 65866:Jul 17 14:16:55.109 FILE-6 pacemaker-controld  [19415]
> (crmd_join_phase_log)      debug: join-1: FILE-3=confirmed
> 65867:Jul 17 14:16:55.109 FILE-6 pacemaker-controld  [19415]
> (crmd_join_phase_log)      debug: join-1: FILE-1=none
> 65868:Jul 17 14:16:55.109 FILE-6 pacemaker-controld  [19415]
> (crmd_join_phase_log)      debug: join-1: FILE-5=confirmed
> 65869:Jul 17 14:16:55.109 FILE-6 pacemaker-controld  [19415]
> (crmd_join_phase_log)      debug: join-1: FILE-4=confirmed
> 65876:Jul 17 14:17:21.517 FILE-6 pacemaker-controld  [19415]
> (throttle_cib_load)        debug: cib load: 0.001000 (3 ticks in 30s)
> 65877:Jul 17 14:17:21.517 FILE-6 pacemaker-controld  [19415]
> (throttle_mode)    debug: Current load is 0.960000 across 10 core(s)
> 65878:Jul 17 14:17:51.517 FILE-6 pacemaker-controld  [19415]
> (throttle_cib_load)        debug: cib load: 0.000333 (1 ticks in 30s)
> 65879:Jul 17 14:17:51.517 FILE-6 pacemaker-controld  [19415]
> (throttle_mode)    debug: Current load is 0.580000 across 10 core(s)
> 65883:Jul 17 14:18:20.085 FILE-6 pacemaker-fenced    [19411]
> (process_remote_stonith_exec)      debug: Finalizing action 'reboot'
> targeting FILE-2 on behalf of pacemaker-controld.19415 at FILE-6: OK | rc=0
> id=4e523b34
> 65884:Jul 17 14:18:20.085 FILE-6 pacemaker-fenced    [19411]
> (remote_op_done)   notice: Operation 'reboot' targeting FILE-2 by FILE-4
> for pacemaker-controld.19415 at FILE-6: OK | id=4e523b34
> 65886:Jul 17 14:18:20.085 FILE-6 pacemaker-controld  [19415]
> (tengine_stonith_callback)         notice: Stonith operation
> 3/63:0:0:232e6505-2e98-4a79-b6ce-5f26d9cba645: OK (0)
> 65887:Jul 17 14:18:20.085 FILE-6 pacemaker-controld  [19415]
> (tengine_stonith_callback)         info: Stonith operation 3 for FILE-2
> passed
> 65888:Jul 17 14:18:20.085 FILE-6 pacemaker-controld  [19415]
> (pcmk__update_peer_expected)       info: crmd_peer_down: Node FILE-2[2] -
> expected state is now down (was member)
> 65889:Jul 17 14:18:20.085 FILE-6 pacemaker-controld  [19415]
> (send_stonith_update)      debug: Sending fencing update 73 for FILE-2
> 65890:Jul 17 14:18:20.085 FILE-6 pacemaker-controld  [19415]
> (controld_delete_node_state)       info: Deleting all state for node FILE-2
> (via CIB call 74) | xpath=//node_state[@uname='FILE-2']/*
> 65892:Jul 17 14:18:20.089 FILE-6 pacemaker-controld  [19415]
> (exec_alert_list)  info: Sending fencing alert via pf-ha-alert to (null)
> 65896:Jul 17 14:18:20.089 FILE-6 pacemaker-controld  [19415]
> (tengine_stonith_notify)   notice: Peer FILE-2 was terminated (reboot) by
> FILE-4 on behalf of pacemaker-controld.19415: OK | initiator=FILE-6
> ref=4e523b34-dcb1-40bc-a296-5e984b4e6b00
> 65897:Jul 17 14:18:20.089 FILE-6 pacemaker-controld  [19415]
> (send_stonith_update)      debug: Sending fencing update 75 for FILE-2
> 65898:Jul 17 14:18:20.089 FILE-6 pacemaker-controld  [19415]
> (controld_delete_node_state)       info: Deleting all state for node FILE-2
> (via CIB call 76) | xpath=//node_state[@uname='FILE-2']/*
> 65899:Jul 17 14:18:20.089 FILE-6 pacemaker-controld  [19415]
> (pcmk__execute_graph)      debug: Transition 0 (Complete=34, Pending=1,
> Fired=0, Skipped=0, Incomplete=24,
> Source=/var/lib/pacemaker/pengine/pe-warn-0.bz2): In progress
> 65907:Jul 17 14:18:20.089 FILE-6 pacemaker-controld  [19415]
> (te_update_diff)   debug: Processing (cib_modify) diff: 0.24.73 -> 0.24.74
> (S_FINALIZE_JOIN)
> 65908:Jul 17 14:18:20.089 FILE-6 pacemaker-controld  [19415]
> (cib_fencing_updated)      info: Fencing update 73 for FILE-2: complete
> 65916:Jul 17 14:18:20.093 FILE-6 pacemaker-controld  [19415]
> (te_update_diff)   debug: Processing (cib_delete) diff: 0.24.74 -> 0.24.75
> (S_FINALIZE_JOIN)
> 65919:Jul 17 14:18:20.093 FILE-6 pacemaker-controld  [19415]
> (match_down_event)         debug: Shutdown action 63
> (stonith-FILE-2-reboot) found for node 2
> 65920:Jul 17 14:18:20.093 FILE-6 pacemaker-controld  [19415]
> (cib_delete_callback)      debug: Deletion of all state for node FILE-2
> (via CIB call 74) succeeded
> 65921:Jul 17 14:18:20.093 FILE-6 pacemaker-controld  [19415]
> (cib_fencing_updated)      info: Fencing update 75 for FILE-2: complete
> 65924:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (cib_delete_callback)      debug: Deletion of all state for node FILE-2
> (via CIB call 76) succeeded
> 65927:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415] (node_left)
>      info: Group crmd event 5: FILE-2 (node 2 pid 15962) left for unknown
> reason
> 65928:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (crm_update_peer_proc)     info: node_left: Node FILE-2[2] - corosync-cpg
> is now offline
> 65929:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (peer_update_callback)     info: Node FILE-2 is no longer a peer | DC=true
> old=0x4000000 new=0x0000000
> 65930:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (controld_delete_node_state)       info: Deleting transient attributes for
> node FILE-2 (via CIB call 77) |
> xpath=//node_state[@uname='FILE-2']/transient_attributes
> 65932:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (match_down_event)         debug: Shutdown action 63
> (stonith-FILE-2-reboot) found for node 2
> 65933:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (pcmk_cpg_membership)      info: Group crmd event 5: FILE-3 (node 3 pid
> 19250) is member
> 65934:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (pcmk_cpg_membership)      info: Group crmd event 5: FILE-4 (node 4 pid
> 19122) is member
> 65935:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (pcmk_cpg_membership)      info: Group crmd event 5: FILE-5 (node 5 pid
> 19273) is member
> 65936:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (pcmk_cpg_membership)      info: Group crmd event 5: FILE-6 (node 6 pid
> 19415) is member
> 65938:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x00880000 (new_actions)
> for controller set by s_crmd_fsa:198
> 65939:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415] (s_crmd_fsa)
>       debug: Processing I_JOIN_RESULT: [ state=S_FINALIZE_JOIN
> cause=C_HA_MESSAGE origin=route_message ]
> 65940:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x00800000 (an_action)
> for controller cleared by do_fsa_action:108
> 65941:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x00080000 (an_action)
> for controller cleared by do_fsa_action:108
> 65942:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (controld_delete_node_state)       info: Deleting resource history for node
> FILE-6 (via CIB call 79) | xpath=//node_state[@uname='FILE-6']/lrm
> 65943:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__create_history_xml)         debug: build_active_RAs: Updating
> resource stonith-sbd after monitor op complete (interval=0)
> 65945:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__create_history_xml)         debug: build_active_RAs: Updating
> resource FILE_Filesystem after monitor op complete (interval=0)
> 65946:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__create_history_xml)         debug: build_active_RAs: Updating
> resource Service_pfile after monitor op complete (interval=0)
> 65947:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__create_history_xml)         debug: build_active_RAs: Updating
> resource Service_Postgresql after monitor op complete (interval=0)
> 65948:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__create_history_xml)         debug: build_active_RAs: Updating
> resource Service_esm_primary after monitor op complete (interval=0)
> 65949:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__create_history_xml)         debug: build_active_RAs: Updating
> resource Service_Postgrest after monitor op complete (interval=0)
> 65950:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__create_history_xml)         debug: build_active_RAs: Updating
> resource IP_Floating after monitor op complete (interval=0)
> 65951:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__create_history_xml)         debug: build_active_RAs: Updating
> resource Shared_Cluster_Backup after monitor op complete (interval=0)
> 65952:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (do_dc_join_ack)   debug: Updating local node history for join-1 from query
> result (via CIB call 80)
> 65954:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x1000000000000
> (fsa_data->actions) for controller set by s_crmd_fsa:193
> 65955:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x1000000000000000
> (new_actions) for controller set by s_crmd_fsa:198
> 65956:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415] (s_crmd_fsa)
>       debug: Processing I_WAIT_FOR_EVENT: [ state=S_FINALIZE_JOIN
> cause=C_HA_MESSAGE origin=do_te_invoke ]
> 65957:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x1000000000000000
> (an_action) for controller cleared by do_fsa_action:108
> 65958:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415] (do_log)
> info: Input I_WAIT_FOR_EVENT received in state S_FINALIZE_JOIN from
> do_te_invoke
> 65959:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415] (do_log)
> debug: do_log   <create_request_adv origin="do_cl_join_query" t="crmd"
> version="3.11.0" subt="request" reference="join_announce-crmd-1689603376-2"
> crm_task="join_announce" crm_sys_to="dc" crm_sys_from="crmd" src="FILE-1"/>
> 65960:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x1000000000000
> (an_action) for controller cleared by do_fsa_action:108
> 65961:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (abort_transition_graph)   info: Transition 0 aborted: Peer Halt |
> source=do_te_invoke:135 complete=false
> 65962:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (register_fsa_input_adv)   debug: Stalling the FSA pending further input:
> source=do_te_invoke cause=C_HA_MESSAGE data=0x55c619871630 queue=0
> 65963:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415] (s_crmd_fsa)
>       debug: Exiting the FSA: queue=1, fsa_actions=0x0, stalled=true
> 65964:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (fsa_dump_queue)   debug: queue[0.80]: input I_WAIT_FOR_EVENT raised by
> do_te_invoke(0x55c6198798d0.1)   (cause=C_HA_MESSAGE)
> 65966:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__execute_graph)      debug: Transition 0 (Complete=34, Pending=1,
> Fired=0, Skipped=0, Incomplete=24,
> Source=/var/lib/pacemaker/pengine/pe-warn-0.bz2): In progress
> 65967:Jul 17 14:18:20.097 FILE-6 pacemaker-fenced    [19411]
> (process_remote_stonith_exec)      debug: Finalizing action 'reboot'
> targeting FILE-1 on behalf of pacemaker-controld.19415 at FILE-6: OK | rc=0
> id=446afc42
> 65968:Jul 17 14:18:20.097 FILE-6 pacemaker-fenced    [19411]
> (remote_op_done)   notice: Operation 'reboot' targeting FILE-1 by FILE-5
> for pacemaker-controld.19415 at FILE-6: OK | id=446afc42
> 65970:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (tengine_stonith_callback)         notice: Stonith operation
> 4/62:0:0:232e6505-2e98-4a79-b6ce-5f26d9cba645: OK (0)
> 65971:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (tengine_stonith_callback)         info: Stonith operation 4 for FILE-1
> passed
> 65972:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (pcmk__update_peer_expected)       info: crmd_peer_down: Node FILE-1[1] -
> expected state is now down (was pending)
> 65973:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (send_stonith_update)      debug: Sending fencing update 81 for FILE-1
> 65974:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (controld_delete_node_state)       info: Deleting all state for node FILE-1
> (via CIB call 82) | xpath=//node_state[@uname='FILE-1']/*
> 65975:Jul 17 14:18:20.097 FILE-6 pacemaker-controld  [19415]
> (exec_alert_list)  info: Sending fencing alert via pf-ha-alert to (null)
> 65979:Jul 17 14:18:20.101 FILE-6 pacemaker-controld  [19415]
> (tengine_stonith_notify)   notice: Peer FILE-1 was terminated (reboot) by
> FILE-5 on behalf of pacemaker-controld.19415: OK | initiator=FILE-6
> ref=446afc42-b46e-47af-9fac-0fa87c1c5e57
> 65980:Jul 17 14:18:20.101 FILE-6 pacemaker-controld  [19415]
> (send_stonith_update)      debug: Sending fencing update 83 for FILE-1
> 65982:Jul 17 14:18:20.101 FILE-6 pacemaker-controld  [19415]
> (controld_delete_node_state)       info: Deleting all state for node FILE-1
> (via CIB call 84) | xpath=//node_state[@uname='FILE-1']/*
> 65983:Jul 17 14:18:20.101 FILE-6 pacemaker-controld  [19415]
> (cib_delete_callback)      debug: Deletion of transient attributes for node
> FILE-2 (via CIB call 77) succeeded
> 65984:Jul 17 14:18:20.101 FILE-6 pacemaker-controld  [19415]
> (pcmk__execute_graph)      notice: Transition 0 (Complete=35, Pending=0,
> Fired=0, Skipped=3, Incomplete=24,
> Source=/var/lib/pacemaker/pengine/pe-warn-0.bz2): Stopped
> 65985:Jul 17 14:18:20.101 FILE-6 pacemaker-controld  [19415]
> (te_graph_trigger)         debug: Transition 0 is now complete
> 65986:Jul 17 14:18:20.101 FILE-6 pacemaker-controld  [19415] (notify_crmd)
>      debug: Processing transition completion in state S_FINALIZE_JOIN
> 65987:Jul 17 14:18:20.101 FILE-6 pacemaker-controld  [19415] (notify_crmd)
>      debug: Transition 0 status: restart - Node join
> 65988:Jul 17 14:18:20.101 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x1000000000000
> (fsa_data->actions) for controller set by s_crmd_fsa:193
> 65989:Jul 17 14:18:20.101 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x1000000000000000
> (new_actions) for controller set by s_crmd_fsa:198
> 65990:Jul 17 14:18:20.101 FILE-6 pacemaker-controld  [19415] (s_crmd_fsa)
>       debug: Processing I_WAIT_FOR_EVENT: [ state=S_FINALIZE_JOIN
> cause=C_HA_MESSAGE origin=do_te_invoke ]
> 65991:Jul 17 14:18:20.101 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x1000000000000000
> (an_action) for controller cleared by do_fsa_action:108
> 65992:Jul 17 14:18:20.101 FILE-6 pacemaker-controld  [19415] (do_log)
> info: Input I_WAIT_FOR_EVENT received in state S_FINALIZE_JOIN from
> do_te_invoke
> 65993:Jul 17 14:18:20.101 FILE-6 pacemaker-controld  [19415] (do_log)
> debug: do_log   <create_request_adv origin="do_cl_join_query" t="crmd"
> version="3.11.0" subt="request" reference="join_announce-crmd-1689603376-2"
> crm_task="join_announce" crm_sys_to="dc" crm_sys_from="crmd" src="FILE-1"/>
> 65994:Jul 17 14:18:20.101 FILE-6 pacemaker-controld  [19415]
> (pcmk__clear_flags_as)     debug: FSA action flags 0x1000000000000
> (an_action) for controller cleared by do_fsa_action:108
> 65995:Jul 17 14:18:20.101 FILE-6 pacemaker-controld  [19415]
> (abort_transition_graph)   info: Transition 0 aborted: Peer Halt |
> source=do_te_invoke:135 complete=true
> 65996:Jul 17 14:18:20.101 FILE-6 pacemaker-controld  [19415] (s_crmd_fsa)
>       debug: Processing I_PE_CALC: [ state=S_FINALIZE_JOIN
> cause=C_FSA_INTERNAL origin=abort_transition_graph ]
> 66024:Jul 17 14:18:20.101 FILE-6 pacemaker-controld  [19415]
> (cib_delete_callback)      debug: Deletion of resource history for node
> FILE-6 (via CIB call 79) succeeded
> 66063:Jul 17 14:18:20.105 FILE-6 pacemaker-controld  [19415]
> (join_update_complete_callback)    debug: join-1 node history update (via
> CIB call 80) complete
> 66064:Jul 17 14:18:20.105 FILE-6 pacemaker-controld  [19415]
> (check_join_state)         debug: join-1: Complete | state=S_FINALIZE_JOIN
> for=join_update_complete_callback
> 66068:Jul 17 14:18:20.105 FILE-6 pacemaker-controld  [19415]
> (pcmk__set_flags_as)       debug: FSA action flags 0x800400000000
> (new_actions) for controller set by s_crmd_fsa:198
>
> Thanks
> Priyanka
>
> On Thu, Jul 20, 2023 at 11:53 AM Reid Wahl <nwahl at redhat.com> wrote:
>
>> On Wed, Jul 19, 2023 at 8:33 PM Priyanka Balotra
>> <priyanka.14balotra at gmail.com> wrote:
>> >
>> > Sure,
>> > Here are the logs:
>> >
>> >
>> > 63138:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (post_cache_update)        debug: Updated cache after membership event 44.
>> > 63139:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (pcmk__set_flags_as)       debug: FSA action flags 0x200000000
>> (A_ELECTION_CHECK) for controller set by post_cache_update:81
>> > 63140:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (pcmk__clear_flags_as)     debug: FSA action flags 0x00000002 (an_action)
>> for controller cleared by do_fsa_action:108
>> > 63141:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (do_started)       info: Delaying start, Config not read (0000000000000040)
>> > 63142:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (register_fsa_input_adv)   debug: Stalling the FSA pending further input:
>> source=do_started cause=C_FSA_INTERNAL data=(nil) queue=0
>> > 63143:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (pcmk__set_flags_as)       debug: FSA action flags 0x00000002
>> (with_actions) for controller set by register_fsa_input_adv:88
>> > 63144:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (s_crmd_fsa)       debug: Exiting the FSA: queue=0,
>> fsa_actions=0x200000002, stalled=true
>> > 63145:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (config_query_callback)    debug: Call 3 : Parsing CIB options
>> > 63146:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (config_query_callback)    debug: Shutdown escalation occurs if DC has not
>> responded to request in 1200000ms
>> > 63147:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (config_query_callback)    debug: Re-run scheduler after 900000ms of
>> inactivity
>> > 63148:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (pe_unpack_alerts)         debug: Alert pf-ha-alert:
>> path=/usr/lib/ocf/resource.d/pacemaker/pf_ha_alert.sh timeout=30000ms
>> tstamp-format='%H:%M:%S.%06N' 0 vars
>> > 63149:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (pcmk__clear_flags_as)     debug: FSA action flags 0x00000002 (an_action)
>> for controller cleared by do_fsa_action:108
>> > 63150:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (do_started)       debug: Init server comms
>> > 63151:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (qb_ipcs_us_publish)       info: server name: crmd
>> > 63152:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (do_started)       notice: Pacemaker controller successfully started and
>> accepting connections
>> > 63153:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (pcmk__clear_flags_as)     debug: FSA action flags 0x200000000 (an_action)
>> for controller cleared by do_fsa_action:108
>> > 63154:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (do_election_check)        debug: Ignoring election check because we are
>> not in an election
>> > 63155:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (pcmk__set_flags_as)       debug: FSA action flags 0x1000000000100100
>> (new_actions) for controller set by s_crmd_fsa:198
>> > 63156:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (s_crmd_fsa)       debug: Processing I_PENDING: [ state=S_STARTING
>> cause=C_FSA_INTERNAL origin=do_started ]
>> > 63157:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (pcmk__clear_flags_as)     debug: FSA action flags 0x1000000000000000
>> (an_action) for controller cleared by do_fsa_action:108
>> > 63158:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962] (do_log)
>>  info: Input I_PENDING received in state S_STARTING from do_started
>> > 63159:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (do_state_transition)      notice: State transition S_STARTING -> S_PENDING
>> | input=I_PENDING cause=C_FSA_INTERNAL origin=do_started
>> > 63160:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (pcmk__set_flags_as)       debug: FSA action flags 0x00000020
>> (A_INTEGRATE_TIMER_STOP) for controller set by do_state_transition:559
>> > 63161:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (pcmk__set_flags_as)       debug: FSA action flags 0x00000080
>> (A_FINALIZE_TIMER_STOP) for controller set by do_state_transition:565
>> > 63162:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (pcmk__clear_flags_as)     debug: FSA action flags 0x00000020 (an_action)
>> for controller cleared by do_fsa_action:108
>> > 63163:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (pcmk__clear_flags_as)     debug: FSA action flags 0x00000080 (an_action)
>> for controller cleared by do_fsa_action:108
>> > 63164:Jul 17 14:16:25.132 FILE-2 pacemaker-controld  [15962]
>> (pcmk__clear_flags_as)     debug: FSA action flags 0x00100000 (an_action)
>> for controller cleared by do_fsa_action:108
>> > 63165:Jul 17 14:16:26.132 FILE-2 pacemaker-controld  [15962]
>> (do_cl_join_query)         debug: Querying for a DC
>> > 63166:Jul 17 14:16:26.132 FILE-2 pacemaker-controld  [15962]
>> (pcmk__clear_flags_as)     debug: FSA action flags 0x00000100 (an_action)
>> for controller cleared by do_fsa_action:108
>> > 63167:Jul 17 14:16:26.132 FILE-2 pacemaker-controld  [15962]
>> (controld_start_timer)     debug: Started Election Trigger (inject
>> I_DC_TIMEOUT if pops after 20000ms, source=18)
>> > 63168:Jul 17 14:16:26.132 FILE-2 pacemaker-controld  [15962]
>> (stonith_api_signon)       debug: Attempting fencer connection by
>> pacemaker-controld with mainloop
>> > 63175:Jul 17 14:16:26.132 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:131085; real_size:135168;
>> rb->word_size:33792
>> > 63176:Jul 17 14:16:26.132 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:131085; real_size:135168;
>> rb->word_size:33792
>> > 63177:Jul 17 14:16:26.132 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:131085; real_size:135168;
>> rb->word_size:33792
>> > 63178:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced    [15958]
>> (stonith_command)  debug: Processing register 8 from client
>> pacemaker-controld.15962 with call options 0x00000000
>> > 63179:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced    [15958]
>> (stonith_command)  debug: Processed register from client
>> pacemaker-controld.15962: OK (rc=0)
>> > 63180:Jul 17 14:16:26.132 FILE-2 pacemaker-controld  [15962]
>> (stonith_api_signon)       debug: Connection to fencer by
>> pacemaker-controld succeeded (registration token:
>> 5552b1b4-f725-46ac-b239-e404cadd8d94)
>> > 63181:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced    [15958]
>> (stonith_command)  debug: Processing st_notify 9 from client
>> pacemaker-controld.15962 with call options 0x00000000
>> > 63182:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced    [15958]
>> (handle_request)   debug: Enabling st_notify_disconnect callbacks for
>> client pacemaker-controld.15962
>> > 63183:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced    [15958]
>> (stonith_command)  debug: Processed st_notify from client
>> pacemaker-controld.15962: OK (rc=0)
>> > 63184:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced    [15958]
>> (stonith_command)  debug: Processing st_notify 10 from client
>> pacemaker-controld.15962 with call options 0x00000000
>> > 63185:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced    [15958]
>> (handle_request)   debug: Enabling st_notify_fence callbacks for client
>> pacemaker-controld.15962
>> > 63186:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced    [15958]
>> (stonith_command)  debug: Processed st_notify from client
>> pacemaker-controld.15962: OK (rc=0)
>> > 63187:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced    [15958]
>> (stonith_command)  debug: Processing st_notify 11 from client
>> pacemaker-controld.15962 with call options 0x00000000
>> > 63188:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced    [15958]
>> (handle_request)   debug: Enabling st_notify_history_synced callbacks for
>> client pacemaker-controld.15962
>> > 63189:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced    [15958]
>> (stonith_command)  debug: Processed st_notify from client
>> pacemaker-controld.15962: OK (rc=0)
>> > 63190:Jul 17 14:16:26.132 FILE-2 pacemaker-controld  [15962]
>> (te_trigger_stonith_history_sync)  info: Fence history will be synchronized
>> cluster-wide within 30 seconds
>> > 63191:Jul 17 14:16:26.132 FILE-2 pacemaker-controld  [15962]
>> (te_connect_stonith)       notice: Fencer successfully connected
>> > 63192:Jul 17 14:16:32.664 FILE-2 pacemaker-controld  [15962]
>> (quorum_notification_cb)   info: Quorum retained | membership=48 members=5
>> > 63193:Jul 17 14:16:32.664 FILE-2 pacemaker-controld  [15962]
>> (quorum_notification_cb)   debug: Member[0] 2
>> > 63194:Jul 17 14:16:32.664 FILE-2 pacemaker-controld  [15962]
>> (quorum_notification_cb)   debug: Member[1] 4
>> > 63195:Jul 17 14:16:32.668 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63196:Jul 17 14:16:32.668 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63197:Jul 17 14:16:32.668 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63198:Jul 17 14:16:32.668 FILE-2 pacemaker-controld  [15962]
>> (qb_ipcc_disconnect)       debug: qb_ipcc_disconnect()
>> > 63199:Jul 17 14:16:32.668 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-e4qK7U/qb-request-cmap-header
>> > 63200:Jul 17 14:16:32.668 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-e4qK7U/qb-response-cmap-header
>> > 63201:Jul 17 14:16:32.668 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-e4qK7U/qb-event-cmap-header
>> > 63202:Jul 17 14:16:32.668 FILE-2 pacemaker-controld  [15962]
>> (pcmk__corosync_name)      info: Unable to get node name for nodeid 4
>> > 63203:Jul 17 14:16:32.668 FILE-2 pacemaker-controld  [15962]
>> (get_node_name)    notice: Could not obtain a node name for corosync node
>> with id 4
>> > 63204:Jul 17 14:16:32.672 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63205:Jul 17 14:16:32.672 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63206:Jul 17 14:16:32.672 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63209:Jul 17 14:16:32.676 FILE-2 pacemaker-controld  [15962]
>> (qb_ipcc_disconnect)       debug: qb_ipcc_disconnect()
>> > 63210:Jul 17 14:16:32.676 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-YYxILU/qb-request-cmap-header
>> > 63211:Jul 17 14:16:32.676 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-YYxILU/qb-response-cmap-header
>> > 63212:Jul 17 14:16:32.676 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-YYxILU/qb-event-cmap-header
>> > 63213:Jul 17 14:16:32.676 FILE-2 pacemaker-controld  [15962]
>> (pcmk__corosync_name)      info: Unable to get node name for nodeid 4
>> > 63214:Jul 17 14:16:32.676 FILE-2 pacemaker-controld  [15962]
>> (quorum_notification_cb)   info: Obtaining name for new node 4
>> > 63218:Jul 17 14:16:32.684 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63222:Jul 17 14:16:32.684 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63225:Jul 17 14:16:32.684 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63240:Jul 17 14:16:32.688 FILE-2 pacemaker-controld  [15962]
>> (qb_ipcc_disconnect)       debug: qb_ipcc_disconnect()
>> > 63241:Jul 17 14:16:32.688 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-Cy8QVV/qb-request-cmap-header
>> > 63242:Jul 17 14:16:32.688 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-Cy8QVV/qb-response-cmap-header
>> > 63243:Jul 17 14:16:32.688 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-Cy8QVV/qb-event-cmap-header
>> > 63244:Jul 17 14:16:32.688 FILE-2 pacemaker-controld  [15962]
>> (pcmk__corosync_name)      info: Unable to get node name for nodeid 4
>> > 63245:Jul 17 14:16:32.688 FILE-2 pacemaker-controld  [15962]
>> (get_node_name)    notice: Could not obtain a node name for corosync node
>> with id 4
>> > 63246:Jul 17 14:16:32.688 FILE-2 pacemaker-controld  [15962]
>> (quorum_notification_cb)   debug: Member[2] 3
>> > 63259:Jul 17 14:16:32.700 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63265:Jul 17 14:16:32.700 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63267:Jul 17 14:16:32.700 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63298:Jul 17 14:16:32.712 FILE-2 pacemaker-controld  [15962]
>> (qb_ipcc_disconnect)       debug: qb_ipcc_disconnect()
>> > 63299:Jul 17 14:16:32.712 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-34-0DHKhX/qb-request-cmap-header
>> > 63300:Jul 17 14:16:32.712 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-34-0DHKhX/qb-response-cmap-header
>> > 63301:Jul 17 14:16:32.712 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-34-0DHKhX/qb-event-cmap-header
>> > 63302:Jul 17 14:16:32.712 FILE-2 pacemaker-controld  [15962]
>> (pcmk__corosync_name)      info: Unable to get node name for nodeid 3
>> > 63303:Jul 17 14:16:32.712 FILE-2 pacemaker-controld  [15962]
>> (get_node_name)    notice: Could not obtain a node name for corosync node
>> with id 3
>> > 63307:Jul 17 14:16:32.720 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63313:Jul 17 14:16:32.720 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63320:Jul 17 14:16:32.720 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63351:Jul 17 14:16:32.728 FILE-2 pacemaker-controld  [15962]
>> (qb_ipcc_disconnect)       debug: qb_ipcc_disconnect()
>> > 63352:Jul 17 14:16:32.728 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-34-V0bQlV/qb-request-cmap-header
>> > 63353:Jul 17 14:16:32.728 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-34-V0bQlV/qb-response-cmap-header
>> > 63355:Jul 17 14:16:32.728 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-34-V0bQlV/qb-event-cmap-header
>> > 63356:Jul 17 14:16:32.728 FILE-2 pacemaker-controld  [15962]
>> (pcmk__corosync_name)      info: Unable to get node name for nodeid 3
>> > 63357:Jul 17 14:16:32.728 FILE-2 pacemaker-controld  [15962]
>> (quorum_notification_cb)   info: Obtaining name for new node 3
>> > 63365:Jul 17 14:16:32.736 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63372:Jul 17 14:16:32.736 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63374:Jul 17 14:16:32.736 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63415:Jul 17 14:16:32.748 FILE-2 pacemaker-controld  [15962]
>> (qb_ipcc_disconnect)       debug: qb_ipcc_disconnect()
>> > 63416:Jul 17 14:16:32.748 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-34-EAFzTX/qb-request-cmap-header
>> > 63417:Jul 17 14:16:32.748 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-34-EAFzTX/qb-response-cmap-header
>> > 63418:Jul 17 14:16:32.748 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-34-EAFzTX/qb-event-cmap-header
>> > 63419:Jul 17 14:16:32.748 FILE-2 pacemaker-controld  [15962]
>> (pcmk__corosync_name)      info: Unable to get node name for nodeid 3
>> > 63420:Jul 17 14:16:32.748 FILE-2 pacemaker-controld  [15962]
>> (get_node_name)    notice: Could not obtain a node name for corosync node
>> with id 3
>> > 63421:Jul 17 14:16:32.748 FILE-2 pacemaker-controld  [15962]
>> (quorum_notification_cb)   debug: Member[3] 6
>> > 63425:Jul 17 14:16:32.752 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63426:Jul 17 14:16:32.752 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63427:Jul 17 14:16:32.752 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63479:Jul 17 14:16:32.756 FILE-2 pacemaker-controld  [15962]
>> (qb_ipcc_disconnect)       debug: qb_ipcc_disconnect()
>> > 63480:Jul 17 14:16:32.756 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-33-q3mFYU/qb-request-cmap-header
>> > 63481:Jul 17 14:16:32.756 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-33-q3mFYU/qb-response-cmap-header
>> > 63482:Jul 17 14:16:32.756 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-33-q3mFYU/qb-event-cmap-header
>> > 63483:Jul 17 14:16:32.756 FILE-2 pacemaker-controld  [15962]
>> (pcmk__corosync_name)      info: Unable to get node name for nodeid 6
>> > 63484:Jul 17 14:16:32.756 FILE-2 pacemaker-controld  [15962]
>> (get_node_name)    notice: Could not obtain a node name for corosync node
>> with id 6
>> > 63485:Jul 17 14:16:32.760 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63486:Jul 17 14:16:32.760 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63487:Jul 17 14:16:32.760 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63490:Jul 17 14:16:32.760 FILE-2 pacemaker-controld  [15962]
>> (qb_ipcc_disconnect)       debug: qb_ipcc_disconnect()
>> > 63491:Jul 17 14:16:32.760 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-EcEbfV/qb-request-cmap-header
>> > 63492:Jul 17 14:16:32.760 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-EcEbfV/qb-response-cmap-header
>> > 63493:Jul 17 14:16:32.760 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-EcEbfV/qb-event-cmap-header
>> > 63494:Jul 17 14:16:32.760 FILE-2 pacemaker-controld  [15962]
>> (pcmk__corosync_name)      info: Unable to get node name for nodeid 6
>> > 63495:Jul 17 14:16:32.760 FILE-2 pacemaker-controld  [15962]
>> (quorum_notification_cb)   info: Obtaining name for new node 6
>> > 63499:Jul 17 14:16:32.764 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63502:Jul 17 14:16:32.764 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63505:Jul 17 14:16:32.764 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63508:Jul 17 14:16:32.764 FILE-2 pacemaker-controld  [15962]
>> (qb_ipcc_disconnect)       debug: qb_ipcc_disconnect()
>> > 63509:Jul 17 14:16:32.764 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-fLk4xW/qb-request-cmap-header
>> > 63510:Jul 17 14:16:32.764 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-fLk4xW/qb-response-cmap-header
>> > 63511:Jul 17 14:16:32.764 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-fLk4xW/qb-event-cmap-header
>> > 63512:Jul 17 14:16:32.764 FILE-2 pacemaker-controld  [15962]
>> (pcmk__corosync_name)      info: Unable to get node name for nodeid 6
>> > 63513:Jul 17 14:16:32.764 FILE-2 pacemaker-controld  [15962]
>> (get_node_name)    notice: Could not obtain a node name for corosync node
>> with id 6
>> > 63514:Jul 17 14:16:32.764 FILE-2 pacemaker-controld  [15962]
>> (quorum_notification_cb)   debug: Member[4] 5
>> > 63517:Jul 17 14:16:32.768 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63518:Jul 17 14:16:32.768 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63521:Jul 17 14:16:32.768 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63528:Jul 17 14:16:32.768 FILE-2 pacemaker-controld  [15962]
>> (qb_ipcc_disconnect)       debug: qb_ipcc_disconnect()
>> > 63529:Jul 17 14:16:32.768 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-ushXmW/qb-request-cmap-header
>> > 63530:Jul 17 14:16:32.768 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-ushXmW/qb-response-cmap-header
>> > 63531:Jul 17 14:16:32.768 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-ushXmW/qb-event-cmap-header
>> > 63532:Jul 17 14:16:32.768 FILE-2 pacemaker-controld  [15962]
>> (pcmk__corosync_name)      info: Unable to get node name for nodeid 5
>> > 63533:Jul 17 14:16:32.768 FILE-2 pacemaker-controld  [15962]
>> (get_node_name)    notice: Could not obtain a node name for corosync node
>> with id 5
>> > 63534:Jul 17 14:16:32.772 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63535:Jul 17 14:16:32.772 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63536:Jul 17 14:16:32.772 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63537:Jul 17 14:16:32.772 FILE-2 pacemaker-controld  [15962]
>> (qb_ipcc_disconnect)       debug: qb_ipcc_disconnect()
>> > 63538:Jul 17 14:16:32.772 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-x3qVkW/qb-request-cmap-header
>> > 63539:Jul 17 14:16:32.772 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-x3qVkW/qb-response-cmap-header
>> > 63540:Jul 17 14:16:32.772 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-x3qVkW/qb-event-cmap-header
>> > 63541:Jul 17 14:16:32.772 FILE-2 pacemaker-controld  [15962]
>> (pcmk__corosync_name)      info: Unable to get node name for nodeid 5
>> > 63542:Jul 17 14:16:32.772 FILE-2 pacemaker-controld  [15962]
>> (quorum_notification_cb)   info: Obtaining name for new node 5
>> > 63543:Jul 17 14:16:32.776 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63544:Jul 17 14:16:32.776 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63545:Jul 17 14:16:32.776 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63546:Jul 17 14:16:32.776 FILE-2 pacemaker-controld  [15962]
>> (qb_ipcc_disconnect)       debug: qb_ipcc_disconnect()
>> > 63547:Jul 17 14:16:32.776 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-gUNSFU/qb-request-cmap-header
>> > 63548:Jul 17 14:16:32.776 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-gUNSFU/qb-response-cmap-header
>> > 63549:Jul 17 14:16:32.776 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-gUNSFU/qb-event-cmap-header
>> > 63550:Jul 17 14:16:32.776 FILE-2 pacemaker-controld  [15962]
>> (pcmk__corosync_name)      info: Unable to get node name for nodeid 5
>> > 63551:Jul 17 14:16:32.776 FILE-2 pacemaker-controld  [15962]
>> (get_node_name)    notice: Could not obtain a node name for corosync node
>> with id 5
>> > 63552:Jul 17 14:16:32.776 FILE-2 pacemaker-controld  [15962]
>> (update_peer_state_iter)   notice: Node (null) state is now lost | nodeid=1
>> previous=member source=pcmk__reap_unseen_nodes
>> > 63553:Jul 17 14:16:32.776 FILE-2 pacemaker-controld  [15962]
>> (post_cache_update)        debug: Updated cache after membership event 48.
>> > 63554:Jul 17 14:16:32.776 FILE-2 pacemaker-controld  [15962]
>> (pcmk__set_flags_as)       debug: FSA action flags 0x200000000
>> (A_ELECTION_CHECK) for controller set by post_cache_update:81
>> > 63555:Jul 17 14:16:32.776 FILE-2 pacemaker-controld  [15962]
>> (pcmk__clear_flags_as)     debug: FSA action flags 0x200000000 (an_action)
>> for controller cleared by do_fsa_action:108
>> > 63556:Jul 17 14:16:32.776 FILE-2 pacemaker-controld  [15962]
>> (do_election_check)        debug: Ignoring election check because we are
>> not in an election
>> > 63557:Jul 17 14:16:32.776 FILE-2 pacemaker-controld  [15962]
>> (pcmk_cpg_membership)      info: Group crmd event 0: node 2 pid 15962
>> joined via cpg_join
>> > 63558:Jul 17 14:16:32.776 FILE-2 pacemaker-controld  [15962]
>> (pcmk_cpg_membership)      info: Group crmd event 0: FILE-2 (node 2 pid
>> 15962) is member
>> > 63559:Jul 17 14:16:32.780 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63560:Jul 17 14:16:32.780 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63561:Jul 17 14:16:32.780 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63564:Jul 17 14:16:32.780 FILE-2 pacemaker-controld  [15962]
>> (qb_ipcc_disconnect)       debug: qb_ipcc_disconnect()
>> > 63565:Jul 17 14:16:32.780 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-5PH1gV/qb-request-cmap-header
>> > 63566:Jul 17 14:16:32.780 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-5PH1gV/qb-response-cmap-header
>> > 63567:Jul 17 14:16:32.780 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-5PH1gV/qb-event-cmap-header
>> > 63568:Jul 17 14:16:32.780 FILE-2 pacemaker-controld  [15962]
>> (pcmk__corosync_name)      info: Unable to get node name for nodeid 3
>> > 63569:Jul 17 14:16:32.780 FILE-2 pacemaker-controld  [15962]
>> (get_node_name)    notice: Could not obtain a node name for corosync node
>> with id 3
>> > 63570:Jul 17 14:16:32.780 FILE-2 pacemaker-controld  [15962]
>> (pcmk_cpg_membership)      info: Group crmd event 0: peer node (node 3 pid
>> 19250) is member
>> > 63571:Jul 17 14:16:32.780 FILE-2 pacemaker-controld  [15962]
>> (crm_update_peer_proc)     info: pcmk_cpg_membership: Node (null)[3] -
>> corosync-cpg is now online
>> > 63572:Jul 17 14:16:32.780 FILE-2 pacemaker-controld  [15962]
>> (peer_update_callback)     debug: Sending hello to node 3 so that it learns
>> our node name
>> > 63573:Jul 17 14:16:32.784 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63574:Jul 17 14:16:32.784 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63575:Jul 17 14:16:32.784 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63576:Jul 17 14:16:32.784 FILE-2 pacemaker-controld  [15962]
>> (qb_ipcc_disconnect)       debug: qb_ipcc_disconnect()
>> > 63577:Jul 17 14:16:32.784 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-QATDEV/qb-request-cmap-header
>> > 63578:Jul 17 14:16:32.784 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-QATDEV/qb-response-cmap-header
>> > 63579:Jul 17 14:16:32.784 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-QATDEV/qb-event-cmap-header
>> > 63580:Jul 17 14:16:32.784 FILE-2 pacemaker-controld  [15962]
>> (pcmk__corosync_name)      info: Unable to get node name for nodeid 4
>> > 63581:Jul 17 14:16:32.784 FILE-2 pacemaker-controld  [15962]
>> (get_node_name)    notice: Could not obtain a node name for corosync node
>> with id 4
>> > 63582:Jul 17 14:16:32.784 FILE-2 pacemaker-controld  [15962]
>> (pcmk_cpg_membership)      info: Group crmd event 0: peer node (node 4 pid
>> 19122) is member
>> > 63583:Jul 17 14:16:32.784 FILE-2 pacemaker-controld  [15962]
>> (crm_update_peer_proc)     info: pcmk_cpg_membership: Node (null)[4] -
>> corosync-cpg is now online
>> > 63584:Jul 17 14:16:32.784 FILE-2 pacemaker-controld  [15962]
>> (peer_update_callback)     debug: Sending hello to node 4 so that it learns
>> our node name
>> > 63585:Jul 17 14:16:32.788 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63586:Jul 17 14:16:32.788 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63587:Jul 17 14:16:32.788 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63588:Jul 17 14:16:32.788 FILE-2 pacemaker-controld  [15962]
>> (qb_ipcc_disconnect)       debug: qb_ipcc_disconnect()
>> > 63589:Jul 17 14:16:32.788 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-TVzR1T/qb-request-cmap-header
>> > 63590:Jul 17 14:16:32.788 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-TVzR1T/qb-response-cmap-header
>> > 63591:Jul 17 14:16:32.788 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-TVzR1T/qb-event-cmap-header
>> > 63592:Jul 17 14:16:32.788 FILE-2 pacemaker-controld  [15962]
>> (pcmk__corosync_name)      info: Unable to get node name for nodeid 5
>> > 63593:Jul 17 14:16:32.788 FILE-2 pacemaker-controld  [15962]
>> (get_node_name)    notice: Could not obtain a node name for corosync node
>> with id 5
>> > 63594:Jul 17 14:16:32.788 FILE-2 pacemaker-controld  [15962]
>> (pcmk_cpg_membership)      info: Group crmd event 0: peer node (node 5 pid
>> 19273) is member
>> > 63595:Jul 17 14:16:32.788 FILE-2 pacemaker-controld  [15962]
>> (crm_update_peer_proc)     info: pcmk_cpg_membership: Node (null)[5] -
>> corosync-cpg is now online
>> > 63596:Jul 17 14:16:32.788 FILE-2 pacemaker-controld  [15962]
>> (peer_update_callback)     debug: Sending hello to node 5 so that it learns
>> our node name
>> > 63597:Jul 17 14:16:32.792 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63598:Jul 17 14:16:32.792 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63599:Jul 17 14:16:32.792 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_open_2)     debug: shm size:1048589; real_size:1052672;
>> rb->word_size:263168
>> > 63600:Jul 17 14:16:32.792 FILE-2 pacemaker-controld  [15962]
>> (qb_ipcc_disconnect)       debug: qb_ipcc_disconnect()
>> > 63601:Jul 17 14:16:32.792 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-8LRaoV/qb-request-cmap-header
>> > 63602:Jul 17 14:16:32.792 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-8LRaoV/qb-response-cmap-header
>> > 63603:Jul 17 14:16:32.792 FILE-2 pacemaker-controld  [15962]
>> (qb_rb_close_helper)       debug: Closing ringbuffer:
>> /dev/shm/qb-13142-15962-31-8LRaoV/qb-event-cmap-header
>> > 63604:Jul 17 14:16:32.792 FILE-2 pacemaker-controld  [15962]
>> (pcmk__corosync_name)      info: Unable to get node name for nodeid 6
>> > 63605:Jul 17 14:16:32.792 FILE-2 pacemaker-controld  [15962]
>> (get_node_name)    notice: Could not obtain a node name for corosync node
>> with id 6
>> > 63606:Jul 17 14:16:32.792 FILE-2 pacemaker-controld  [15962]
>> (pcmk_cpg_membership)      info: Group crmd event 0: peer node (node 6 pid
>> 19415) is member
>> > 63607:Jul 17 14:16:32.792 FILE-2 pacemaker-controld  [15962]
>> (crm_update_peer_proc)     info: pcmk_cpg_membership: Node (null)[6] -
>> corosync-cpg is now online
>> > 63608:Jul 17 14:16:32.792 FILE-2 pacemaker-controld  [15962]
>> (peer_update_callback)     debug: Sending hello to node 6 so that it learns
>> our node name
>> > 63609:Jul 17 14:16:32.792 FILE-2 pacemaker-controld  [15962]
>> (get_xpath_object)         debug: No match for //st_notify_history_synced
>> in /notify
>> > 63610:Jul 17 14:16:32.792 FILE-2 pacemaker-controld  [15962]
>> (stonith_api_del_notification)     debug: Removing callback for
>> st_notify_history_synced events
>> > 63611:Jul 17 14:16:32.792 FILE-2 pacemaker-fenced    [15958]
>> (stonith_command)  debug: Processing st_notify 12 from client
>> pacemaker-controld.15962 with call options 0x00000000
>> > 63612:Jul 17 14:16:32.792 FILE-2 pacemaker-fenced    [15958]
>> (handle_request)   debug: Disabling st_notify_history_synced callbacks for
>> client pacemaker-controld.15962
>> > 63613:Jul 17 14:16:32.792 FILE-2 pacemaker-fenced    [15958]
>> (stonith_command)  debug: Processed st_notify from client
>> pacemaker-controld.15962: OK (rc=0)
>> > 63614:Jul 17 14:16:32.792 FILE-2 pacemaker-controld  [15962]
>> (tengine_stonith_history_synced)   debug: Fence-history synced - cancel all
>> timers
>> > 63615:Jul 17 14:16:32.792 FILE-2 pacemaker-controld  [15962]
>> (crm_get_peer)     info: Node 4 is now known as FILE-4
>> > 63616:Jul 17 14:16:32.792 FILE-2 pacemaker-controld  [15962]
>> (update_peer_uname)        warning: Node names with capitals are
>> discouraged, consider changing 'FILE-4'
>> > 63617:Jul 17 14:16:32.796 FILE-2 pacemaker-controld  [15962]
>> (peer_update_callback)     info: Cluster node FILE-4 is now member
>> > 63618:Jul 17 14:16:32.796 FILE-2 pacemaker-controld  [15962]
>> (crm_get_peer)     info: Node 3 is now known as FILE-3
>> > 63619:Jul 17 14:16:32.796 FILE-2 pacemaker-controld  [15962]
>> (update_peer_uname)        warning: Node names with capitals are
>> discouraged, consider changing 'FILE-3'
>> > 63620:Jul 17 14:16:32.796 FILE-2 pacemaker-controld  [15962]
>> (peer_update_callback)     info: Cluster node FILE-3 is now member
>> > 63621:Jul 17 14:16:32.796 FILE-2 pacemaker-controld  [15962]
>> (crm_get_peer)     info: Node 5 is now known as FILE-5
>> > 63622:Jul 17 14:16:32.796 FILE-2 pacemaker-controld  [15962]
>> (update_peer_uname)        warning: Node names with capitals are
>> discouraged, consider changing 'FILE-5'
>> > 63623:Jul 17 14:16:32.796 FILE-2 pacemaker-controld  [15962]
>> (peer_update_callback)     info: Cluster node FILE-5 is now member
>> > 63640:Jul 17 14:16:32.880 FILE-2 pacemaker-controld  [15962]
>> (crm_get_peer)     info: Node 6 is now known as FILE-6
>> > 63641:Jul 17 14:16:32.880 FILE-2 pacemaker-controld  [15962]
>> (update_peer_uname)        warning: Node names with capitals are
>> discouraged, consider changing 'FILE-6'
>> > 63642:Jul 17 14:16:32.880 FILE-2 pacemaker-controld  [15962]
>> (peer_update_callback)     info: Cluster node FILE-6 is now member
>> > 63643:Jul 17 14:16:32.880 FILE-2 pacemaker-controld  [15962]
>> (handle_request)   debug: Raising I_JOIN_OFFER: join-1
>> > 63644:Jul 17 14:16:32.880 FILE-2 pacemaker-controld  [15962]
>> (pcmk__set_flags_as)       debug: FSA action flags 0x00400200 (new_actions)
>> for controller set by s_crmd_fsa:198
>> > 63645:Jul 17 14:16:32.880 FILE-2 pacemaker-controld  [15962]
>> (s_crmd_fsa)       debug: Processing I_JOIN_OFFER: [ state=S_PENDING
>> cause=C_HA_MESSAGE origin=route_message ]
>> > 63646:Jul 17 14:16:32.880 FILE-2 pacemaker-controld  [15962]
>> (pcmk__clear_flags_as)     debug: FSA action flags 0x00000200 (an_action)
>> for controller cleared by do_fsa_action:108
>> > 63647:Jul 17 14:16:32.880 FILE-2 pacemaker-controld  [15962]
>> (pcmk__clear_flags_as)     debug: FSA action flags 0x00400000 (an_action)
>> for controller cleared by do_fsa_action:108
>> > 63648:Jul 17 14:16:32.880 FILE-2 pacemaker-controld  [15962]
>> (update_dc)        info: Set DC to FILE-6 (3.11.0)
>> > 63649:Jul 17 14:16:32.880 FILE-2 pacemaker-controld  [15962]
>> (pcmk__update_peer_expected)       info: update_dc: Node FILE-6[6] -
>> expected state is now member (was (null))
>> > 63650:Jul 17 14:16:32.880 FILE-2 pacemaker-controld  [15962]
>> (pcmk__set_flags_as)       debug: FSA action flags 0x00000200
>> (A_DC_TIMER_STOP) for controller set by do_cl_join_offer_respond:147
>> > 63651:Jul 17 14:16:32.880 FILE-2 pacemaker-controld  [15962]
>> (pcmk__clear_flags_as)     debug: FSA action flags 0x00000200 (an_action)
>> for controller cleared by do_fsa_action:108
>> > 63788:Jul 17 14:16:32.884 FILE-2 pacemaker-controld  [15962]
>> (do_cib_replaced)  debug: Updating the CIB after a replace: DC=false
>> > 63811:Jul 17 14:16:32.892 FILE-2 pacemaker-controld  [15962]
>> (join_query_callback)      debug: Respond to join offer join-1 from FILE-6
>> > 63819:Jul 17 14:16:55.080 FILE-2 pacemaker-controld  [15962]
>> (pcmk__procfs_pid_of)      info: Found pacemaker-based active as process
>> 15957
>> > 63820:Jul 17 14:16:55.080 FILE-2 pacemaker-controld  [15962]
>> (throttle_cib_load)        debug: Init 6 + 2 ticks at 1689603415 (100 tps)
>> > 63821:Jul 17 14:16:55.080 FILE-2 pacemaker-controld  [15962]
>> (throttle_mode)    debug: Current load is 0.980000 across 10 core(s)
>> > 63822:Jul 17 14:16:55.080 FILE-2 pacemaker-controld  [15962]
>> (throttle_send_command)    info: New throttle mode: negligible load (was
>> undetermined)
>> > 63823:Jul 17 14:16:55.080 FILE-2 pacemaker-controld  [15962]
>> (throttle_update)  debug: Node FILE-2 has negligible load and supports at
>> most 20 jobs; new job limit 20
>> > 63824:Jul 17 14:16:55.092 FILE-2 pacemaker-controld  [15962]
>> (handle_request)   debug: Raising I_JOIN_RESULT: join-1
>> > 63825:Jul 17 14:16:55.092 FILE-2 pacemaker-controld  [15962]
>> (pcmk__set_flags_as)       debug: FSA action flags 0x00800000 (new_actions)
>> for controller set by s_crmd_fsa:198
>> > 63826:Jul 17 14:16:55.092 FILE-2 pacemaker-controld  [15962]
>> (s_crmd_fsa)       debug: Processing I_JOIN_RESULT: [ state=S_PENDING
>> cause=C_HA_MESSAGE origin=route_message ]
>> > 63827:Jul 17 14:16:55.092 FILE-2 pacemaker-controld  [15962]
>> (pcmk__clear_flags_as)     debug: FSA action flags 0x00800000 (an_action)
>> for controller cleared by do_fsa_action:108
>> > 63828:Jul 17 14:16:55.092 FILE-2 pacemaker-controld  [15962]
>> (do_cl_join_finalize_respond)      debug: Confirming join-1: sending local
>> operation history to FILE-6
>> > 63829:Jul 17 14:16:55.092 FILE-2 pacemaker-controld  [15962]
>> (pcmk__set_flags_as)       debug: FSA action flags 0x1000000000000200
>> (new_actions) for controller set by s_crmd_fsa:198
>> > 63830:Jul 17 14:16:55.092 FILE-2 pacemaker-controld  [15962]
>> (s_crmd_fsa)       debug: Processing I_NOT_DC: [ state=S_PENDING
>> cause=C_HA_MESSAGE origin=do_cl_join_finalize_respond ]
>> > 63831:Jul 17 14:16:55.092 FILE-2 pacemaker-controld  [15962]
>> (pcmk__clear_flags_as)     debug: FSA action flags 0x1000000000000000
>> (an_action) for controller cleared by do_fsa_action:108
>> > 63832:Jul 17 14:16:55.092 FILE-2 pacemaker-controld  [15962] (do_log)
>>  info: Input I_NOT_DC received in state S_PENDING from
>> do_cl_join_finalize_respond
>> > 63833:Jul 17 14:16:55.092 FILE-2 pacemaker-controld  [15962]
>> (do_state_transition)      notice: State transition S_PENDING -> S_NOT_DC |
>> input=I_NOT_DC cause=C_HA_MESSAGE origin=do_cl_join_finalize_respond
>> > 63834:Jul 17 14:16:55.092 FILE-2 pacemaker-controld  [15962]
>> (pcmk__set_flags_as)       debug: FSA action flags 0x00000020
>> (A_INTEGRATE_TIMER_STOP) for controller set by do_state_transition:559
>> > 63835:Jul 17 14:16:55.092 FILE-2 pacemaker-controld  [15962]
>> (pcmk__set_flags_as)       debug: FSA action flags 0x00000080
>> (A_FINALIZE_TIMER_STOP) for controller set by do_state_transition:565
>> > 63836:Jul 17 14:16:55.092 FILE-2 pacemaker-controld  [15962]
>> (pcmk__clear_flags_as)     debug: FSA action flags 0x00000200 (an_action)
>> for controller cleared by do_fsa_action:108
>> > 63837:Jul 17 14:16:55.092 FILE-2 pacemaker-controld  [15962]
>> (pcmk__clear_flags_as)     debug: FSA action flags 0x00000020 (an_action)
>> for controller cleared by do_fsa_action:108
>> > 63838:Jul 17 14:16:55.092 FILE-2 pacemaker-controld  [15962]
>> (pcmk__clear_flags_as)     debug: FSA action flags 0x00000080 (an_action)
>> for controller cleared by do_fsa_action:108
>> > 63863:Jul 17 14:17:25.073 FILE-2 pacemaker-controld  [15962]
>> (throttle_cib_load)        debug: cib load: 0.000667 (2 ticks in 30s)
>> > 63864:Jul 17 14:17:25.073 FILE-2 pacemaker-controld  [15962]
>> (throttle_mode)    debug: Current load is 0.650000 across 10 core(s)
>> > 63865:Jul 17 14:17:55.073 FILE-2 pacemaker-controld  [15962]
>> (throttle_cib_load)        debug: cib load: 0.000333 (1 ticks in 30s)
>> > 63866:Jul 17 14:17:55.073 FILE-2 pacemaker-controld  [15962]
>> (throttle_mode)    debug: Current load is 0.850000 across 10 core(s)
>> > 63868:Jul 17 14:18:20.085 FILE-2 pacemaker-fenced    [15958]
>> (process_remote_stonith_exec)      debug: Finalizing action 'reboot'
>> targeting FILE-2 on behalf of pacemaker-controld.19415 at FILE-6: OK | rc=0
>> id=4e523b34
>> > 63869:Jul 17 14:18:20.085 FILE-2 pacemaker-fenced    [15958]
>> (remote_op_done)   notice: Operation 'reboot' targeting FILE-2 by FILE-4
>> for pacemaker-controld.19415 at FILE-6: OK | id=4e523b34
>> > 63872:Jul 17 14:18:20.089 FILE-2 pacemaker-controld  [15962]
>> (exec_alert_list)  info: Sending fencing alert via pf-ha-alert to (null)
>> > 63875:Jul 17 14:18:20.089 FILE-2 pacemaker-controld  [15962]
>> (tengine_stonith_notify)   crit: We were allegedly just fenced by FILE-4
>> for FILE-6!
>> > 63876:Jul 17 14:18:20.089 FILE-2 pacemaker-controld  [15962]
>> (crm_xml_cleanup)  info: Cleaning up memory from libxml2
>> > 63877:Jul 17 14:18:20.089 FILE-2 pacemaker-controld  [15962]
>> (crm_exit)         info: Exiting pacemaker-controld | with status 100
>> > 63900:Jul 17 14:18:20.093 FILE-2 pacemakerd          [15956]
>> (pcmk_child_exit)  warning: Shutting cluster down because
>> pacemaker-controld[15962] had fatal failure
>> > 63902:Jul 17 14:18:20.093 FILE-2 pacemakerd          [15956]
>> (pcmk_shutdown_worker)     debug: pacemaker-controld confirmed stopped
>> > 63956:Jul 17 14:18:20.101 FILE-2 pacemaker-fenced    [15958]
>> (process_remote_stonith_exec)      debug: Finalizing action 'reboot'
>> targeting FILE-1 on behalf of pacemaker-controld.19415 at FILE-6: OK | rc=0
>> id=446afc42
>> > 63957:Jul 17 14:18:20.101 FILE-2 pacemaker-fenced    [15958]
>> (remote_op_done)   notice: Operation 'reboot' targeting FILE-1 by FILE-5
>> for pacemaker-controld.19415 at FILE-6: OK | id=446afc42>
>> > Thanks
>> > Priyanka
>>
>> Hi, node FILE-6 requested that node FILE-2 be fenced by node FILE-4.
>> FILE-2's controller daemon received notification that it was being
>> fenced, and it shut down. You'd want to check the logs on FILE-6 to
>> determine why FILE-2 was fenced.
>>
>> >
>> > On Thu, Jul 20, 2023 at 12:07 AM Ken Gaillot <kgaillot at redhat.com>
>> wrote:
>> >>
>> >> On Wed, 2023-07-19 at 23:49 +0530, Priyanka Balotra wrote:
>> >> > Hi All,
>> >> > I am using SLES 15 SP4. One of the nodes of the cluster is brought
>> >> > down and boot up after sometime. Pacemaker service came up first but
>> >> > later it faced a fatal shutdown. Due to that crm service is down.
>> >> >
>> >> > The logs from /var/log/pacemaker.pacemaker.log are as follows:
>> >> >
>> >> > Jul 17 14:18:20.093 FILE-2 pacemakerd          [15956]
>> >> > (pcmk_child_exit)        warning: Shutting cluster down because
>> >> > pacemaker-controld[15962] had fatal failure
>> >>
>> >> The interesting messages will be before this. The ones with "pacemaker-
>> >> controld" will be the most relevant, at least initially.
>> >>
>> >> > Jul 17 14:18:20.093 FILE-2 pacemakerd          [15956]
>> >> > (pcmk_shutdown_worker)   notice: Shutting down Pacemaker
>> >> > Jul 17 14:18:20.093 FILE-2 pacemakerd          [15956]
>> >> > (pcmk_shutdown_worker)   debug: pacemaker-controld confirmed stopped
>> >> > Jul 17 14:18:20.093 FILE-2 pacemakerd          [15956] (stop_child)
>> >> >   notice: Stopping pacemaker-schedulerd | sent signal 15 to process
>> >> > 15961
>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-schedulerd[15961]
>> >> > (crm_signal_dispatch)    notice: Caught 'Terminated' signal | 15
>> >> > (invoking handler)
>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-schedulerd[15961]
>> >> > (qb_ipcs_us_withdraw)    info: withdrawing server sockets
>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-schedulerd[15961]
>> >> > (qb_ipcs_unref)  debug: qb_ipcs_unref() - destroying
>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-schedulerd[15961]
>> >> > (crm_xml_cleanup)        info: Cleaning up memory from libxml2
>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-schedulerd[15961] (crm_exit)
>> >> >   info: Exiting pacemaker-schedulerd | with status 0
>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-based     [15957]
>> >> > (qb_ipcs_event_sendv)    debug: new_event_notification (/dev/shm/qb-
>> >> > 15957-15962-12-RDPw6O/qb): Broken pipe (32)
>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-based     [15957]
>> >> > (cib_notify_send_one)    warning: Could not notify client crmd:
>> >> > Broken pipe | id=e29d175e-7e91-4b6a-bffb-fabfdd7a33bf
>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-based     [15957]
>> >> > (cib_process_request)    info: Completed cib_delete operation for
>> >> > section //node_state[@uname='FILE-2']/*: OK (rc=0, origin=FILE-
>> >> > 6/crmd/74, version=0.24.75)
>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-fenced    [15958]
>> >> > (xml_patch_version_check)        debug: Can apply patch 0.24.75 to
>> >> > 0.24.74
>> >> > Jul 17 14:18:20.093 FILE-2 pacemakerd          [15956]
>> >> > (pcmk_child_exit)        info: pacemaker-schedulerd[15961] exited
>> >> > with status 0 (OK)
>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-based     [15957]
>> >> > (cib_process_request)    info: Completed cib_modify operation for
>> >> > section status: OK (rc=0, origin=FILE-6/crmd/75, version=0.24.75)
>> >> > Jul 17 14:18:20.093 FILE-2 pacemakerd          [15956]
>> >> > (pcmk_shutdown_worker)   debug: pacemaker-schedulerd confirmed
>> >> > stopped
>> >> > Jul 17 14:18:20.093 FILE-2 pacemakerd          [15956] (stop_child)
>> >> >   notice: Stopping pacemaker-attrd | sent signal 15 to process 15960
>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-attrd     [15960]
>> >> > (crm_signal_dispatch)    notice: Caught 'Terminated' signal | 15
>> >> > (invoking handler)
>> >> >
>> >> > Could you please help me understand the issue here.
>> >> >
>> >> > Regards
>> >> > Priyanka
>> >> > _______________________________________________
>> >> > Manage your subscription:
>> >> > https://lists.clusterlabs.org/mailman/listinfo/users
>> >> >
>> >> > ClusterLabs home: https://www.clusterlabs.org/
>> >> --
>> >> Ken Gaillot <kgaillot at redhat.com>
>> >>
>> >> _______________________________________________
>> >> Manage your subscription:
>> >> https://lists.clusterlabs.org/mailman/listinfo/users
>> >>
>> >> ClusterLabs home: https://www.clusterlabs.org/
>> >
>> > _______________________________________________
>> > Manage your subscription:
>> > https://lists.clusterlabs.org/mailman/listinfo/users
>> >
>> > ClusterLabs home: https://www.clusterlabs.org/
>>
>>
>>
>> --
>> Regards,
>>
>> Reid Wahl (He/Him)
>> Senior Software Engineer, Red Hat
>> RHEL High Availability - Pacemaker
>>
>> _______________________________________________
>> Manage your subscription:
>> https://lists.clusterlabs.org/mailman/listinfo/users
>>
>> ClusterLabs home: https://www.clusterlabs.org/
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.clusterlabs.org/pipermail/users/attachments/20230720/18295bb1/attachment-0001.htm>


More information about the Users mailing list