FAQ

From Cluster Labs

Jump to: navigation, search


Contents

Technical

How Do I Install Pacemaker?

Installation from source and from pre-built packages is described on the Install page.

What Messaging Layers are Supported?

Which Messaging Layer Should I Choose?

OpenAIS. The Heartbeat project has been effectively unmaintained for several years (with the exception of the OCF resource agents).

Can I Choose which Messaging Layer to use at Run Time?

Yes. The CRM will automatically detect who started it and behave accordingly.

Can I Have a Mixed Heartbeat-OpenAIS Cluster?

No.

What Do the Strange Validation Errors Mean?

Check out the Validation FAQ

How Do I Test My Cluster?

Pacemaker comes with a Cluster Test Suite (CTS for short) which is an integral part of our release testing. Traditionally this had been hard to set up and use however a new tool has been written to simplify the process.

It can be found at:

 http://hg.clusterlabs.org/pacemaker/stable-1.0/file/tip/cts/cluster_test

Please give it a try and send feedback via the mailing list.

Resource is Too Active

Pacemaker will try and determine what resources are active on a machine when it starts. To do this, it sends what we call a probe which uses the monitor operation of your ResourceAgent.

There are two common reasons for seeing this message:

  • Your resource really is active on more than one node
    • Check you are _not_ starting it on boot
    • Did Pacemaker suffer an internal failure? If so, please check the Help:Contents page and report it
  • Your resource doesn't implement the monitor operation correctly
    • Make sure your Resource Agent conforms to the OCF-spec by using the ocf-tester script

What Do the Prefixes in Commit Messages Mean?

  • High, Med, Low: These all indicate how much the end-user/admin should care about the change.
  • Dev: These are changes that fix bugs that don't exist in any released version of the project

Examples:

  • High - Preventing a segfault, implementing an important new feature or major changes to the behavior of a feature
  • Med - Hard to trigger bugs, bugs with workarounds, minor functional changes
  • Low - Non-functional changes, formatting or logging changes, changes to test code

OpenAIS/CoroSync Specific

Is OpenAIS Suitable for Production?

Red Hat certainly believes so... they include and support it in RHEL.

I Killed a Node but the Cluster Didn't Recover

One of the most common reasons for this is the way quorum is calculated for a 2-node cluster. Unlike Heartbeat, OpenAIS doesn't pretend 2-node clusters always have quorum.

In order to have quorum, more than half of the total number of cluster nodes need to be online. Clearly this is not the case when a node failure occurs in a 2-node cluster.

If you want to allow the remaining node to provide all the cluster services, you need to set the no-quorum-policy to ignore.

 crm configure property  no-quorum-policy=ignore

This provides the same behavior as Heartbeat, just be sure to set up STONITH to ensure data integrity.

Heartbeat Specific

How Do I Upgrade from Heartbeat?

If you plan to continue using the Heartbeat stack (as opposed to OpenAIS), simply stop Heartbeat on a node and follow the directions on the Install page.

There are no configuration changes to make and you can even do a rolling upgrade of the cluster since the over-the-wire formats are fully compatible (ie. didn't change at all ;-)

Pacemaker 0.6.x packages essentially contain (just) the 2.1.3 Heartbeat CRM with some additional bug-fixes and support for OpenAIS.

Will I be Able to do a Rolling Upgrade from Heartbeat?

Only if you still use the Heartbeat communications/membership layer.

GUI

How Do I Enable the GUI? (OpenAIS)

First you need to install the pacemaker-pygui package. Then you need to find the following lines in openais.conf

service {
	# Load the Pacemaker Cluster Resource Manager
	name: pacemaker
	ver:  0
}

and add

	use_mgmtd: 1

before the closing bracket.

How Do I Enable the GUI? (Heartbeat)

First you need to install the pacemaker-pygui package. Then you need to add the following lines to ha.cf

 apiauth	mgmtd	uid=root
 respawn	root	/usr/lib/heartbeat/mgmtd -v

These used to be implied when crm yes was present but only when heartbeat is built with the built-in mgmtd (which it no longer is).

NOTE: People on 64-bit platforms will probably need to replace lib with lib64

Administrivia

Why is the Project Called Pacemaker?

First of all, the reason its not called the CRM is because of the abundance of terms that are commonly abbreviated to those three letters.

The Pacemaker name came from Kham, a good friend of mine, and was originally used by a Java GUI that I was prototyping in early 2007. Alas other commitments have prevented the GUI from progressing much and, when it came time to choose a name for this project, Lars suggested it was an even better fit for an independent CRM.


The idea stems from the analogy between the role of this software and that of the little device that keeps the human heart pumping.

Pacemaker monitors the cluster and intervenes when necessary to ensure the smooth operation of the services it provides.


There were a number of other names (and acronyms) tossed around, but suffice to say Pacemaker was the best of the lot :-)

Why was the First Release 0.6.0?

Although the Pacemaker code had been around for over 4 years, and used in many production clusters, we wanted to wait until we achieved feature parity with both stacks before claiming 1.0 status.

There were also a number of things that we wanted to revisit before 1.0.0

  • Fix the configuration DTD
    • Stricter enforcement to prevent bad configurations
    • More intuitive attribute naming
    • Make common use-cases intuiiive
    • Remove 'development mistakes'
      • Increase consistency
      • Reduce duplication
  • Use libxml2 for XML handling
    • Add support for XPath in Pacemaker and any 3rd party apps
    • Can be done whilst maintaining over-the-wire compatibility
  • Simplify migration due to resource failure (failcount)
  • Rewrite pingd to be stack independant

We achieved this and more by the November 2008 deadline and the current stable release is 1.0.

Where Can I Get the Source Code?

 hg clone http://hg.clusterlabs.org/pacemaker/dev pacemaker

Where Can I Get Pre-built Packages?

Official packages for most major .rpm and .deb based distributions are available from:

 http://download.opensuse.org/repositories/server:/ha-clustering/

For more information, we have a description of the available packages.

What Versions of Pacemaker Are Supported?

Please refer to the Releases page for a list of versions supported directly by the project.

When seeking assistance, please try to ensure you have one of these versions.

Organizational

Are you Forking Heartbeat?

No. Only the CRM code (code that I am the sole author and maintainer of) is part of the new project (and no longer part of Heartbeat).

Why was the Pacemaker Project Created?

The decision was made to spin-off the CRM into its own project after the 2.1.3 Heartbeat release in order to

  • support both the OpenAIS and Heartbeat cluster stacks equally
  • decouple the release cycles of two projects at very different stages of their life-cycles
  • foster the clearer package boundaries, thus leading to
  • better and more stable interfaces

What About the GUI, TSA agent, CIM and SNMP?

All of these components require Pacemaker libraries to build and therefor cannot be included Heartbeat any longer.

These high-level management components have been bundled together as part of the (badly named) pyGUI project which is ready and waiting for someone out there to pick up and maintain.

Until someone volunteers, I'll apply patches and update the packages but I wont be doing any active development myself.

Links:

Why Can't I Create a Wiki Account?

At the moment I've not sorted out the site security so I've disabled the creation of new users. If you'd like an account, just email us at pacemaker@oss.clusterlabs.org and we can create one manually for you.

Personal tools
communication