Savaged by Softdog, a Cautionary Tale 6 minutes readHardware is imperfect, and software contains bugs. Don’t use software based watchdogs and expect to survive the latter.
A New Fencing Mechanism (TBD) 4 minutes readProtecting database centric applications in the absense of power fencing
Two Nodes - The Devil is in the Details 9 minutes readMany people love 2-node clusters because they seem conceptually simpler and 33% cheaper, but most will have subtle failure modes
Containerizing Databases with Kubernetes and Stateful Sets 5 minutes readAn examination of the canonical StatefulSet example for managing databases with Kubernetes from a rigorous HA perspective.
HA for Composible Deployments of OpenStack 5 minutes readComposable roles are a hot topic, I present a proposal for how to accommodate cluster-managed services.
Thoughts on HA for Multi-Subnet Deployments of OpenStack 3 minutes readInstalling OpenStack in a spine-and-leaf network presents problems for high availability, these are some thoughts on what do to about it.
Evolving the OpenStack HA Architecture 3 minutes readA future revision of the HA architecture should limit Pacemaker involvement to services like Galera, Rabbit and the few remaining OpenStack services that can...
Minimum Viable Cluster 9 minutes readThe is a spectrum for availability solutions, where you fit depends on what assumptions you can make about your application stack.
Receiving Reliable Notification of Cluster Events 3 minutes readA more reliable way to receive notification of cluster events is coming in Pacemaker 1.1.14.
Fencing for Fun and Profit with SBD 6 minutes readSBD can be particularly useful in environments where traditional fencing mechanisms are not possible.
Double Failure - Get out of Jail Free? Not so Fast less than 1 minute readIt’s tempting to explain away some recovery scenarios, but it’s not always appropriate.
Life at the Intersection of Pets and Cattle 5 minutes readThe theory goes that pets have no place in the server room. Everything should be stateless and replicated. If one copy dies, who cares, there are 100 more.
Adding Managed Compute Nodes to a Highly Available Openstack Control Plane 5 minutes readAs previously announced on RDO list and GitHub, we now have a way to allow Pacemaker to manage compute nodes within a single cluster while still allowing us ...
Feature Spotlight - Smart Resource Restart from the Command Line 1 minute readAnatomy of resource restart
Feature Spotlight - Controllable Resource Discovery 1 minutes readComing in 1.1.13 is a new option for location constraints: resource-discovery
Release Candidate: 1.1.12-rc1 6 minutes readAs promised, this announcement brings the first release candidate for Pacemaker 1.1.12
Potential for data corruption affecting Pacemaker 1.1.6 through 1.1.9 2 minutes readIt has come to my attention that the potential for data corruption exists in Pacemaker versions 1.1.6 to 1.1.9
Announcing 1.1.11 Beta Testing 1 minute readWith over 400 updates since the release of 1.1.10, its time to start thinking about a new release.
Pacemaker and RHEL 6.4 (redux) 1 minute readThe good news is that as of Novemeber 1st, Pacemaker is now supported on RHEL 6.4 - with two caveats.
Changes to the Remote Wire Protocol in 1.1.11 less than 1 minute readUnfortunately the current wire protocol used by pacemaker_remoted for exchanging messages was found to be suboptimal and we have taken the decision to change...
Release candidate: 1.1.10-rc7 1 minute readAnnouncing the seventh release candidate for Pacemaker 1.1.10
Release candidate: 1.1.10-rc6 1 minute readAnnouncing the sixth release candidate for Pacemaker 1.1.10
GPG Quickstart 5 minutes readIt seemed timely that I should refresh both my GPG knowledge and my keys. I am summarizing my method (and sources) below in the event that they may prove use...
Release candidate: 1.1.10-rc5 2 minutes readLets try this again… Announcing the fourth and a half release candidate for Pacemaker 1.1.10
Release candidate: 1.1.10-rc3 2 minutes readAnnouncing the third release candidate for Pacemaker 1.1.10
Pacemaker on RHEL6.4 3 minutes readOver the last couple of years, we have been evolving the stack in two ways of particular relevance to RHEL customers:
Release candidate: 1.1.10-rc2 1 minute readAnnouncing the second release candidate for Pacemaker 1.1.10
Mixing Pacemaker versions 2 minutes readWhen mixing Pacemaker versions, there are two factors that need to be considered. The first is obviously the package version - if that is the same, then the...
Now powered by Octopress less than 1 minute readWith Posterous being shut down (even though I wasn’t using it for TheClusterGuy), I’ve decided the time has come to take back control of my content.
Large Cluster Performance: Redux 3 minutes readNormally I have access to 4 virtual cluster nodes on my laptop, however for the first time since leaving SUSE, I have had the opportunity to test with 8 and ...
Pacemaker 1.0.13 now available 1 minutes readThanks once again to the efforts of the fine folks from NTT, the latest bug fixes have been back-ported from 1.1 and another instalment of the Pacemaker 1.0 ...
Pacemaker and Cluster Filesystems 2 minutes readThere is some confusion out there on how to use Pacemaker with the OCFS2 and GFS2 cluster filesystems.
Can Pacemaker 1.1.8 be used with… less than 1 minute readShort answer: yes Longer answer: seriously, yes :-)
Pacemaker 1.1.7 Now Available less than 1 minute readAfter much hard work, the latest installment of the Pacemaker 1.1 release series is now ready for general consumption.
Pacemaker 1.0.12 Released 1 minute readThanks once again to the efforts of Keisuke MORI from NTT, the latest bug fixes have been back-ported from 1.1 and another instalment of the Pacemaker 1.0 re...
New Version Control System less than 1 minute readSince September, Pacemaker has started using Git for the 1.1 and devel trees.
New Issue Tracker less than 1 minute readSince it’s clearly not acceptable for our issue tracker to be offline for months at a time, it is time to replace the Bugzilla instance hosted by the Linux F...
Pacemaker 1.0.11 Released 2 minutes readThe latest installment of the Pacemaker 1.0 release series is now ready for general consumption.
Pacemaker 1.1.5 Released less than 1 minute readThe latest installment of the Pacemaker 1.1 release series is now ready for general consumption.
Pacemaker Release Roundup 1 minute readIt may have seemed quiet since July, but things were actually so busy that I couldn’t find the time to publicize our new releases.
New Logo? less than 1 minute readOne unexpected outcome from the recent Linux Plumbers conference was the contribution of a new logo to the project by NTT.
Pacemaker, Heartbeat, Corosync, WTF? 1 minute readOne question I still get a lot is what all these projects are/do and how they all relate.
Large Cluster Performance 2 minutes readOver the last few days, I’ve spent a bunch of time improving Pacemaker’s performance in large clusters.
Introducing the Pacemaker Master Control Process for Corosync-based Clusters 2 minutes readThe latest addition to the Pacemaker 1.1 series is a master control process (MCP) and associated init script.
Pacemaker 1.0.9 Released 5 minutes readThe latest installment of the Pacemaker 1.0 stable series is now ready for general consumption.
Feature Spotlight: Utilization 2 minutes readNew in 1.1 is the ability for Pacemaker to factor the system resources (RAM, CPU, etc) into its placement algorithms.
Pacemaker ships as part of Ubuntu 10.4 - Lucid Lynx less than 1 minute readUbuntu LTS 10.04 now comes with full support for Pacemaker on Corosync and Heartbeat: http://fghaas.wordpress.com/2010/05/03/ubuntu-10-04-with-full-cluster-s...
Pacemaker in Debian less than 1 minute readGood news for Debian fans, Pacemaker has officially made it into Sid.
New Pacemaker Packages less than 1 minute readI’ve begun uploading 1.0.8-3 to the clusterlabs.org servers.
Website Updates less than 1 minute readThe http://www.clusterlabs.org server has been migrated and now features a new splash-page and a custom skin for the wiki.
Pacemaker 1.0.8 Released 4 minutes readThe latest installment of the Pacemaker 1.0 stable series is now ready for general consumption.
New Pacemaker Release Series 3 minutes readA number of new branches have been created in the last few days which are integral to how we plan to add new features in a controlled manner.
Pacemaker removed from OBS less than 1 minute readToday I removed Pacemaker from server:ha-clustering on the openSUSE build service.
Pacemaker 1.0.7 Released 4 minutes readThe latest installment of the Pacemaker 1.0 stable series is now ready for general consumption.
Ubuntu looking for Pacemaker testers less than 1 minute readUbuntu is looking to switch its supported cluster stack to Corosync+Pacemaker and has put out a “Call for testers”.
Pre-Announce: End of Pacemaker 0.6 support is near less than 1 minute readUnless there are violent objections, I plan to officially stop supporting 0.6 at the end of February.
New Documentation Formats less than 1 minute readI’m pleased to report that the core Pacemaker documentation is now available in PDF, HTML (chunked and single page) and even TXT formats.
Pacemaker 1.0.6 Released 4 minutes readThe next installment of the Pacemaker 1.0 stable series is now ready for general consumption.
Advisory: Don’t use Pacemaker on Corosync (yet) less than 1 minute readI spent some time looking into the state of the Pacemaker/Corosync integration today and I can only recommend Pacemaker users stay on the previous version of...
Version Control Prompt 1 minute readI find it convenient to include current SCM data before my regular Bash prompt (reduces the chance of “accidents”). Perhaps someone else will find it useful ...
Clusters From Scratch less than 1 minute readThe first of a new series of step-by-step guides for Pacemaker.
Another Documentation Update less than 1 minute readQuick FYI… I’ve made some more improvements to the Configuration Explained PDF http://clusterlabs.org/mediawiki/images/f/fb/Configuration_Explained.pdf
Dev Repository Recreated less than 1 minute readFor a variety of reasons, the Pacemaker dev repository has been recreated and its history pruned of non-pacemaker related changes.
Poll: Which Distro do you use for Pacemaker? less than 1 minute readPlease let us know by filling out the following poll: http://www.clusterlabs. org/wiki/UsagePoll
Pacemaker 1.0.5 Released 2 minutes readI’m back from vacation so it’s time for another Pacemaker bug-fix release. Testing went flawlessly and so without further ado, here it is…
Choose the Right Hardware 1 minute readRecently I was asked to help diagnose a cluster that was behaving incredibly badly.
Updated Documentation less than 1 minute readThe Configuration Explained PDF has been updated for 1.0.4/5 [http://clusterlabs.org/mediawiki/images/f/fb/Configuration_Explained.pdf]
Pacemaker 1.0.5: Testing In Progress 1 minute readA quick note to say that 1.0.5 testing officially started today. Release testing usually takes 1-2 weeks.
Pacemaker in Fedora 12 less than 1 minute readGood news for Fedora fans, we’ve successfully navigated the required red tape and Pacemaker will ship in Fedora 12.
Pacemaker 1.0.4 Released 2 minutes readIt took a little longer than expected, but the latest 1.0 maintenance release (1.0.4) is finally available.
Highly Available Data Corruption 4 minutes read Whenever there is doubt, there is no doubt - Robert De Niro, Ronin
A Brief, Incomplete, and Mostly Wrong History of Programming Languages less than 1 minute readA Brief, Incomplete, and Mostly Wrong History of Programming Languages
Why Wont the Cluster Start my Services? 2 minutes readIts a common question and a worthy topic for an extended article. Here’s the steps I usually follow when diagnosing such issues.
raison d’etre 1 minute readThis tumbl/blog/thingy exists because I’ve finally accepted that “If we build it, they will come” is a fallacy. The internet is a big place and if you don’t...
Is this thing on? less than 1 minute readNothing to see here yet. Just taking the software for a spin.