Hello all,
I am trying to create a corosync/pacemaker cluster using CentOS 6.0. However, I'm having a great deal of difficulty doing so.
Corosync has a valid configuration file and an authkey has been generated.
When I run /etc/init.d/corosync I see that only corosync is started.
From experience working with corosync/pacemaker before, I know that
this is not enough to have a functioning cluster. For some reason the base install (with or without updates) is not starting corosync dependencies.
I've even tried using corosync/pacemaker for the EPEL 6 repo, and still the init-script will not start corosync dependencies.
Expected: corosync /usr/lib64/heartbeat/stonithd /usr/lib64/heartbeat/cib /usr/lib64/heartbeat/lrmd /usr/lib64/heartbeat/attrd /usr/lib64/heartbeat/pengine
Observed: corosync
My install options are: %packages @base @core @ha @nfs-file-server @network-file-system-client @resilient-storage @server-platform @server-policy @storage-client-multipath @system-admin-tools pax oddjob sgpio pacemaker dlm-pcmk screen lsscsi -rgmanager %end
The logs from the server aren't terribly helpful either: Nov 23 12:13:45 cheapo4 corosync[2509]: [pcmk ] info: spawn_child: Forked child 2515 for process stonith-ng Nov 23 12:13:45 cheapo4 corosync[2509]: [pcmk ] info: spawn_child: Forked child 2516 for process cib Nov 23 12:13:45 cheapo4 corosync[2509]: [pcmk ] info: spawn_child: Forked child 2517 for process lrmd Nov 23 12:13:45 cheapo4 corosync[2509]: [pcmk ] info: spawn_child: Forked child 2518 for process attrd Nov 23 12:13:45 cheapo4 corosync[2509]: [pcmk ] info: spawn_child: Forked child 2519 for process pengine Nov 23 12:13:45 cheapo4 corosync[2509]: [pcmk ] info: spawn_child: Forked child 2520 for process crmd Nov 23 12:13:45 cheapo4 corosync[2509]: [SERV ] Service engine loaded: Pacemaker Cluster Manager 1.1.2 Nov 23 12:13:45 cheapo4 corosync[2509]: [SERV ] Service engine loaded: corosync extended virtual synchrony service Nov 23 12:13:45 cheapo4 corosync[2509]: [SERV ] Service engine loaded: corosync configuration service Nov 23 12:13:45 cheapo4 corosync[2509]: [SERV ] Service engine loaded: corosync cluster closed process group service v1.01 Nov 23 12:13:45 cheapo4 corosync[2509]: [SERV ] Service engine loaded: corosync cluster config database access v1.01 Nov 23 12:13:45 cheapo4 corosync[2509]: [SERV ] Service engine loaded: corosync profile loading service Nov 23 12:13:45 cheapo4 corosync[2509]: [SERV ] Service engine loaded: corosync cluster quorum service v0.1 Nov 23 12:13:45 cheapo4 corosync[2509]: [MAIN ] Compatibility mode set to whitetank. Using V1 and V2 of the synchronization engine. Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] ERROR: pcmk_wait_dispatch: Child process lrmd exited (pid=2517, rc=100) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] notice: pcmk_wait_dispatch: Child process lrmd no longer wishes to be respawned Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] info: update_member: Node cheapo4.jrz.cbn now has process list: 00000000000000000000000000111302 (1118978) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] ERROR: pcmk_wait_dispatch: Child process cib exited (pid=2516, rc=100) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] notice: pcmk_wait_dispatch: Child process cib no longer wishes to be respawned Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] info: update_member: Node cheapo4.jrz.cbn now has process list: 00000000000000000000000000111202 (1118722) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] ERROR: pcmk_wait_dispatch: Child process crmd exited (pid=2520, rc=100) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] notice: pcmk_wait_dispatch: Child process crmd no longer wishes to be respawned Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] info: update_member: Node cheapo4.jrz.cbn now has process list: 00000000000000000000000000111002 (1118210) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] ERROR: pcmk_wait_dispatch: Child process attrd exited (pid=2518, rc=100) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] notice: pcmk_wait_dispatch: Child process attrd no longer wishes to be respawned Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] info: update_member: Node cheapo4.jrz.cbn now has process list: 00000000000000000000000000110002 (1114114) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] ERROR: pcmk_wait_dispatch: Child process pengine exited (pid=2519, rc=100) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] notice: pcmk_wait_dispatch: Child process pengine no longer wishes to be respawned Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] info: update_member: Node cheapo4.jrz.cbn now has process list: 00000000000000000000000000100002 (1048578) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] ERROR: pcmk_wait_dispatch: Child process stonith-ng exited (pid=2515, rc=100) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] notice: pcmk_wait_dispatch: Child process stonith-ng no longer wishes to be respawned Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] info: update_member: Node cheapo4.jrz.cbn now has process list: 00000000000000000000000000000002 (2)
Googling suggests this is the result of mis-matched versions of pacemaker/corosync; but this is the base install and the problem persists even when I install it from EPEL6 repos.
Is anyone else experiencing difficulties with corosync/pacemaker on CentOS6?
Thanks, -Hal
Hi,
Did you configure corosync ? Normally corosync starts pacemaker, which in turn starts the heartbeat deamons. But you have to configure the latter using for example a pcmk file with configuration in /etc/corosync/conf.d/ (from the top of my head). I normally use : http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Clusters_from_Scratc...
Regards,
Michel
On Wed, 2011-11-23 at 12:16 -0500, Hal Martin wrote:
Hello all,
I am trying to create a corosync/pacemaker cluster using CentOS 6.0. However, I'm having a great deal of difficulty doing so.
Corosync has a valid configuration file and an authkey has been generated.
When I run /etc/init.d/corosync I see that only corosync is started.
From experience working with corosync/pacemaker before, I know that
this is not enough to have a functioning cluster. For some reason the base install (with or without updates) is not starting corosync dependencies.
I've even tried using corosync/pacemaker for the EPEL 6 repo, and still the init-script will not start corosync dependencies.
Expected: corosync /usr/lib64/heartbeat/stonithd /usr/lib64/heartbeat/cib /usr/lib64/heartbeat/lrmd /usr/lib64/heartbeat/attrd /usr/lib64/heartbeat/pengine
Observed: corosync
My install options are: %packages @base @core @ha @nfs-file-server @network-file-system-client @resilient-storage @server-platform @server-policy @storage-client-multipath @system-admin-tools pax oddjob sgpio pacemaker dlm-pcmk screen lsscsi -rgmanager %end
The logs from the server aren't terribly helpful either: Nov 23 12:13:45 cheapo4 corosync[2509]: [pcmk ] info: spawn_child: Forked child 2515 for process stonith-ng Nov 23 12:13:45 cheapo4 corosync[2509]: [pcmk ] info: spawn_child: Forked child 2516 for process cib Nov 23 12:13:45 cheapo4 corosync[2509]: [pcmk ] info: spawn_child: Forked child 2517 for process lrmd Nov 23 12:13:45 cheapo4 corosync[2509]: [pcmk ] info: spawn_child: Forked child 2518 for process attrd Nov 23 12:13:45 cheapo4 corosync[2509]: [pcmk ] info: spawn_child: Forked child 2519 for process pengine Nov 23 12:13:45 cheapo4 corosync[2509]: [pcmk ] info: spawn_child: Forked child 2520 for process crmd Nov 23 12:13:45 cheapo4 corosync[2509]: [SERV ] Service engine loaded: Pacemaker Cluster Manager 1.1.2 Nov 23 12:13:45 cheapo4 corosync[2509]: [SERV ] Service engine loaded: corosync extended virtual synchrony service Nov 23 12:13:45 cheapo4 corosync[2509]: [SERV ] Service engine loaded: corosync configuration service Nov 23 12:13:45 cheapo4 corosync[2509]: [SERV ] Service engine loaded: corosync cluster closed process group service v1.01 Nov 23 12:13:45 cheapo4 corosync[2509]: [SERV ] Service engine loaded: corosync cluster config database access v1.01 Nov 23 12:13:45 cheapo4 corosync[2509]: [SERV ] Service engine loaded: corosync profile loading service Nov 23 12:13:45 cheapo4 corosync[2509]: [SERV ] Service engine loaded: corosync cluster quorum service v0.1 Nov 23 12:13:45 cheapo4 corosync[2509]: [MAIN ] Compatibility mode set to whitetank. Using V1 and V2 of the synchronization engine. Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] ERROR: pcmk_wait_dispatch: Child process lrmd exited (pid=2517, rc=100) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] notice: pcmk_wait_dispatch: Child process lrmd no longer wishes to be respawned Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] info: update_member: Node cheapo4.jrz.cbn now has process list: 00000000000000000000000000111302 (1118978) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] ERROR: pcmk_wait_dispatch: Child process cib exited (pid=2516, rc=100) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] notice: pcmk_wait_dispatch: Child process cib no longer wishes to be respawned Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] info: update_member: Node cheapo4.jrz.cbn now has process list: 00000000000000000000000000111202 (1118722) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] ERROR: pcmk_wait_dispatch: Child process crmd exited (pid=2520, rc=100) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] notice: pcmk_wait_dispatch: Child process crmd no longer wishes to be respawned Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] info: update_member: Node cheapo4.jrz.cbn now has process list: 00000000000000000000000000111002 (1118210) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] ERROR: pcmk_wait_dispatch: Child process attrd exited (pid=2518, rc=100) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] notice: pcmk_wait_dispatch: Child process attrd no longer wishes to be respawned Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] info: update_member: Node cheapo4.jrz.cbn now has process list: 00000000000000000000000000110002 (1114114) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] ERROR: pcmk_wait_dispatch: Child process pengine exited (pid=2519, rc=100) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] notice: pcmk_wait_dispatch: Child process pengine no longer wishes to be respawned Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] info: update_member: Node cheapo4.jrz.cbn now has process list: 00000000000000000000000000100002 (1048578) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] ERROR: pcmk_wait_dispatch: Child process stonith-ng exited (pid=2515, rc=100) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] notice: pcmk_wait_dispatch: Child process stonith-ng no longer wishes to be respawned Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] info: update_member: Node cheapo4.jrz.cbn now has process list: 00000000000000000000000000000002 (2)
Googling suggests this is the result of mis-matched versions of pacemaker/corosync; but this is the base install and the problem persists even when I install it from EPEL6 repos.
Is anyone else experiencing difficulties with corosync/pacemaker on CentOS6?
Thanks, -Hal _______________________________________________ CentOS mailing list CentOS@centos.org http://lists.centos.org/mailman/listinfo/centos
Am 23.11.2011 14:24 schrieb "Michel van Deventer" michel@van.deventer.cx:
Hi,
Did you configure corosync ? Normally corosync starts pacemaker, which in turn starts the heartbeat deamons. But you have to configure the latter using for example a pcmk file with configuration in /etc/corosync/conf.d/ (from the top of my head).
Yes. /etc/corosync/services.d/pcmk is configured as per that document; still not getting the expected start behaviour.
Corosync does start properly, just not via the init-script.
Thanks, -Hal
I normally use :
http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Clusters_from_Scratc...
Regards, Michel
On Wed, 2011-11-23 at 12:16 -0500, Hal Martin wrote:
Hello all,
I am trying to create a corosync/pacemaker cluster using CentOS 6.0. However, I'm having a great deal of difficulty doing so.
Corosync has a valid configuration file and an authkey has been
generated.
When I run /etc/init.d/corosync I see that only corosync is started.
From experience working with corosync/pacemaker before, I know that
this is not enough to have a functioning cluster. For some reason the base install (with or without updates) is not starting corosync dependencies.
I've even tried using corosync/pacemaker for the EPEL 6 repo, and still the init-script will not start corosync dependencies.
Expected: corosync /usr/lib64/heartbeat/stonithd /usr/lib64/heartbeat/cib /usr/lib64/heartbeat/lrmd /usr/lib64/heartbeat/attrd /usr/lib64/heartbeat/pengine
Observed: corosync
My install options are: %packages @base @core @ha @nfs-file-server @network-file-system-client @resilient-storage @server-platform @server-policy @storage-client-multipath @system-admin-tools pax oddjob sgpio pacemaker dlm-pcmk screen lsscsi -rgmanager %end
The logs from the server aren't terribly helpful either: Nov 23 12:13:45 cheapo4 corosync[2509]: [pcmk ] info: spawn_child: Forked child 2515 for process stonith-ng Nov 23 12:13:45 cheapo4 corosync[2509]: [pcmk ] info: spawn_child: Forked child 2516 for process cib Nov 23 12:13:45 cheapo4 corosync[2509]: [pcmk ] info: spawn_child: Forked child 2517 for process lrmd Nov 23 12:13:45 cheapo4 corosync[2509]: [pcmk ] info: spawn_child: Forked child 2518 for process attrd Nov 23 12:13:45 cheapo4 corosync[2509]: [pcmk ] info: spawn_child: Forked child 2519 for process pengine Nov 23 12:13:45 cheapo4 corosync[2509]: [pcmk ] info: spawn_child: Forked child 2520 for process crmd Nov 23 12:13:45 cheapo4 corosync[2509]: [SERV ] Service engine loaded: Pacemaker Cluster Manager 1.1.2 Nov 23 12:13:45 cheapo4 corosync[2509]: [SERV ] Service engine loaded: corosync extended virtual synchrony service Nov 23 12:13:45 cheapo4 corosync[2509]: [SERV ] Service engine loaded: corosync configuration service Nov 23 12:13:45 cheapo4 corosync[2509]: [SERV ] Service engine loaded: corosync cluster closed process group service v1.01 Nov 23 12:13:45 cheapo4 corosync[2509]: [SERV ] Service engine loaded: corosync cluster config database access v1.01 Nov 23 12:13:45 cheapo4 corosync[2509]: [SERV ] Service engine loaded: corosync profile loading service Nov 23 12:13:45 cheapo4 corosync[2509]: [SERV ] Service engine loaded: corosync cluster quorum service v0.1 Nov 23 12:13:45 cheapo4 corosync[2509]: [MAIN ] Compatibility mode set to whitetank. Using V1 and V2 of the synchronization engine. Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] ERROR: pcmk_wait_dispatch: Child process lrmd exited (pid=2517, rc=100) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] notice: pcmk_wait_dispatch: Child process lrmd no longer wishes to be respawned Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] info: update_member: Node cheapo4.jrz.cbn now has process list: 00000000000000000000000000111302 (1118978) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] ERROR: pcmk_wait_dispatch: Child process cib exited (pid=2516, rc=100) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] notice: pcmk_wait_dispatch: Child process cib no longer wishes to be respawned Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] info: update_member: Node cheapo4.jrz.cbn now has process list: 00000000000000000000000000111202 (1118722) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] ERROR: pcmk_wait_dispatch: Child process crmd exited (pid=2520, rc=100) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] notice: pcmk_wait_dispatch: Child process crmd no longer wishes to be respawned Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] info: update_member: Node cheapo4.jrz.cbn now has process list: 00000000000000000000000000111002 (1118210) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] ERROR: pcmk_wait_dispatch: Child process attrd exited (pid=2518, rc=100) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] notice: pcmk_wait_dispatch: Child process attrd no longer wishes to be respawned Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] info: update_member: Node cheapo4.jrz.cbn now has process list: 00000000000000000000000000110002 (1114114) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] ERROR: pcmk_wait_dispatch: Child process pengine exited (pid=2519, rc=100) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] notice: pcmk_wait_dispatch: Child process pengine no longer wishes to be respawned Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] info: update_member: Node cheapo4.jrz.cbn now has process list: 00000000000000000000000000100002 (1048578) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] ERROR: pcmk_wait_dispatch: Child process stonith-ng exited (pid=2515, rc=100) Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] notice: pcmk_wait_dispatch: Child process stonith-ng no longer wishes to be respawned Nov 23 12:13:46 cheapo4 corosync[2509]: [pcmk ] info: update_member: Node cheapo4.jrz.cbn now has process list: 00000000000000000000000000000002 (2)
Googling suggests this is the result of mis-matched versions of pacemaker/corosync; but this is the base install and the problem persists even when I install it from EPEL6 repos.
Is anyone else experiencing difficulties with corosync/pacemaker on
CentOS6?
Thanks, -Hal _______________________________________________ CentOS mailing list CentOS@centos.org http://lists.centos.org/mailman/listinfo/centos
CentOS mailing list CentOS@centos.org http://lists.centos.org/mailman/listinfo/centos