Linux HA Installation and Maintenance Manual


1 Overview

The Dingxin (顶新) Suzhou IDC data center runs four Linux HA + Oracle database clusters. Their configuration is as follows:

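1.1 Store accounting (店会) system & FC-arap system cluster

Cluster resource group name     ac_group                      fc_group
Cluster resource mount point    /ac-oradata                   /fc-oradata
Cluster service IP address      fmpracora-vip / 10.0.101.11   fmprfcora-vip / 10.0.101.12
Cluster listener                cvs_lsn                       arap2_lsn
Cluster database                cvsdb                         arap2db
Preferred node name             fmpracora-priv                fmprfcora-priv
Preferred node IP               10.0.101.9                    10.0.101.10
Cluster fence communication IP  10.0.202.15                   10.0.202.19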

 

1.2 Ordering-Revenue & Master data-Budget review-Base platform cluster

Cluster resource group name     ma_group                      po_group
Cluster resource mount point    /ma-oradata                   /po-oradata
Cluster service IP address      fmprmaora-vip / 10.0.101.7    fmprpoora-vip / 10.0.101.8
Cluster listener                arpdb_lsn                     yingshou_lsn
Cluster database                master_db, arpdb_db           dingoudb, yingshou_db
Preferred node name             fmprmaora-priv                fmprpoora-priv
Preferred node IP               10.0.101.5                    10.0.101.6
Cluster fence communication IP  10.0.202.16                   10.0.202.20

 

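1.3 IDC-SCM system cluster

Cluster resource group name     idc_group
Cluster resource mount point    /idc-oradata
Cluster service IP address      fmpridcora-vip / 10.0.101.4
Cluster listener                arpdb_lsn
Cluster database                fmidcdb
Preferred node name             fmpridcora-priv
Preferred node IP               10.0.101.4
Cluster fence communication IP  10.0.202.17

The other node in this cluster is 10.0.101.3.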

1.4 JDE system & JDE test system cluster

Cluster resource group name     pr_group                      dr_group
Cluster resource mount point    /jdepr-oradata                /jdedr-oradata
Cluster service IP address      fmprjdeora-vip / 10.0.101.15  fmdrjdeora-vip / 10.0.101.16
Cluster listener                jde_lsn                       jdedbdr_lsn
Cluster database                jde_db                        jdedr_db
Preferred node name             fmprjdeora-priv               fmdrjdeora-priv
Preferred node IP               10.0.101.13                   10.0.101.14
Cluster fence communication IP  10.0.202.18                   10.0.202.22

 

 

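2 Installation and configuration manual

The HA software installation procedure differs between RHEL 6.8 and RHEL 7.2, but the cluster configuration procedure is the same. The software installation is therefore described separately for each release, and the configuration is described once.

2.1 RHEL 6.8 HA software installation

Mount the RHEL 6.8 ISO image on /software/rh68, then edit /etc/yum.conf.

2.1.1 Edit /etc/yum.conf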

[local]

 

name=local

baseurl=file:///software/rh68

enabled=1

gpgcheck=1

gpgkey=file:///software/rh68/RPM-GPG-KEY-redhat-release

 

[HighAvailability]

name=HighAvailability

baseurl=file:///software/rh68/HighAvailability

enabled=1

gpgcheck=0

2.1.2 Install the software

# yum install pcs pacemaker cman fence-agents

Starting with RHEL 6.5, RHEL HA is managed with pcs rather than rgmanager. Within the RHEL 6 series, only RHEL 6.8 ships the Oracle resource agents.

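2.1.3 Disable and stop the host firewall

Disable the firewall at boot:

# chkconfig iptables off

# chkconfig ip6tables off

Stop the firewall services:

# service iptables stop

# service ip6tables stop

 

If the firewall has to stay enabled instead, the cluster needs the following ports:

For TCP: Ports 2224, 3121, 21064

For UDP: Port 5405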
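
Disabling the firewall is the approach this manual takes. Where policy requires the firewall to stay on, the ports listed above could be opened instead. The following is a minimal sketch and not part of the original procedure: it only prints candidate iptables commands for RHEL 6 so that they can be reviewed first. The function name print_ha_firewall_rules and the choice of the INPUT chain are assumptions.

```shell
#!/bin/sh
# Sketch: print (not run) iptables rules that open the cluster ports listed above.
# Assumption: rules are inserted into the INPUT chain; adapt to the local policy.
TCP_PORTS="2224 3121 21064"
UDP_PORTS="5405"

print_ha_firewall_rules() {
    for port in $TCP_PORTS; do
        echo "iptables -I INPUT -p tcp --dport $port -j ACCEPT"
    done
    for port in $UDP_PORTS; do
        echo "iptables -I INPUT -p udp --dport $port -j ACCEPT"
    done
}

print_ha_firewall_rules
```

Piping the output to sh would apply the rules, and `service iptables save` would then persist them. On RHEL 7, firewalld ships a predefined high-availability service that may serve the same purpose.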

 

2.1.4 Enable and start the pcsd service

# chkconfig  pcsd on

# service pcsd start

 

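2.1.5 Reset the hacluster password

hacluster is the pcs administration account. Setting the same password on every server in the cluster is recommended.

# passwd hacluster     (password used here: hacluster1234)

Changing password for user hacluster.

New password:

BAD PASSWORD: The password contains the user name in some form

Retype new password:

passwd: all authentication tokens updated successfully.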

2.2 RHEL 7.2 HA software installation

2.2.1  /etc/yum.conf

[HighAvailability]

name=HighAvailability

baseurl=file:///software/rh72/addons/HighAvailability

enabled=1

gpgcheck=0

 

[local]

name=local

baseurl=file:///software/rh72/

enabled=1

gpgcheck=1

gpgkey=file:///software/rh72/RPM-GPG-KEY-redhat-release

2.2.2 Install the software

# yum install pcs fence-agents-all

 

Within the RHEL 7 series, only RHEL 7.2 ships the Oracle HA resource agents; RHEL 7.0 and 7.1 do not. RHEL 7.2 no longer uses the cman service.

2.2.3 Enable and start the pcsd service

[root@fmprfcdb ~]# systemctl start pcsd.service

[root@fmprfcdb ~]# systemctl enable pcsd.service

ln -s '/usr/lib/systemd/system/pcsd.service' '/etc/systemd/system/multi-user.target.wants/pcsd.service'

 

Check that pcsd is enabled to start automatically at boot:

[root@fmpracora dinxin]# systemctl list-unit-files |grep pcs

pcsd.service                                enabled

rpcsvcgssd.service                          static

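2.2.4 Disable and stop the host firewall

# systemctl stop firewalld

# systemctl disable firewalld

2.2.5 Reset the hacluster password

hacluster is the pcs administration account. Setting the same password on every server in the cluster is recommended.

# passwd hacluster     (password used here: hacluster1234)

Changing password for user hacluster.

New password:

BAD PASSWORD: The password contains the user name in some form

Retype new password:

passwd: all authentication tokens updated successfully.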

 

2.3 Configure the cluster

The cluster configuration steps are the same on RHEL 6.8 and RHEL 7.2 and are described below.

2.3.1 Edit /etc/hosts

[root@fmprpoora oracle]# cat /etc/hosts

127.0.0.1   localhost localhost.localdomain localhost4 localhost4.localdomain4

::1         localhost localhost.localdomain localhost6 localhost6.localdomain6

 

10.0.101.6 fmprmaora

10.0.101.5 fmprpoora

10.0.200.16 fmprpoora-priv

10.0.200.20 fmprmaora-priv

10.0.101.7 fmprpoora-vip

10.0.101.8 fmprmaora-vip

 

fmprpoora-priv and fmprmaora-priv are the host names bound to the heartbeat IP addresses; they are used for communication between the cluster nodes.
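
Because the cluster is built on the -priv names, every node needs the same entries in /etc/hosts. The following is a minimal sketch of a check that could be run on each node before authorizing the cluster; the function name check_hosts is hypothetical and is not part of pcs.

```shell
#!/bin/sh
# check_hosts FILE NAME...
# Prints "missing: NAME" for every NAME that has no entry in FILE
# (hosts format: an address followed by one or more names).
# Returns 0 when all names are present, 1 otherwise.
check_hosts() {
    hosts_file=$1
    shift
    status=0
    for name in "$@"; do
        if ! awk -v n="$name" '
            /^[ \t]*#/ { next }                      # skip comment lines
            { for (i = 2; i <= NF; i++) if ($i == n) found = 1 }
            END { exit(found ? 0 : 1) }
        ' "$hosts_file"; then
            echo "missing: $name"
            status=1
        fi
    done
    return $status
}

# Example (run on each node):
#   check_hosts /etc/hosts fmprpoora-priv fmprmaora-priv fmprpoora-vip fmprmaora-vip
```

The check only confirms that a name is listed; it does not verify that the address is correct or reachable.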

2.3.2 Authorize hacluster to access the cluster nodes

# pcs cluster auth fmprfcdb-priv fmpracdb-priv

Username: hacluster

Password:

fmprfcdb-priv: Authorized

fmpracdb-priv: Authorized

The authorization tokens are stored in the following file:

[root@fmprpoora oracle]# cat /var/lib/pcsd/tokens

{

  "format_version": 2,

  "data_version": 3,

  "tokens": {

    "fmprmaora-priv": "a2a19af3-584e-4f88-873b-c3bee27d137c",

    "fmprpoora-priv": "33169580-5ba5-4cb3-9a1c-a4a87fd3d226"

  }

}

2.3.3 Create the cluster

 [root@fmprfcora pcsd]# pcs cluster setup --start --name fmprfcac_cluster  fmprfcora-priv fmpracora-priv

Shutting down pacemaker/corosync services...

Redirecting to /bin/systemctl stop  pacemaker.service

Redirecting to /bin/systemctl stop  corosync.service

Killing any remaining services...

Removing all cluster configuration files...

fmprfcora-priv: Succeeded

fmpracora-priv: Succeeded

Starting cluster on nodes: fmprfcora-priv, fmpracora-priv...

fmpracora-priv: Starting Cluster...

fmprfcora-priv: Starting Cluster...

Synchronizing pcsd certificates on nodes fmprfcora-priv, fmpracora-priv...

fmprfcora-priv: Success

fmpracora-priv: Success

 

Restaring pcsd on the nodes in order to reload the certificates...

fmprfcora-priv: Success

fmpracora-priv: Success

[root@fmprfcora pcsd]#

2.3.4 Enable the cluster services to start automatically at boot

[root@fmprfcora pcsd]#  pcs cluster enable --all

fmprfcora-priv: Cluster Enabled

fmpracora-priv: Cluster Enabled

2.3.5 Check the cluster service status

[root@fmpracdb pcsd]# pcs cluster status          

Cluster Status:

 Last updated: Sun Aug  7 11:07:59 2016         Last change: Sun Aug  7 11:06:44 2016 by hacluster via crmd on fmpracora-priv

 Stack: corosync

 Current DC: fmpracora-priv (version 1.1.13-10.el7-44eb2dd) - partition with quorum

 2 nodes and 0 resources configured

 Online: [ fmpracora-priv fmprfcora-priv ]

 

PCSD Status:

fmprfcora-priv: Online

fmpracora-priv: Online

2.3.6 Configure the cluster fence devices

# pcs stonith create imm-fmprjdeora fence_ipmilan action=reboot auth=password ipaddr=10.0.202.18 lanplus=1 login=USERID passwd=PASSW0RD delay=120 cipher=1 pcmk_host_list=fmprjdeora-priv pcmk_host_check=static-list power_timeout=60

 

# pcs stonith create imm-fmdrjdeora fence_ipmilan action=reboot auth=password ipaddr=10.0.202.22 lanplus=1 login=USERID passwd=PASSW0RD cipher=1 pcmk_host_list=fmdrjdeora-priv pcmk_host_check=static-list power_timeout=60
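
The two commands above differ only in the resource id, the IPMI address, the node name, and the delay option on one device. In a two-node cluster a delay on one device is commonly used so that the nodes do not fence each other at the same moment, which is presumably the intent here. The sketch below prints the equivalent command for another node so it can be reviewed before running. The names stonith_create_cmd, IPMI_LOGIN and IPMI_PASSWD are hypothetical, and the example values come from section 1.3 and the status output in section 3.7.1.

```shell
#!/bin/sh
# Assumptions: IPMI_LOGIN and IPMI_PASSWD are hypothetical variable names;
# the option values mirror the commands shown above.
IPMI_LOGIN=${IPMI_LOGIN:-USERID}
IPMI_PASSWD=${IPMI_PASSWD:-changeme}

# stonith_create_cmd ID IPMI_ADDR NODE [DELAY]
# Prints (does not run) a "pcs stonith create" command for one node.
stonith_create_cmd() {
    cmd="pcs stonith create $1 fence_ipmilan action=reboot auth=password"
    cmd="$cmd ipaddr=$2 lanplus=1 login=$IPMI_LOGIN passwd=$IPMI_PASSWD"
    if [ -n "${4:-}" ]; then
        cmd="$cmd delay=$4"
    fi
    echo "$cmd cipher=1 pcmk_host_list=$3 pcmk_host_check=static-list power_timeout=60"
}

# Fence device for the IDC-SCM node, without a delay.
stonith_create_cmd imm-fmpridcora 10.0.202.17 fmpridcora-priv
```

Keeping the password in an environment variable rather than in the script avoids storing it in a file, although it still appears in the cluster configuration once the command is run.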

 

The following command shows the current fence device configuration:

[root@fmprpoora oracle]# pcs stonith show --full

 Resource: imm-fmprmaora (class=stonith type=fence_ipmilan)

Attributes: action=reboot auth=password ipaddr=10.0.202.20 lanplus=1 login=USERID passwd=PASSW0RD cipher=1 pcmk_host_list=fmprmaora-priv pcmk_host_check=static-list power_timeout=60

Operations: monitor interval=60s (imm-fmprmaora-monitor-interval-60s)

 Resource: imm-fmprpoora (class=stonith type=fence_ipmilan)

Attributes: action=reboot auth=password ipaddr=10.0.202.16 lanplus=1 login=USERID passwd=PASSW0RD delay=5 cipher=1 pcmk_host_list=fmprpoora-priv pcmk_host_check=static-list power_timeout=60 debug=/tmp/fmprpoora.log

Operations: monitor interval=60s (imm-fmprpoora-monitor-interval-60s)

[root@fmprpoora oracle]#

2.3.7 Test fencing

Check that the fence_ipmilan command can query the status of the fence device:

[root@fmpridcora oracle]# fence_ipmilan -a 10.0.202.22 -P -l USERID -p PASSW0RD -o status -v

Executing: /usr/bin/ipmitool -I lanplus -H 10.0.202.22 -U USERID -P PASSW0RD -p 623 -L ADMINISTRATOR chassis power status

 

0 Chassis Power is on

 

Status: ON

Test that a node can be fenced from the command line. The fenced node reboots automatically.

[root@fmprpoora ~]# pcs stonith fence fmprmaora-priv

Node: fmprmaora-priv fenced

[root@fmprmaora ~]# pcs stonith fence fmprpoora-priv

Node: fmprpoora-priv fenced

 

2.3.8 Create the floating IP addresses

Create the floating IP address resources:

[root@fmprfcora pcsd]# pcs resource create fmprfcora-vip ocf:heartbeat:IPaddr2 ip=10.0.101.12 cidr_netmask=24 op monitor interval=30s

[root@fmprfcora pcsd]# pcs resource create fmpracora-vip ocf:heartbeat:IPaddr2 ip=10.0.101.11 cidr_netmask=24 op monitor interval=30s  

 

[root@fmprfcora pcsd]#   pcs resource show  --full

 Resource: fmprfcora-vip (class=ocf provider=heartbeat type=IPaddr2)

Attributes: ip=10.0.101.12 cidr_netmask=24

Operations: start interval=0s timeout=20s (fmprfcora-vip-start-interval-0s)

              stop interval=0s timeout=20s (fmprfcora-vip-stop-interval-0s)

              monitor interval=30s (fmprfcora-vip-monitor-interval-30s)

 Resource: fmpracora-vip (class=ocf provider=heartbeat type=IPaddr2)

Attributes: ip=10.0.101.11 cidr_netmask=24

Operations: start interval=0s timeout=20s (fmpracora-vip-start-interval-0s)

              stop interval=0s timeout=20s (fmpracora-vip-stop-interval-0s)

              monitor interval=30s (fmpracora-vip-monitor-interval-30s)

 

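2.3.9 Create the file system resources

The file systems and mount points must already exist, and each file system must have been mounted successfully on every node in the cluster in turn. Then create the file system resources in the cluster.

 

[root@fmpracdb pcsd]# pcs resource create ac_fs Filesystem \

> device="/dev/mapper/acvg-aclv" directory="/ac-oradata" fstype="ext4"

[root@fmpracdb pcsd]# pcs resource create fc_fs Filesystem \

> device="/dev/mapper/fcvg-fclv" directory="/fc-oradata" fstype="ext4"

Show the configuration of the file system resources: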

[root@fmpracora dinxin]# pcs resource show ac_fs

 Resource: ac_fs (class=ocf provider=heartbeat type=Filesystem)

Attributes: device=/dev/mapper/acvg-aclv directory=/ac-oradata fstype=ext4

Operations: start interval=0s timeout=60 (ac_fs-start-interval-0s)

              stop interval=0s timeout=60 (ac_fs-stop-interval-0s)

              monitor interval=20 timeout=40 (ac_fs-monitor-interval-20)

[root@fmpracora dinxin]# pcs resource show fc_fs

 Resource: fc_fs (class=ocf provider=heartbeat type=Filesystem)

Attributes: device=/dev/mapper/fcvg-fclv directory=/fc-oradata fstype=ext4

Operations: start interval=0s timeout=60 (fc_fs-start-interval-0s)

              stop interval=0s timeout=60 (fc_fs-stop-interval-0s)

              monitor interval=20 timeout=40 (fc_fs-monitor-interval-20)

 

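2.3.10 Create the database listener and database resources

Confirm that the installed resource agents support Oracle. The following output shows that they do: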

[root@fmpracora dinxin]# pcs resource list  |grep ora

ocf:heartbeat:oracle - Manages an Oracle Database instance

ocf:heartbeat:oralsnr - Manages an Oracle TNS listener

 

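Confirm that the Oracle databases and listeners have already been created, and that each database can be started manually to the OPEN state on every node in the cluster.

Make sure the database SIDs can be resolved: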

[oracle@fmpracora admin]$ tnsping cvs

 

TNS Ping Utility for Linux: Version 11.2.0.4.0 - Production on 23-AUG-2016 13:51:06

 

Copyright (c) 1997, 2013, Oracle.  All rights reserved.

 

Used parameter files:

 

 

Used TNSNAMES adapter to resolve the alias

Attempting to contact (DESCRIPTION = (ADDRESS = (PROTOCOL = TCP)(HOST = fmpracora-vip)(PORT = 1526)) (CONNECT_DATA = (SERVER = DEDICATED) (SERVICE_NAME = cvs)))

OK (0 msec)

[oracle@fmpracora admin]$ tnsping arap2

 

TNS Ping Utility for Linux: Version 11.2.0.4.0 - Production on 23-AUG-2016 13:51:10

 

Copyright (c) 1997, 2013, Oracle.  All rights reserved.

 

Used parameter files:

 

 

Used TNSNAMES adapter to resolve the alias

Attempting to contact (DESCRIPTION = (ADDRESS = (PROTOCOL = TCP)(HOST = fmprfcora-vip)(PORT = 1521)) (CONNECT_DATA = (SERVER = DEDICATED) (SERVICE_NAME = arap2)))

OK (0 msec)

 

The current Oracle tnsnames.ora and listener.ora files are as follows:

 

CVS =

(DESCRIPTION =

(ADDRESS = (PROTOCOL = TCP)(HOST = fmpracora-vip)(PORT = 1526))

(CONNECT_DATA =

(SERVER = DEDICATED)

(SERVICE_NAME = cvs)

    )

  )

 

ARAP2 =

(DESCRIPTION =

(ADDRESS = (PROTOCOL = TCP)(HOST = fmprfcora-vip)(PORT = 1521))

(CONNECT_DATA =

(SERVER = DEDICATED)

(SERVICE_NAME = arap2)

    )

  )

 

[oracle@fmpracora admin]$ cat listener.ora

LSNR_arap2=

(ADDRESS_LIST =

(ADDRESS = (PROTOCOL = TCP)(HOST = fmprfcora-vip)(PORT = 1521))

  )

SECURE_REGISTER_LSNR_arap2 = (TCP)

 

LSNR_cvs=

(ADDRESS_LIST =

(ADDRESS = (PROTOCOL = TCP)(HOST = fmpracora-vip)(PORT = 1526))

  )

SECURE_REGISTER_LSNR_cvs = (TCP)

 

Create the Oracle database resources and listener resources with the following commands:

[root@fmpracdb pcsd]# pcs resource create arap2db oracle sid=arap2 home=/home/app/11.2/db_1 user=oracle shutdown_method=immediate

[root@fmpracdb pcsd]# pcs resource create arap2_lsn oralsnr sid=arap2 home=/home/app/11.2/db_1 user=oracle listener=LSNR_arap2

[root@fmpracdb pcsd]# pcs resource create cvsdb oracle sid=cvs home=/home/app/11.2/db_1 user=oracle shutdown_method=immediate

[root@fmpracdb pcsd]# pcs resource create cvs_lsn oralsnr sid=cvs home=/home/app/11.2/db_1 user=oracle listener=LSNR_cvs

 

Show the current configuration of the database and listener resources with the following commands:

[root@fmpracora dinxin]# pcs resource show cvs_lsn

 Resource: cvs_lsn (class=ocf provider=heartbeat type=oralsnr)

Attributes: sid=cvs home=/home/app/11.2/db_1 user=oracle listener=LSNR_cvs

Operations: start interval=0s timeout=120 (cvs_lsn-start-interval-0s)

              stop interval=0s timeout=120 (cvs_lsn-stop-interval-0s)

              monitor interval=10 timeout=30 (cvs_lsn-monitor-interval-10)

[root@fmpracora dinxin]# pcs resource show cvsdb

 Resource: cvsdb (class=ocf provider=heartbeat type=oracle)

Attributes: sid=cvs home=/home/app/11.2/db_1 user=oracle shutdown_method=immediate

Operations: start interval=0s timeout=120 (cvsdb-start-interval-0s)

              stop interval=0s timeout=120 (cvsdb-stop-interval-0s)

              monitor interval=120 timeout=30 (cvsdb-monitor-interval-120)

[root@fmpracora dinxin]# pcs resource show arap2db

 Resource: arap2db (class=ocf provider=heartbeat type=oracle)

Attributes: sid=arap2 home=/home/app/11.2/db_1 user=oracle shutdown_method=immediate

Operations: start interval=0s timeout=120 (arap2db-start-interval-0s)

              stop interval=0s timeout=120 (arap2db-stop-interval-0s)

              monitor interval=120 timeout=30 (arap2db-monitor-interval-120)

[root@fmpracora dinxin]# pcs resource show arap2_lsn

 Resource: arap2_lsn (class=ocf provider=heartbeat type=oralsnr)

Attributes: sid=arap2 home=/home/app/11.2/db_1 user=oracle listener=LSNR_arap2

Operations: start interval=0s timeout=120 (arap2_lsn-start-interval-0s)

              stop interval=0s timeout=120 (arap2_lsn-stop-interval-0s)

              monitor interval=10 timeout=30 (arap2_lsn-monitor-interval-10)

[root@fmpracora dinxin]#

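2.3.11 Create the resource groups

Create a resource group and add the floating IP address resource, the file system resource, the Oracle listener resource, and the Oracle database resource to it, setting the order in which they depend on one another. Once the group is set up, it fails over between nodes as a single unit.

# pcs resource group add ac_group fmpracora-vip

# pcs resource group add ac_group ac_fs --after fmpracora-vip

# pcs resource group add ac_group cvs_lsn --after ac_fs

# pcs resource group add ac_group cvsdb --after cvs_lsn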
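
The commands above build ac_group one member at a time. pcs also accepts several resource ids in a single `group add` call, and group members are started in the order listed and stopped in reverse. The sketch below prints the equivalent single command for fc_group, which this manual does not show. The function name group_add_cmd is hypothetical, and the member order is an assumption mirrored from ac_group.

```shell
#!/bin/sh
# group_add_cmd GROUP RESOURCE...
# Prints (does not run) one "pcs resource group add" command that lists the
# members in start order.
group_add_cmd() {
    group=$1
    shift
    echo "pcs resource group add $group $*"
}

# Member names for fc_group are the resources created in the earlier steps.
group_add_cmd fc_group fmprfcora-vip fc_fs arap2_lsn arap2db
```

Printing the command first makes it easy to compare the member order with `pcs resource` output before changing the cluster.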

 

2.3.12 Set the preferred node for cluster resources

Cluster resources start on their preferred node. When the preferred node goes down because of a failure, the resources switch automatically to another node. After the preferred node has been rebooted and its cman and pcs services have started, the resources fail back to it automatically.

# pcs constraint location fc_fs prefers fmprfcora-priv

# pcs constraint location fmprfcora-vip prefers fmprfcora-priv

# pcs constraint location arap2_lsn prefers fmprfcora-priv

# pcs constraint location arap2db prefers fmprfcora-priv

# pcs constraint location fc_group prefers fmprfcora-priv

 

# pcs constraint location ac_fs prefers fmpracora-priv

# pcs constraint location fmpracora-vip prefers fmpracora-priv

# pcs constraint location cvs_lsn prefers fmpracora-priv

# pcs constraint location cvsdb prefers fmpracora-priv

# pcs constraint location ac_group prefers fmpracora-priv

 

View the preferred nodes:

[root@fmpracora dinxin]# pcs constraint

Location Constraints:

  Resource: ac_group

    Enabled on: fmpracora-priv (score:INFINITY)

  Resource: fc_group

    Enabled on: fmprfcora-priv (score:INFINITY)

  Resource: fmpracora-vip

    Enabled on: fmpracora-priv (score:INFINITY)

  Resource: fmprfcora-vip

    Enabled on: fmprfcora-priv (score:INFINITY)
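
With several resources and groups, the `pcs constraint` listing above is easier to audit as one line per preference. The following is a minimal sketch that assumes the output layout shown above ("Resource:" lines followed by "Enabled on:" lines); the function name location_prefs is hypothetical.

```shell
#!/bin/sh
# location_prefs: read "pcs constraint" output on stdin and print one line per
# "Enabled on" entry as: RESOURCE NODE SCORE
location_prefs() {
    awk '
        $1 == "Resource:" { res = $2; next }
        $1 == "Enabled" && $2 == "on:" {
            score = $4
            gsub(/[()]/, "", score)     # "(score:INFINITY)" -> "score:INFINITY"
            sub(/^score:/, "", score)   # -> "INFINITY"
            print res, $3, score
        }
    '
}

# Example:
#   pcs constraint | location_prefs
```

"Disabled on" entries, such as the ban created by `pcs resource move`, are deliberately ignored by this sketch.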

 

 

3 RHEL 6.8 & 7.2 HA maintenance manual

3.1 Move a resource group to another node

[root@fmdrjdeora ~]#  pcs resource move dr_group

Warning: Creating location constraint cli-ban-dr_group-on-fmdrjdeora-priv with a score of -INFINITY for resource dr_group on node fmdrjdeora-priv.

This will prevent dr_group from running on fmdrjdeora-priv until the constraint is removed. This will be the case even if fmdrjdeora-priv is the last node in the cluster.

The services migrate to the other node:

[root@fmdrjdeora ~]# pcs resource

 Resource Group: dr_group

fmdrjdeora-vip(ocf::heartbeat:IPaddr2):Started fmprjdeora-priv

dr_fs(ocf::heartbeat:Filesystem):Started fmprjdeora-priv

jdedbdr_lsn(ocf::heartbeat:oralsnr):Started fmprjdeora-priv

jdedr_db(ocf::heartbeat:oracle):Started fmprjdeora-priv

 Resource Group: pr_group

pr_fs(ocf::heartbeat:Filesystem):Started fmprjdeora-priv

fmprjdeora-vip(ocf::heartbeat:IPaddr2):Started fmprjdeora-priv

jdedb(ocf::heartbeat:oracle):Started fmprjdeora-priv

jde_lsn(ocf::heartbeat:oralsnr):Started fmprjdeora-priv

The move also creates a location constraint that bans the group from node fmdrjdeora-priv:

[root@fmdrjdeora oracle]# pcs constraint

Location Constraints:

Resource: dr_fs

Enabled on: fmdrjdeora-priv (score:INFINITY)

Resource: dr_group

Enabled on: fmdrjdeora-priv (score:300)

Enabled on: fmprjdeora-priv (score:100)

Disabled on: fmdrjdeora-priv (score:-INFINITY) (role: Started)

 

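3.2 Fail the resource back to its original preferred node

The constraint has to be cleared first; otherwise the resource will not run on fmdrjdeora-priv.

 

[root@fmprjdeora ~]#  pcs resource clear dr_group

 

After the constraint has been cleared, the constraint list returns to normal: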

[root@fmprjdeora ~]# pcs constraint

Location Constraints:

Resource: dr_fs

Enabled on: fmdrjdeora-priv (score:INFINITY)

Resource: dr_group

Enabled on: fmdrjdeora-priv (score:300)

Enabled on: fmprjdeora-priv (score:100)

Resource: fmdrjdeora-vip

Enabled on: fmdrjdeora-priv (score:300)

Resource: fmprjdeora-vip

Enabled on: fmprjdeora-priv (score:INFINITY)

Resource: jde_lsn

Enabled on: fmprjdeora-priv (score:INFINITY)

Resource: jdedb

Enabled on: fmprjdeora-priv (score:INFINITY)

Resource: pr_fs

Enabled on: fmprjdeora-priv (score:INFINITY)

Resource: pr_group

Enabled on: fmprjdeora-priv (score:INFINITY)

Ordering Constraints:

Resource Sets:

set fmprjdeora-vip pr_fs jde_lsn jdedb

set fmdrjdeora-vip dr_fs

Colocation Constraints:

 

The resources then fail back automatically to the original server:

[root@fmdrjdeora ~]# pcs resource clear dr_group

[root@fmdrjdeora ~]# pcs resource

 Resource Group: dr_group

fmdrjdeora-vip(ocf::heartbeat:IPaddr2):Started fmdrjdeora-priv

dr_fs(ocf::heartbeat:Filesystem):Stopped

jdedbdr_lsn(ocf::heartbeat:oralsnr):Stopped

jdedr_db(ocf::heartbeat:oracle):        Stopped

 Resource Group: pr_group

pr_fs(ocf::heartbeat:Filesystem):Started fmprjdeora-priv

fmprjdeora-vip(ocf::heartbeat:IPaddr2):Started fmprjdeora-priv

jdedb(ocf::heartbeat:oracle):Started fmprjdeora-priv

jde_lsn(ocf::heartbeat:oralsnr):Started fmprjdeora-priv

[root@fmdrjdeora ~]# pcs resource

 Resource Group: dr_group

fmdrjdeora-vip(ocf::heartbeat:IPaddr2):Started fmdrjdeora-priv

dr_fs(ocf::heartbeat:Filesystem):Started fmdrjdeora-priv

jdedbdr_lsn(ocf::heartbeat:oralsnr):Started fmdrjdeora-priv

jdedr_db(ocf::heartbeat:oracle):Started fmdrjdeora-priv

 Resource Group: pr_group

pr_fs(ocf::heartbeat:Filesystem):    Started fmprjdeora-priv

fmprjdeora-vip(ocf::heartbeat:IPaddr2):Started fmprjdeora-priv

jdedb(ocf::heartbeat:oracle):Started fmprjdeora-priv

jde_lsn(ocf::heartbeat:oralsnr):Started fmprjdeora-priv

3.3 View resource status

[root@fmpracora dinxin]# pcs resource

 Resource Group: ac_group

fmpracora-vip(ocf::heartbeat:IPaddr2):Started fmpracora-priv

ac_fs(ocf::heartbeat:Filesystem):Started fmpracora-priv

cvs_lsn(ocf::heartbeat:oralsnr):       Started fmpracora-priv

cvsdb(ocf::heartbeat:oracle):Started fmpracora-priv

 Resource Group: fc_group

fmprfcora-vip(ocf::heartbeat:IPaddr2):Started fmprfcora-priv

fc_fs(ocf::heartbeat:Filesystem):Started fmprfcora-priv

arap2_lsn(ocf::heartbeat:oralsnr):Started fmprfcora-priv

arap2db(ocf::heartbeat:oracle):Started fmprfcora-priv

[root@fmpracora dinxin]#
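
When scripting a check after a move or a failback, it helps to extract the state of a single resource from this listing. The sketch below assumes the line layout shown above (name, agent in parentheses, state, optional node); the function name resource_state is hypothetical.

```shell
#!/bin/sh
# resource_state NAME: read "pcs resource" output on stdin and print the state
# of resource NAME, followed by the node when one is reported.
resource_state() {
    awk -v want="$1" '
        {
            line = $0
            sub(/^[ \t]+/, "", line)
            p = index(line, "(")
            if (p == 0) next                        # not a resource line
            name = substr(line, 1, p - 1)
            sub(/[ \t]+$/, "", name)
            if (name != want) next
            rest = substr(line, p)
            sub(/^\([^)]*\):[ \t]*/, "", rest)      # drop "(class::provider:type):"
            sub(/[ \t]*\(unmanaged\)[ \t]*$/, "", rest)
            n = split(rest, f, " ")
            if (n >= 2) print f[1], f[2]; else print f[1]
        }
    '
}

# Example:
#   pcs resource | resource_state cvsdb
```

The function prints nothing when the resource name does not appear in the input, so an empty result can be treated as "unknown resource".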

3.4 View cluster status

[root@fmpracora dinxin]# pcs status

Cluster name: fmprfcac_cluster

Last updated: Tue Aug 23 15:29:07 2016          Last change: Tue Aug 23 15:24:26 2016 by root via cibadmin on fmpracora-priv

Stack: corosync

Current DC: fmpracora-priv (version 1.1.13-10.el7-44eb2dd) - partition with quorum

2 nodes and 10 resources configured

 

Online: [ fmpracora-priv fmprfcora-priv ]

 

Full list of resources:

 

 imm-fmprfcora(stonith:fence_ipmilan):Started fmpracora-priv

 imm-fmpracora(stonith:fence_ipmilan):Started fmprfcora-priv

 Resource Group: ac_group

fmpracora-vip(ocf::heartbeat:IPaddr2):Started fmpracora-priv

ac_fs(ocf::heartbeat:Filesystem):Started fmpracora-priv

cvs_lsn(ocf::heartbeat:oralsnr):Started fmpracora-priv

cvsdb(ocf::heartbeat:oracle):        Started fmpracora-priv

 Resource Group: fc_group

fmprfcora-vip(ocf::heartbeat:IPaddr2):Started fmprfcora-priv

fc_fs(ocf::heartbeat:Filesystem):Started fmprfcora-priv

arap2_lsn(ocf::heartbeat:oralsnr):Started fmprfcora-priv

arap2db(ocf::heartbeat:oracle):Started fmprfcora-priv

 

Failed Actions:

* cvsdb_monitor_120000 on fmpracora-priv 'not running' (7): call=245, status=complete, exitreason='none',

last-rc-change='Mon Aug 22 14:05:39 2016', queued=0ms, exec=0ms

* cvs_lsn_monitor_10000 on fmpracora-priv 'not running' (7): call=243, status=complete, exitreason='none',

last-rc-change='Tue Aug 23 10:45:13 2016', queued=0ms, exec=0ms

 

 

PCSD Status:

fmprfcora-priv: Online

fmpracora-priv: Online

 

Daemon Status:

corosync: active/enabled

pacemaker: active/enabled

pcsd: active/enabled
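
The "Daemon Status" section at the end of this output shows whether each daemon is running and enabled at boot. A minimal sketch of an automated check follows, assuming the section layout shown above; the function name daemons_not_enabled is hypothetical. The RHEL 6.8 (cman stack) output shown later in this manual has no such section, so the check applies to the RHEL 7.2 output only.

```shell
#!/bin/sh
# daemons_not_enabled: read "pcs status" output on stdin and print every entry
# of the "Daemon Status:" section that is not "active/enabled".
daemons_not_enabled() {
    awk '
        /^Daemon Status:/ { in_section = 1; next }
        in_section && NF == 2 {
            name = $1
            sub(/:$/, "", name)
            if ($2 != "active/enabled") print name, $2
        }
    '
}

# Example:
#   pcs status | daemons_not_enabled
```

An empty result means that every listed daemon is both active and enabled.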

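3.5 Put a node in standby

When a node is set to standby, all cluster resources on it switch automatically to the other nodes in the cluster.

[root@fmprpoora ~]# pcs cluster standby fmprpoora-priv

The command to leave standby is unstandby. The node's cluster resources then fail back to it automatically.

[root@fmprpoora ~]# pcs cluster unstandby fmprpoora-priv

3.6 Clear resource failure counts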

[root@fmpracora dinxin]# pcs resource cleanup

Waiting for 1 replies from the CRMd. OK

3.7 Node maintenance

While a node is in maintenance mode, the cluster does not start resources on it automatically.

3.7.1 Node maintenance on RHEL 6.8

[root@fmpridcora oracle]# pcs node maintenance fmpridcora-priv

 

[root@fmpridcora oracle]# pcs status

Cluster name: fmpridc-clu

Last updated: Tue Aug 23 14:56:21 2016          Last change: Tue Aug 23 14:55:59 2016 by root via crm_attribute on fmpridcora-priv

Stack: cman

Current DC: fmprbiora-priv (version 1.1.14-8.el6-70404b0) - partition with quorum

2 nodes and 6 resources configured

 

Node fmpridcora-priv: maintenance

Online: [ fmprbiora-priv ]

 

Full list of resources:

 

 imm-fmpridcora (stonith:fence_ipmilan):        Started fmprbiora-priv

 imm-fmprbiora(stonith:fence_ipmilan):Started fmpridcora-priv (unmanaged)

 Resource Group: idc_group

     fmpridcora-vip     (ocf::heartbeat:IPaddr2):   Started fmpridcora-priv      (unmanaged)

     idc_fs(ocf::heartbeat:Filesystem):    Started fmpridcora-priv (unmanaged)

     fmidc_lsn(ocf::heartbeat:oralsnr):Started fmpridcora-priv (unmanaged)

     fmidcdb(ocf::heartbeat:oracle):        Started fmpridcora-priv (unmanaged)

 

PCSD Status:

fmpridcora-priv: Online

fmprbiora-priv: Online

 

Clear the node maintenance state:

[root@fmpridcora oracle]# pcs node unmaintenance fmpridcora-priv

 [root@fmpridcora oracle]# pcs status

Cluster name: fmpridc-clu

Last updated: Tue Aug 23 15:32:28 2016          Last change: Tue Aug 23 15:31:19 2016 by root via crm_attribute on fmpridcora-priv

Stack: cman

Current DC: fmprbiora-priv (version 1.1.14-8.el6-70404b0) - partition with quorum

2 nodes and 6 resources configured

 

Online: [ fmprbiora-priv fmpridcora-priv ]

 

Full list of resources:

 

 imm-fmpridcora (stonith:fence_ipmilan):        Started fmprbiora-priv

 imm-fmprbiora(stonith:fence_ipmilan):Started fmpridcora-priv

 Resource Group: idc_group

fmpridcora-vip(ocf::heartbeat:IPaddr2):Started fmpridcora-priv

idc_fs(ocf::heartbeat:Filesystem):Started fmpridcora-priv

fmidc_lsn(ocf::heartbeat:oralsnr):Started fmpridcora-priv

fmidcdb(ocf::heartbeat:oracle):Started fmpridcora-priv

 

PCSD Status:

fmpridcora-priv: Online

fmprbiora-priv: Online

3.7.2 Maintenance mode on RHEL 7.2

[root@fmpracora dinxin]# pcs property set maintenance-mode=true

[root@fmpracora dinxin]# pcs property show  maintenance-mode

Cluster Properties:

 maintenance-mode: true

[root@fmpracora dinxin]# pcs status

Cluster name: fmprfcac_cluster

Last updated: Tue Aug 23 15:11:46 2016          Last change: Tue Aug 23 15:09:15 2016 by root via cibadmin on fmpracora-priv

Stack: corosync

Current DC: fmpracora-priv (version 1.1.13-10.el7-44eb2dd) - partition with quorum

2 nodes and 10 resources configured

 

Online: [ fmpracora-priv fmprfcora-priv ]

 

Full list of resources:

 

 imm-fmprfcora(stonith:fence_ipmilan):Started fmpracora-priv (unmanaged)

 imm-fmpracora(stonith:fence_ipmilan):Started fmprfcora-priv (unmanaged)

 Resource Group: ac_group

fmpracora-vip(ocf::heartbeat:IPaddr2):Started fmpracora-priv (unmanaged)

ac_fs(ocf::heartbeat:Filesystem):Started fmpracora-priv (unmanaged)

cvs_lsn(ocf::heartbeat:oralsnr):Stopped (unmanaged)

cvsdb      (ocf::heartbeat:oracle):        Started fmpracora-priv (unmanaged)

 Resource Group: fc_group

fmprfcora-vip(ocf::heartbeat:IPaddr2):Started fmprfcora-priv (unmanaged)

fc_fs(ocf::heartbeat:Filesystem):Started fmprfcora-priv (unmanaged)

     arap2_lsn(ocf::heartbeat:oralsnr):Started fmprfcora-priv (unmanaged)

arap2db(ocf::heartbeat:oracle):Started fmprfcora-priv (unmanaged)

 

PCSD Status:

fmprfcora-priv: Online

fmpracora-priv: Online

 

Daemon Status:

corosync: active/enabled

pacemaker: active/enabled

pcsd: active/enabled

Clear maintenance mode:

[root@fmpracora dinxin]# pcs property set maintenance-mode=false

After maintenance mode has been cleared, the system status returns to normal:

[root@fmpracora dinxin]# pcs resource

 Resource Group: ac_group

fmpracora-vip      (ocf::heartbeat:IPaddr2):       Started fmpracora-priv

ac_fs(ocf::heartbeat:Filesystem):Started fmpracora-priv

cvs_lsn(ocf::heartbeat:oralsnr):Started fmpracora-priv

cvsdb(ocf::heartbeat:oracle):Started fmpracora-priv

 Resource Group: fc_group

fmprfcora-vip(ocf::heartbeat:IPaddr2):Started fmprfcora-priv

fc_fs(ocf::heartbeat:Filesystem):Started fmprfcora-priv

arap2_lsn(ocf::heartbeat:oralsnr):Started fmprfcora-priv

arap2db(ocf::heartbeat:oracle):Started fmprfcora-priv

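3.8 Start a resource manually

While the system is in maintenance mode, resources can be started manually. Adding the --full option to the start command shows detailed output.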

 

[root@fmprpoora ~]# pcs resource debug-start fmprpoora-vip

Operation start for fmprpoora-vip (ocf:heartbeat:IPaddr2) returned 0

 >stderr: INFO: Adding inet address 10.0.101.7/24 with broadcast address 10.0.101.255 to device bond0

 >stderr: INFO: Bringing device bond0 up

 >stderr: INFO: /usr/libexec/heartbeat/send_arp -i 200 -r 5 -p /var/run/resource-agents/send_arp-10.0.101.7 bond0 10.0.101.7 auto not_used not_used

 

3.9 Stop a resource manually

While the system is in maintenance mode, resources can be stopped manually:

[root@fmprmaora ~]# pcs resource debug-stop po_fs

Operation stop for po_fs (ocf:heartbeat:Filesystem) returned 0

 >stderr: INFO: Running stop for /dev/mapper/povg-polv on /po-oradata

 >stderr: INFO: Trying to unmount /po-oradata

 >stderr: INFO: unmounted /po-oradata successfully

[root@fmprmaora ~]# pcs resource debug-stop fmprpoora-vip

Operation stop for fmprpoora-vip (ocf:heartbeat:IPaddr2) returned 0

 >stderr: INFO: IP status = ok, IP_CIP=

[root@fmprmaora ~]#

3.10 Delete the cluster configuration

[root@fmprpoora ~]# pcs cluster destroy

Shutting down pacemaker/corosync services...

Pacemaker Cluster Manager is already stopped               [  OK  ]

Stopping cluster:

Leaving fence domain...                                 [  OK  ]

Stopping gfs_controld...                                [  OK  ]

Stopping dlm_controld...                                [  OK  ]

Stopping fenced...                                      [  OK  ]

Stopping cman...                                        [  OK  ]

Unloading kernel modules...                             [  OK  ]

Unmounting configfs...                                  [  OK  ]

Killing any remaining services...

Removing all cluster configuration files...

[root@fmprpoora ~]#

 

pcs resource disable <resource-group-name>

 
