Nagios监控nginx服务具体过程
Posted jhcelue
tags:
篇首语:本文由小常识网(cha138.com)小编为大家整理,主要介绍了Nagios监控nginx服务具体过程相关的知识,希望对你有一定的参考价值。
1在nginx 服务器上安装nrpe客户端:
Nginx的服务须要监控起来。不然万一down了而不及时修复,会影响web应用。例如以下web应用上面启动的nginx后台进程
[[email protected] ~]# ps aux|grep nginx
nobody 15294 0.0 0.0 22432 3464 ? S Jul03 0:05 nginx: worker process
nobody 15295 0.0 0.0 22432 3480 ? S Jul03 0:05 nginx: worker process
......
nobody 15316 0.0 0.0 22432 3468 ?
S Jul03 0:05 nginx: worker process
nobody 15317 0.0 0.0 22432 3480 ? S Jul03 0:05 nginx: worker process
root 16260 0.0 0.0 20584 1684 ? Ss Jun18 0:00 nginx: master process /usr/local/nginx/sbin/nginx
root 21211 0.0 0.0 103252 860 pts/1 S+ 17:50 0:00 grep nginx
1.1,rpm方式安装nrpe客户端
下载地址:http://download.csdn.net/detail/mchdba/7493875
[[email protected] nagios]# ll
总计 768
-rw-r--r-- 1 root root 713389 12-16 12:08 nagios-plugins-1.4.11-1.x86_64.rpm
-rw-r--r-- 1 root root 32706 12-16 12:09 nrpe-2.12-1.x86_64.rpm
-rw-r--r-- 1 root root 18997 12-16 12:08 nrpe-plugin-2.12-1.x86_64.rpm
[[email protected] nagios]# rpm -ivh *.rpm --nodeps --force
1.2 在配置文件最末尾,加入配置信息以及监控主机服务器ip地址
[[email protected] localhost nagios]# vim /etc/nagios/nrpe.cfg
# add by tim on 2014-06-11
command[check_users]=/usr/local/nagios/libexec/check_users -w 8 -c 15
command[check_load]=/usr/local/nagios/libexec/check_load -w 15,10,5 -c 30,25,20
command[check_sda1]=/usr/local/nagios/libexec/check_disk -w 20% -c 10% -p /dev/sda
command[check_zombie_procs]=/usr/local/nagios/libexec/check_procs -w 5 -c 10 -s Z
#command[check_total_procs]=/usr/local/nagios/libexec/check_procs -w 50 -c 80
command[check_total_procs]=/usr/local/nagios/libexec/check_procs -w 750 -c 800
command[check-host-alive]=/usr/local/nagios/libexec/check_ping -H 10.xx.3.29 -w 3000.0,80% -c 5000.0,100% -p 5
allowed_hosts = 127.0.0.1,10.xx.3.41
check下命令是否生效:
[[email protected] nrpe-2.15]# /usr/local/nagios/libexec/check_users -w 8 -c 15
USERS OK - 2 users currently logged in |users=2;8;15;0
[[email protected] nrpe-2.15]#
看到已经USERS OK -….命令已经生效。
1.3 启动nrpe报错例如以下:
[[email protected] ~]# service nrpe restart
Shutting down nrpe: [失败]
Starting nrpe: /usr/sbin/nrpe: error while loading shared libraries: libssl.so.6: cannot open shared object file: No such file or directory
[失败]
[[email protected] ~]#
[[email protected] nagios_client]# service nrpe start
Starting nrpe: /usr/sbin/nrpe: error while loading shared libraries: libssl.so.6: cannot open shared object file: No such file or directory
[失败]
[[email protected] nagios_client]#
建立连接
[[email protected] nagios_client]# ln -s /usr/lib64/libssl.so /usr/lib64/libssl.so.6
(假设没有libssl.so,就採用别的libssl.so.10来做软连接,ln -s /usr/lib64/libssl.so.10 /usr/lib64/libssl.so.6)
[[email protected] nagios_client]#
再又一次启动例如以下:
[[email protected] nagios_client]# service nrpe start
Starting nrpe: /usr/sbin/nrpe: error while loading shared libraries: libcrypto.so.6: cannot open shared object file: No such file or directory
[失败]
[[email protected] ~]# ll /usr/lib64/libcrypto.so
lrwxrwxrwx. 1 root root 18 10月 13 2013 /usr/lib64/libcrypto.so -> libcrypto.so.1.0.0
[[email protected] nagios_client]#
再建链接:
[[email protected] nagios_client]# ln -s /usr/lib64/libcrypto.so /usr/lib64/libcrypto.so.6
(或者假设没有libcrypto.so,就採用libcrypto.so.10做软连接, ln -s /usr/lib64/libcrypto.so.10 /usr/lib64/libcrypto.so.6)
[[email protected] nagios_client]# service nrpe start
Starting nrpe: [确定]
[[email protected] nagios_client]#
1.4 检測下nrpe是否正常执行:
去nagios服务器端check下
[[email protected] ~]# /usr/local/nagios/libexec/check_nrpe -H 10.xx.3.xx
NRPE v2.12
[[email protected] ~]#
看到返回NRPE v2.15表示已经连接成功,客户端的nrpe服务已经监控完毕。
2,比較简单的通过check_http的方式监控
能够在/etc/nagios/nrpe.cfg里面採用check_http的方式来获取nginx是否执行:
(1) 编辑nrpe.cfg
Vim /etc/nagios/nrpe.cfg
command[check_nginx_status]=/usr/lib/nagios/plugins/check_http -I localhost -p 80 -u /nginx_status -e 200 -w 3 -c 10
(2) 重新启动nrpe服务
[[email protected] ~]# service nrpe restart
Shutting down nrpe: [确定]
Starting nrpe: [确定]
[[email protected] ~]#
(3) 在nagios服务器端check。成功。
[[email protected] ~]# /usr/local/nagios/libexec/check_nrpe -H10.xx.1.22 -c check_nginx_status
HTTP OK HTTP/1.1 200 OK - 254 bytes in 0.002 seconds |time=0.002031s;3.000000;10.000000;0.000000 size=254B;;;0
(4) 在services.cfg里面加入check_nginx_status服务
define service{
host_name lb-net-2
service_description check_nginx_status
check_command check_nrpe!check_nginx_status
max_check_attempts 5
normal_check_interval 3
retry_check_interval 2
check_period 24x7
notification_interval 10
notification_period 24x7
notification_options w,u,c,r
contact_groups opsweb
}
(5) 在command.cfg加入check_nginx_status服务
define command{
command_name check_nginx_status
command_line $USER1$/check_nginx_status -I $HOSTADDRESS$ -w $Warning$ -c $Cri$
}
(6) 又一次载入nagios
[[email protected] objects]# service nagios reload
Running configuration check...
Reloading nagios configuration...
done
[[email protected] objects]#
(7) 查看界面的nginx监控服务。例如以下所看到的:
3 编写脚本来监控nginx服务
3.1 调试具体经过
[[email protected] run]# find / -name nginx.pid
/usr/local/nginx/logs/nginx.pid
[[email protected] run]# /usr/lib/nagios/plugins/check_nginxstatus -H localhost -P 80 -p /usr/local/nginx/logs/ -n /usr/local/nginx/logs/nginx.pid -s nginx_status -o /tmp/ -w 1500 -c 2000
expr: 參数数目错误
expr: 语法错误
(standard_in) 1: syntax error
/usr/lib/nagios/plugins/check_nginxstatus: line 258: [: : integer expression expected
/usr/lib/nagios/plugins/check_nginxstatus: line 262: [: : integer expression expected
OK - nginx is running. requests per second, connections per second ( requests per connection) | ‘reqpsec‘= ‘conpsec‘= ‘conpreq‘= ]
去查看262行。将逻辑运算符 "-a" 改成 "&&"
[[email protected] run]# vim /usr/lib/nagios/plugins/check_nginxstatus
[[email protected] run]# /usr/lib/nagios/plugins/check_nginxstatus -H localhost -P 80 -p /usr/local/nginx/logs/ -n /usr/local/nginx/logs/nginx.pid -s nginx_status -o /tmp/ -w 1500 -c 2000
expr: 參数数目错误
expr: 语法错误
(standard_in) 1: syntax error
/usr/lib/nagios/plugins/check_nginxstatus: line 258: [: missing `]‘
/usr/lib/nagios/plugins/check_nginxstatus: line 262: [: : integer expression expected
OK - nginx is running. requests per second, connections per second ( requests per connection) | ‘reqpsec‘= ‘conpsec‘= ‘conpreq‘= ]
[[email protected] run]#
看到已经OK了。再改动文件。
[[email protected] run]# vim /usr/lib/nagios/plugins/check_nginxstatus
[[email protected] run]#
[[email protected] run]# /usr/lib/nagios/plugins/check_nginxstatus -H localhost -P 80 -p /usr/local/nginx/logs/ -n /usr/local/nginx/logs/nginx.pid -s nginx_status -o /tmp/ -w 1500 -c 2000
expr: 參数数目错误
expr: 语法错误
(standard_in) 1: syntax error
/usr/lib/nagios/plugins/check_nginxstatus: line 258: [: missing `]‘
OK - nginx is running. requests per second, connections per second ( requests per connection) | ‘reqpsec‘= ‘conpsec‘= ‘conpreq‘= ]
[[email protected] run]#
将[]改成使用"[[]]"。 就可以!
[[email protected] run]# vim /usr/lib/nagios/plugins/check_nginxstatus
[[email protected] run]#
[[email protected] run]#
[[email protected] run]# /usr/lib/nagios/plugins/check_nginxstatus -H localhost -P 80 -p /usr/local/nginx/logs/ -n /usr/local/nginx/logs/nginx.pid -s nginx_status -o /tmp/ -w 1500 -c 2000
expr: 參数数目错误
expr: 语法错误
(standard_in) 1: syntax error
OK - nginx is running. requests per second, connections per second ( requests per connection) | ‘reqpsec‘= ‘conpsec‘= ‘conpreq‘= ]
[[email protected] run]#
凝视掉#reqpcon=`echo "scale=2; $reqpsec / $conpsec" | bc -l`之后。就不会报(standard_in) 1: syntax error错误,例如以下所看到的:
[[email protected] run]# /usr/lib/nagios/plugins/check_nginxstatus -H localhost -P 80 -p /usr/local/nginx/logs/ -s nginx_status -n nginx.pid -w 15000 -c 20000
expr: 參数数目错误
expr: 语法错误
OK - nginx is running. requests per second, connections per second ( requests per connection) | ‘reqpsec‘= ‘conpsec‘= ‘conpreq‘= ]
[[email protected] run]#
凝视掉# reqpsec=`expr $tmp2_reqpsec - $tmp1_reqpsec` 就不会再报 expr: 參数数目错误,例如以下所看到的:
报错:
[[email protected] run]# /usr/lib/nagios/plugins/check_nginxstatus -H localhost -P 80 -p /usr/local/nginx/logs/ -s nginx_status -n nginx.pid -w 15000 -c 20000
expr: 语法错误
OK - nginx is running. requests per second, connections per second ( requests per connection) | ‘reqpsec‘= ‘conpsec‘= ‘conpreq‘= ]
再次凝视掉 #reqpcon=`echo "scale=2; $reqpsec / $conpsec" | bc -l` 后。执行不会报expr: 语法错误。例如以下所看到的:
[[email protected] run]# /usr/lib/nagios/plugins/check_nginxstatus -H localhost -P 80 -p /usr/local/nginx/logs/ -s nginx_status -n nginx.pid -w 15000 -c 20000
OK - nginx is running. requests per second, connections per second ( requests per connection) | ‘reqpsec‘= ‘conpsec‘= ‘conpreq‘= ]
[[email protected] run]#
看到这里发现‘reqpsec‘= ‘conpsec‘= ‘conpreq‘=都没有值,可是nginx又是在启动执行着,问题出在哪里?经过排查,原来是nginx_status服务没有启动。须要在/usr/local/nginx/conf/nginx.conf配置文件中面加入例如以下配置:
# 加入pid參数
pid logs/nginx.pid;
#charset koi8-r;
access_log logs/host.access.log main;
location /nginx_status {
stub_status on;
access_log off;
deny all;
}
然后又一次载入nginx,看到新的nginx-status文件是生成了。可是文件内容为空,例如以下所看到的:
[[email protected] logs]# ll /tmp/nginx*
-rw-r--r--. 1 root root 0 7月 3 15:06 /tmp/nginx-status.1
[[email protected] logs]#
去查看ngins后台日志
[[email protected] logs]# cd /usr/local/nginx/
[[email protected] logs]# tail -n 300 error.log
……
2014/07/03 15:05:47 [error] 4285#0: *1851293 access forbidden by rule, client: 127.0.0.1, server: localhost, request: "GET /nginx_status HTTP/1.0", host: "localhost"
2014/07/03 15:05:48 [error] 4285#0: *1851294 access forbidden by rule, client: 127.0.0.1, server: localhost, request: "GET /nginx_status HTTP/1.0", host: "localhost"
2014/07/03 15:06:12 [error] 4282#0: *1851362 access forbidden by rule, client: 127.0.0.1, server: localhost, request: "GET /nginx_status HTTP/1.0", host: "localhost"
2014/07/03 15:06:13 [error] 4282#0: *1851363 access forbidden by rule, client: 127.0.0.1, server: localhost, request: "GET /nginx_status HTTP/1.0", host: "localhost"
2014/07/03 15:06:55 [error] 4285#0: *1851509 access forbidden by rule, client: 127.0.0.1, server: localhost, request: "GET /nginx_status HTTP/1.0", host: "localhost"
2014/07/03 15:06:56 [error] 4285#0: *1851519 access forbidden by rule, client: 127.0.0.1, server: localhost, request: "GET /nginx_status HTTP/1.0", host: "localhost"
查看nginx编译參数
[[email protected] logs]# /usr/local/nginx/sbin/nginx -V
nginx version: nginx/1.4.2
built by gcc 4.4.7 20120313 (Red Hat 4.4.7-3) (GCC)
configure arguments: --prefix=/usr/local/nginx --with-http_stub_status_module --with-http_realip_module
证明白实是载入了stub_status插件,之后去改动配置文件。凝视掉deny all;又一次载入nginx。
[[email protected] logs]# vim /usr/local/nginx/conf/nginx.conf
#deny all;
[[email protected] logs]# service nginx reload
reload nginx
[[email protected] logs]#
[[email protected] logs]# ll /tmp/nginx*
ls: 无法訪问/tmp/nginx*: 没有那个文件或文件夹
[[email protected] logs]#
还是没有看到/tmp/nginx-status.1状态文件生成。因为nagios下监控nginx的脚本是从nginx-status.1获取数据,假设没有这个文件,没有办法获取数据。
继续google,”nginx stub_status没有生成nginx-status.1”文件。看到有人说仅仅要配置好了这个状态文件有没有无所谓,我就试着直接执行脚本看看是否能生效。
[[email protected] logs]# ll /tmp/nginx*
ls: 无法訪问/tmp/nginx*: 没有那个文件或文件夹
[[email protected] logs]# /root/check_nginx2.sh -H localhost -P 80 -p /usr/local/nginx/logs/ -n nginx.pid -s nginx_status -w 15000 -c 20000
OK - nginx is running. 1 requests per second, 2 connections per second (.50 requests per connection) | ‘reqpsec‘=1 ‘conpsec‘=2 ‘conpreq‘=.50 ]
[[email protected] logs]#
看到‘reqpsec‘=1 ‘conpsec‘=2 ‘conpreq‘=.50里面有数据了,再去check下文件有没有生成,例如以下所看到的:
[[email protected] logs]# ll /tmp/nginx*
ls: 无法訪问/tmp/nginx*: 没有那个文件或文件夹
[[email protected] logs]#
还是没有文件生成,可是check已经有数据了,证明不一定要拘泥于是否在/tmp/文件夹下是否有nginx-status.1文件。
通过脚本分析例如以下:
[[email protected] logs]# vim /usr/lib/nagios/plugins/check_nginxstatus
180 get_status() {
181 if [ "$secure" = 1 ]
182 then
183 wget_opts="-O- -q -t 3 -T 3 --no-check-certificate"
184 out1=`wget ${wget_opts} http://${hostname}:${port}/${status_page}`
185 sleep 1
186 out2=`wget ${wget_opts} http://${hostname}:${port}/${status_page}`
187 else
188 wget_opts="-O- -q -t 3 -T 3"
189 out1=`wget ${wget_opts} http://${hostname}:${port}/${status_page}`
190 sleep 1
191 out2=`wget ${wget_opts} http://${hostname}:${port}/${status_page}`
192 fi
193
194 if [ -z "$out1" -o -z "$out2" ]
195 then
196 echo "UNKNOWN - Local copy/copies of $status_page is empty."
197 exit $ST_UK
198 fi
199 }
是通过訪问`wget -O- -q -t 3 -T 3 --no-check-certificate http://10.xx.xx.xx:80/nginx_status`这个链接来获取status的数据记录的,而不是去载入/tmp/nginx-status.1文件来获取数据的。
直接訪问 http://10.xx.xx.xx:80/nginx_status 地址就能获取nginx执行数据,例如以下图所看到的:
在nagios服务器上check下。报错:
[[email protected] ~]# /usr/local/nagios/libexec/check_nrpe -H10.xx.xx.xx -c check_nginx_status
UNKNOWN - Local copy/copies of nginx_status is empty.
[[email protected] ~]#
检查监控脚本,搜索 ‘Local copy/copies of nginx_status is empty.’在第197行,有例如以下代码:
195 if [ -z "$out1" -o -z "$out2" ]
196 then
197 echo "UNKNOWN - Local copy/copies of $status_page is empty."
198 exit $ST_UK
199 fi
看出是因为if [ -z "$out1" -o -z "$out2" ]这个推断生效。导致监控脚本执行到这里就exit了。继续调试。发现用nagios服务器调用脚本的时候。执行到下面第190行到第192行
out1=`/usr/bin/wget ${wget_opts} http://${hostname}:${port}/${status_page}`
sleep 1
out2=`/usr/bin/wget ${wget_opts} http://${hostname}:${port}/${status_page}`
的时候。out1为空,out2也为空,所以在后面的if [ -z "$out1" -o -z "$out2" ]推断通过报出信息为:UNKNOWN - Local copy/copies of $status_page is empty. 然后直接exit。
说明:因为nginx是要调用wget命令来获取nginx_status状态的,而wget命令是仅仅能以root用户来执行的, 所以须要将nagios用户设置成能够无需password直接su成root,这样就能以nagios用户执行命令sudo /usr/lib/nagios/plugins/check_nginxstatus 。在centos系统中。无法直接调用sudo命令,须要改动/etc/sudoers, 找到 #Defaults requiretty 并取消凝视,另外新增一行。表示nagios用户不须要登陆终端就能够调用命令。例如以下所看到的:
Defaults requiretty
Defaults:nagios !requiretty
#加入nagios 请求sudo,同意特定指令时(可跟參数),不须要password(如)。
nagios ALL=(ALL) NOPASSWD: ALL
改动完后。再check,数据出来了:
[[email protected] ~]# /usr/local/nagios/libexec/check_nrpe -H10.xx.xx.xx -c check_nginx_status
OK - nginx is running. 1 requests per second, 1 connections per second (1.00 requests per connection) | ‘reqpsec‘=1 ‘conpsec‘=1 ‘conpreq‘=1.00 ]
[[email protected] ~]#
3.2 share下check_nginxstatus脚本
- #!/bin/sh
- PROGNAME=`basename $0`
- VERSION=\\\"Version 1.1,\\\"
- AUTHOR=\\\"tim man\\\"
- ST_OK=0
- ST_WR=1
- ST_CR=2
- ST_UK=3
- hostname=\\\"localhost\\\"
- port=80
- path_pid=/var/run
- name_pid=\\\"nginx.pid\\\"
- status_page=\\\"nginx_status\\\"
- pid_check=1
- secure=0
- print_version() {
- echo \\\"$VERSION $AUTHOR\\\"
- }
- print_help() {
- print_version $PROGNAME $VERSION
- echo \\\"\\\"
- echo \\\"$PROGNAME is a Nagios plugin to check whether nginx is running.\\\"
- echo \\\"It also parses the nginx\\\‘s status page to get requests and\\\"
- echo \\\"connections per second as well as requests per connection. You\\\"
- echo \\\"may have to alter your nginx configuration so that the plugin\\\"
- echo \\\"can access the server\\\‘s status page.\\\"
- echo \\\"The plugin is highly configurable for this reason. See below for\\\"
- echo \\\"available options.\\\"
- echo \\\"\\\"
- echo \\\"$PROGNAME -H localhost -P 80 -p /var/run -n nginx.pid \\\"
- echo \\\" -s nginx_statut -o /tmp [-w INT] [-c INT] [-S] [-N]\\\"
- echo \\\"\\\"
- echo \\\"Options:\\\"
- echo \\\" -H/--hostname)\\\"
- echo \\\" Defines the hostname. Default is: localhost\\\"
- echo \\\" -P/--port)\\\"
- echo \\\" Defines the port. Default is: 80\\\"
- echo \\\" -p/--path-pid)\\\"
- echo \\\" Path where nginx\\\‘s pid file is being stored. You might need\\\"
- echo \\\" to alter this path according to your distribution. Default\\\"
- echo \\\" is: /var/run\\\"
- echo \\\" -n/--name_pid)\\\"
- echo \\\" Name of the pid file. Default is: nginx.pid\\\"
- echo \\\" -N/--no-pid-check)\\\"
- echo \\\" Turn this on, if you don\\\‘t want to check for a pid file\\\"
- echo \\\" whether nginx is running, e.g. when you\\\‘re checking a\\\"
- echo \\\" remote server. Default is: off\\\"
- echo \\\" -s/--status-page)\\\"
- echo \\\" Name of the server\\\‘s status page defined in the location\\\"
- echo \\\" directive of your nginx configuration. Default is:\\\"
- echo \\\" nginx_status\\\"
- echo \\\" -S/--secure)\\\"
- echo \\\" In case your server is only reachable via SSL, use this\\\"
- echo \\\" this switch to use HTTPS instead of HTTP. Default is: off\\\"
- echo \\\" -w/--warning)\\\"
- echo \\\" Sets a warning level for requests per second. Default is: off\\\"
- echo \\\" -c/--critical)\\\"
- echo \\\" Sets a critical level for requests per second. Default is:\\\"
- echo \\\" off\\\"
- exit $ST_UK
- }
- while test -n \\\"$1\\\"; do
- case \\\"$1\\\" in
- -help|-h)
- print_help
- exit $ST_UK
- ;;
- --version|-v)
- print_version $PROGNAME $VERSION
- exit $ST_UK
- ;;
- --hostname|-H)
- hostname=$2
- shift
- ;;
- --port|-P)
- port=$2
- shift
- ;;
- --path-pid|-p)
- path_pid=$2
- shift
- ;;
- --name-pid|-n)
- name_pid=$2
- shift
- ;;
- --no-pid-check|-N)
- pid_check=0
- ;;
- --status-page|-s)
- status_page=$2
- shift
- ;;
- --secure|-S)
- secure=1
- ;;
- --warning|-w)
- warning=$2
- shift
- ;;
- --critical|-c)
- critical=$2
- shift
- ;;
- *)
- echo \\\"Unknown argument: $1\\\"
- print_help
- exit $ST_UK
- ;;
- esac
- shift
- done
- get_wcdiff() {
- if [ ! -z \\\"$warning\\\" -a ! -z \\\"$critical\\\" ]
- then
- wclvls=1
- if [ ${warning} -ge ${critical} ]
- then
- wcdiff=1
- fi
- elif [ ! -z \\\"$warning\\\" -a -z \\\"$critical\\\" ]
- then
- wcdiff=2
- elif [ -z \\\"$warning\\\" -a ! -z \\\"$critical\\\" ]
- then
- wcdiff=3
- fi
- }
- val_wcdiff() {
- if [ \\\"$wcdiff\\\" = 1 ]
- then
- echo \\\"Please adjust your warning/critical thresholds. The warning \\\\
- must be lower than the critical level!\\\"
- exit $ST_UK
- elif [ \\\"$wcdiff\\\" = 2 ]
- then
- echo \\\"Please also set a critical value when you want to use \\\\
- warning/critical thresholds!\\\"
- exit $ST_UK
- elif [ \\\"$wcdiff\\\" = 3 ]
- then
- echo \\\"Please also set a warning value when you want to use \\\\
- warning/critical thresholds!\\\"
- exit $ST_UK
- fi
- }
- check_pid() {
- if [ -f \\\"$path_pid/$name_pid\\\" ]
- then
- retval=0
- else
- retval=1
- fi
- }
- get_status() {
- if [ \\\"$secure\\\" = 1 ]
- then
- wget_opts=\\\"-O- -q -t 3 -T 3 --no-check-certificate\\\"
- #out1=`/usr/bin/wget ${wget_opts} http://${hostname}:${port}/${status_page}`
- out1=`/usr/bin/wget -O- -q -t 3 -T 3 http://localhost:80/nginx_status`
- sleep 1
- out2=`/usr/bin/wget -O- -q -t 3 -T 3 http://localhost:80/nginx_status`
- else
- wget_opts=\\\"-O- -q -t 3 -T 3\\\"
- out1=`/usr/bin/wget -O- -q -t 3 -T 3 http://localhost:80/nginx_status`
- sleep 1
- out2=`/usr/bin/wget -O- -q -t 3 -T 3 http://localhost:80/nginx_status`
- fi
- if [ -z \\\"$out1\\\" -o -z \\\"$out2\\\" ]
- then
- echo \\\"out1:$out1 out2:$out2, UNKNOWN - Local copy/copies of $status_page is empty.\\\"
- exit $ST_UK
- fi
- }
- get_vals() {
- tmp1_reqpsec=`echo ${out1}|awk \\\‘{print $10}\\\‘`
- tmp2_reqpsec=`echo ${out2}|awk \\\‘{print $10}\\\‘`
- reqpsec=`expr $tmp2_reqpsec - $tmp1_reqpsec`
- tmp1_conpsec=`echo ${out1}|awk \\\‘{print $9}\\\‘`
- tmp2_conpsec=`echo ${out2}|awk \\\‘{print $9}\\\‘`
- conpsec=`expr $tmp2_conpsec - $tmp1_conpsec`
- reqpcon=`echo \\\"scale=2; $reqpsec / $conpsec\\\" | bc -l`
- if [ \\\"$reqpcon\\\" = \\\".99\\\" ]
- then
- reqpcon=\\\"1.00\\\"
- fi
- }
- do_output() {
- output=\\\"nginx is running. $reqpsec requests per second, $conpsec connections per second ($reqpcon requests per connection)\\\"
- }
- do_perfdata() {
- perfdata=\\\"\\\‘reqpsec\\\‘=$reqpsec \\\‘conpsec\\\‘=$conpsec \\\‘conpreq\\\‘=$reqpcon\\\"
- }
- # Here we
- get_wcdiff
- val_wcdiff
- if [ ${pid_check} = 1 ]
- then
- check_pid
- if [ \\\"$retval\\\" = 1 ]
- then
- echo \\\"There\\\‘s no pid file for nginx. Is nginx running? Please also make sure whether your pid path and name is correct.\\\"
- exit $ST_CR
- fi
- fi
- get_status
- get_vals
- do_output
- do_perfdata
- if [[ -n \\\"$warning\\\" ]] && [[ -n \\\"$critical\\\" ]]
- then
- if [[ \\\"$reqpsec\\\" -ge \\\"$warning\\\" ]] && [[ \\\"$reqpsec\\\" -lt \\\"$critical\\\" ]]
- then
- echo \\\"WARNING - ${output} | ${perfdata}\\\"
- exit $ST_WR
- elif [ \\\"$reqpsec\\\" -ge \\\"$critical\\\" ]
- then
- echo \\\"CRITICAL - ${output} | ${perfdata}\\\"
- exit $ST_CR
- else
- echo \\\"OK - ${output} | ${perfdata} ]\\\"
- exit $ST_OK
- fi
- else
- echo \\\"OK - ${output} | ${perfdata}\\\"
- exit $ST_OK
- fi
以上是关于Nagios监控nginx服务具体过程的主要内容,如果未能解决你的问题,请参考以下文章