如何防止 start-stop-daemon 在 cronjob 中创建多个实例?

Posted

技术标签:

【中文标题】如何防止 start-stop-daemon 在 cronjob 中创建多个实例?【英文标题】:How do I prevent start-stop-daemon from creating multiple instances in a cronjob? 【发布时间】:2016-02-12 01:43:43 【问题描述】:

好的,我将 /etc/init.d/skeleton 复制到 /etc/init.d/ark-server 以启动和停止 ARK: Survival Evolved 服务器,我有以下文件:

#! /bin/sh
### BEGIN INIT INFO
# Provides:          skeleton
# Required-Start:    $remote_fs $syslog
# Required-Stop:     $remote_fs $syslog
# Default-Start:     2 3 4 5
# Default-Stop:      0 1 6
# Short-Description: Example initscript
# Description:       This file should be used to construct scripts to be
#                    placed in /etc/init.d.
### END INIT INFO

# Author: Foo Bar <foobar@baz.org>
#
# Please remove the "Author" lines above and replace them
# with your own name if you copy and modify this script.

# Do NOT "set -e"

# PATH should only include /usr/* if it runs after the mountnfs.sh script
PATH=<path_to_ark>/ShooterGame/Binaries/Linux:/sbin:/usr/sbin:/bin:/usr/bin
DESC="ARK Dedicated Server"
NAME=ShooterGameServer
DAEMON=<path_to_ark>/ShooterGame/Binaries/Linux/$NAME
DAEMON_ARGS="\"TheIsland?listen?MultiHome=<ip>?QueryPort=<port>?ServerAdminPassword=<password>\" -nosteamclient -game -server -log"
PIDFILE=/var/run/$NAME.pid
SCRIPTNAME=/etc/init.d/$NAME

# Exit if the package is not installed
[ -x "$DAEMON" ] || exit 0

# Read configuration variable file if it is present
[ -r /etc/default/$NAME ] && . /etc/default/$NAME

# Load the VERBOSE setting and other rcS variables
. /lib/init/vars.sh

# Define LSB log_* functions.
# Depend on lsb-base (>= 3.2-14) to ensure that this file is present
# and status_of_proc is working.
. /lib/lsb/init-functions

#
# Function that starts the daemon/service
#
do_start()

        # Return
        #   0 if daemon has been started
        #   1 if daemon was already running
        #   2 if daemon could not be started
        start-stop-daemon --start --quiet --background --pidfile $PIDFILE --make-pidfile --chuid 1000:1000 --exec $DAEMON --test > /dev/null 2>&1 \
                || return 1
        start-stop-daemon --start --quiet --background --pidfile $PIDFILE --make-pidfile --chuid 1000:1000 --exec $DAEMON -- \
                $DAEMON_ARGS \
                || return 2
        # Add code here, if necessary, that waits for the process to be ready
        # to handle requests from services started subsequently which depend
        # on this one.  As a last resort, sleep for some time.
        #return 0


#
# Function that stops the daemon/service
#
do_stop()

        # Return
        #   0 if daemon has been stopped
        #   1 if daemon was already stopped
        #   2 if daemon could not be stopped
        #   other if a failure occurred
        start-stop-daemon --stop --quiet --retry=TERM/30/INT/15/KILL/5 --pidfile $PIDFILE --chuid 1000:1000 --exec $DAEMON
        RETVAL="$?"
        [ "$RETVAL" = 2 ] && return 2
        # Wait for children to finish too if this is a daemon that forks
        # and if the daemon is only ever run from this initscript.
        # If the above conditions are not satisfied then add some other code
        # that waits for the process to drop all resources that could be
        # needed by services started subsequently.  A last resort is to
        # sleep for some time.
        ##start-stop-daemon --stop --quiet --oknodo --retry=0/30/KILL/5 --exec $DAEMON
        ##[ "$?" = 2 ] && return 2
        # Many daemons don't delete their pidfiles when they exit.
        rm -f $PIDFILE
        return "$RETVAL"


#
# Function that sends a SIGHUP to the daemon/service
#
do_reload() 
        #
        # If the daemon can reload its configuration without
        # restarting (for example, when it is sent a SIGHUP),
        # then implement that here.
        #
        start-stop-daemon --stop --signal 1 --quiet --pidfile $PIDFILE --name $NAME &
        return 0


case "$1" in
  start)
        [ "$VERBOSE" != no ] && log_daemon_msg "Starting $DESC" "$NAME"
        do_start
        case "$?" in
                0|1) [ "$VERBOSE" != no ] && log_end_msg 0 ;;
                2) [ "$VERBOSE" != no ] && log_end_msg 1 ;;
        esac
        ;;
  stop)
        [ "$VERBOSE" != no ] && log_daemon_msg "Stopping $DESC" "$NAME"
        do_stop
        case "$?" in
                0|1) [ "$VERBOSE" != no ] && log_end_msg 0 ;;
                2) [ "$VERBOSE" != no ] && log_end_msg 1 ;;
        esac
        ;;
  status)
       status_of_proc "$DAEMON" "$NAME" && exit 0 || exit $?
       ;;
  #reload|force-reload)
        #
        # If do_reload() is not implemented then leave this commented out
        # and leave 'force-reload' as an alias for 'restart'.
        #
        #log_daemon_msg "Reloading $DESC" "$NAME"
        #do_reload
        #log_end_msg $?
        #;;
  restart|force-reload)
        #
        # If the "reload" option is implemented then remove the
        # 'force-reload' alias
        #
        log_daemon_msg "Restarting $DESC" "$NAME"
        do_stop
        case "$?" in
          0|1)
                do_start
                case "$?" in
                        0) log_end_msg 0 ;;
                        1) log_end_msg 1 ;; # Old process is still running
                        *) log_end_msg 1 ;; # Failed to start
                esac
                ;;
          *)
                # Failed to stop
                log_end_msg 1
                ;;
        esac
        ;;
  *)
        #echo "Usage: $SCRIPTNAME start|stop|restart|reload|force-reload" >&2
        echo "Usage: $SCRIPTNAME start|stop|status|restart|force-reload" >&2
        exit 3
        ;;
esac

这是我的 cron 条目:

0 12,3 * * 1-5 /etc/init.d/ark-server restart
0 6 * * 0,6 /etc/init.d/ark-server restart

如果我手动重新启动服务器,它总是会结束当前进程并创建一个新进程。但是,当 cronjobs 运行时,它会这样做:

top - 18:47:08 up 60 days, 20:39,  2 users,  load average: 3.43, 3.36, 3.39
Tasks: 215 total,   4 running, 211 sleeping,   0 stopped,   0 zombie
Cpu(s): 15.7%us, 31.6%sy,  0.0%ni, 49.3%id,  3.4%wa,  0.0%hi,  0.0%si,  0.0%st
Mem:  24397548k total, 23295208k used,  1102340k free,    36332k buffers
Swap:  7996348k total,   192228k used,  7804120k free,  7754660k cached

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
31329 ark     20   0 2927m 1.9g  27m R  143  8.3   0:14.45 ShooterGameServ
28672 ark     20   0 4349m 3.9g  31m R  100 16.8 948:52.64 ShooterGameServ
19470 ark     20   0 4343m 3.9g  29m R   92 16.7   2388:36 ShooterGameServ
14218 ark     20   0 5211m 4.0g  18m S   24 17.2   2975:16 ShooterGameServ

如何阻止我的脚本在 cronjobs 中创建多个实例?

这就是我在日志文件中的全部内容:

$ cat Logs/ShooterGame_2-backup-2015.11.21-01.10.20.log
[2015.11.20-17.09.30:075][  0]Log file open, 11/20/15 11:09:30
[2015.11.20-17.09.30:075][  0]Primal Game Data Took 15.39 seconds

ShooterGame.logShooterGame_2.log 都是空的。

所以我注意到,如果服务器已经运行超过 12 小时,重启只会失败并创建一个新的服务器实例。这是我今天重启服务器时发生的情况:

$ cat /var/run/ShooterGameServer.pid
16330
$ sudo /etc/init.d/ark-server restart
 * Restarting ARK Dedicated Server ShooterGameServer                                                             [ OK ]
$ cat /var/run/ShooterGameServer.pid
21094
$ kill -9 16330
$ kill -9 21094

它启动了 PID 21094 的第二个进程,但没有结束 PID 16330 的进程。但是,如果我杀死这两个进程并使用相同的脚本重新启动服务器,它总是先关闭第一个进程。

【问题讨论】:

为什么要定期重启服务器?发生这种情况时,您会在日志中得到什么?有什么事吗? 下次发生这种情况时我会尝试获取日志。 我会定期重新启动服务器,因为我有另一个 cronjob 可以自动更新服务器,所以希望我可以运行自动更新并且实际上让它们有点自动。 日志没有太多内容。它们要么是空白的,要么只是告诉我服务器何时加载。 为什么不让更新脚本仅在它实际上更新它时才重新启动它?您是否尝试过从 cronjob 重定向标准错误和标准输出?这可能会告诉你更多关于发生了什么。 -v 标志到 start-stop-daemon 也可能会告诉您一些有用的信息。 【参考方案1】:

我只是在RETVAL 之后放了一个kill -9 以确保它结束。不是最优雅的解决方案,我想知道为什么 start-stop-daemon 不起作用,但这里是代码:

#
# Function that stops the daemon/service
#
do_stop()

        # Return
        #   0 if daemon has been stopped
        #   1 if daemon was already stopped
        #   2 if daemon could not be stopped
        #   other if a failure occurred
        # --retry=TERM/30/INT/15/KILL/5
        start-stop-daemon --stop --quiet --retry TERM/forever/KILL/5 --pidfile $PIDFILE --chuid 1000:1000 --exec $DAEMON
        RETVAL="$?"
        kill -9 `cat $PIDFILE`
        [ "$RETVAL" = 2 ] && return 2
        # Wait for children to finish too if this is a daemon that forks
        # and if the daemon is only ever run from this initscript.
        # If the above conditions are not satisfied then add some other code
        # that waits for the process to drop all resources that could be
        # needed by services started subsequently.  A last resort is to
        # sleep for some time.
        ##start-stop-daemon --stop --quiet --oknodo --retry=0/30/KILL/5 --exec $DAEMON
        ##[ "$?" = 2 ] && return 2
        # Many daemons don't delete their pidfiles when they exit.
        rm -f $PIDFILE
        return "$RETVAL"

我不打算接受这个答案,以防有人真的知道如何让start-stop-daemon 正常工作。

【讨论】:

以上是关于如何防止 start-stop-daemon 在 cronjob 中创建多个实例?的主要内容,如果未能解决你的问题,请参考以下文章

如何记录由 start-stop-daemon 启动的进程的标准输出?

start-stop-daemon 在外部脚本条件下运行

什么是 linux 脚本中的 start-stop-daemon?

start-stop-daemon

CentOS使用Ubuntu的start-stop-daemon来启动守护进程

关于 'start-stop-daemon' not found in PATH or not executable