Daemonize Celerybeat in Elastic Beanstalk (AWS)


Posted: 2015-05-26 04:13:00

Question:

I am trying to run celerybeat as a daemon in Elastic Beanstalk. Here is my config file:

files:
  "/opt/python/log/django.log":
    mode: "000666"
    owner: ec2-user
    group: ec2-user
    content: |
      # Log file
    encoding: plain
  "/opt/elasticbeanstalk/hooks/appdeploy/post/run_supervised_celeryd.sh":
    mode: "000755"
    owner: root
    group: root
    content: |
      #!/usr/bin/env bash
      # Get django environment variables
      celeryenv=`cat /opt/python/current/env | tr '\n' ',' | sed 's/%/%%/g' | sed 's/export //g' | sed 's/$PATH/%(ENV_PATH)s/g' | sed 's/$PYTHONPATH//g' | sed 's/$LD_LIBRARY_PATH//g'`
      # Strip the trailing comma
      celeryenv=${celeryenv%?}

      # Create celery configuration script
      celeryconf="[program:celeryd]
      ; Set full path to celery program if using virtualenv
      command=/opt/python/run/venv/bin/celery worker -A avtotest --loglevel=INFO

      directory=/opt/python/current/app
      user=nobody
      numprocs=1
      stdout_logfile=/var/log/celery-worker.log
      stderr_logfile=/var/log/celery-worker.log
      autostart=true
      autorestart=true
      startsecs=10

      ; Need to wait for currently executing tasks to finish at shutdown.
      ; Increase this if you have very long running tasks.
      stopwaitsecs = 600

      ; When resorting to send SIGKILL to the program to terminate it
      ; send SIGKILL to its whole process group instead,
      ; taking care of its children as well.
      killasgroup=true

      ; if rabbitmq is supervised, set its priority higher
      ; so it starts first
      priority=998

      environment=$celeryenv"

      # Create celerybeat configuration script
      celerybeatconf="[program:celerybeat]
      ; Set full path to celery program if using virtualenv
      command=/opt/python/run/venv/bin/celery beat -A avtotest --loglevel=INFO

      ; remove the -A avtotest argument if you are not using an app instance

      directory=/opt/python/current/app
      user=nobody
      numprocs=1
      stdout_logfile=/var/log/celerybeat.log
      stderr_logfile=/var/log/celerybeat.log
      autostart=true
      autorestart=true
      startsecs=10

      ; Need to wait for currently executing tasks to finish at shutdown.
      ; Increase this if you have very long running tasks.
      stopwaitsecs = 600

      ; When resorting to send SIGKILL to the program to terminate it
      ; send SIGKILL to its whole process group instead,
      ; taking care of its children as well.
      killasgroup=true

      ; if rabbitmq is supervised, set its priority higher
      ; so it starts first
      priority=999

      environment=$celeryenv"

      # Create the celery and beat supervisord conf scripts
      echo "$celeryconf" | tee /opt/python/etc/celery.conf
      echo "$celerybeatconf" | tee /opt/python/etc/celerybeat.conf

      # Add configuration script to supervisord conf (if not there already)
      if ! grep -Fxq "[include]" /opt/python/etc/supervisord.conf
          then
          echo "[include]" | tee -a /opt/python/etc/supervisord.conf
          echo "files: celery.conf" | tee -a /opt/python/etc/supervisord.conf
          echo "files: celerybeat.conf" | tee -a /opt/python/etc/supervisord.conf
      fi

      # Reread the supervisord config
      supervisorctl -c /opt/python/etc/supervisord.conf reread

      # Update supervisord in cache without restarting all services
      supervisorctl -c /opt/python/etc/supervisord.conf update

      # Start/Restart celeryd through supervisord
      supervisorctl -c /opt/python/etc/supervisord.conf restart celeryd

This file daemonizes both celery and celerybeat. The celery worker runs fine, but celerybeat does not: no celerybeat.log file is ever created, which I take to mean celerybeat is not running.

Any ideas?

I will post more code if needed. Thanks for the help.

Comments:

Shouldn't the last line of the script also restart celerybeat?

Answer 1:

Your supervisord syntax is slightly off. First, you will probably need to SSH into your instance and edit the supervisord.conf file directly (vim /opt/python/etc/supervisord.conf) to fix these lines in place.

echo "[include]" | tee -a /opt/python/etc/supervisord.conf
echo "files: celery.conf" | tee -a /opt/python/etc/supervisord.conf
echo "files: celerybeat.conf" | tee -a /opt/python/etc/supervisord.conf

should be

echo "[include]" | tee -a /opt/python/etc/supervisord.conf
echo "files: celery.conf celerybeat.conf" | tee -a /opt/python/etc/supervisord.conf
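The difference matters because supervisord expects all included file globs on a single `files:` key; a repeated key just overwrites the first. A minimal, self-contained sketch of the corrected, idempotent append follows (it runs against a scratch file so it can be tried anywhere; in the real deploy hook the target is /opt/python/etc/supervisord.conf):

```shell
#!/usr/bin/env bash
# Demo of the corrected [include] append on a scratch copy of the config;
# the real deploy hook targets /opt/python/etc/supervisord.conf instead.
conf=$(mktemp)
printf '[supervisord]\nlogfile=/tmp/supervisord.log\n' > "$conf"

# Idempotent: only append the section if it is not already present.
if ! grep -Fxq "[include]" "$conf"; then
    echo "[include]" >> "$conf"
    echo "files: celery.conf celerybeat.conf" >> "$conf"
fi
cat "$conf"
```

Running the snippet twice leaves exactly one `[include]` section, which is the property the original hook relied on.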

Edit:

To run celerybeat, and to make sure it runs only once across all your machines, put these lines in your config file --

04_killotherbeats:
  command: "ps auxww | grep 'celery beat' | awk '{print $2}' | sudo xargs kill -9 || true"
05_restartbeat:
  command: "supervisorctl -c /opt/python/etc/supervisord.conf restart celerybeat"
  leader_only: true
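Note the braces around the awk program: the pattern-only form `awk 'print $2'` is a syntax error. A quick sketch, using a made-up line of `ps auxww` output, shows the corrected pipeline isolating the PID column (on the instance the result would be piped on to `sudo xargs kill -9`):

```shell
#!/usr/bin/env bash
# Demo of the corrected kill pipeline on a canned line of `ps auxww`
# output; the process path and PID here are hypothetical.
sample="nobody    4321  0.0  0.5 123456 7890 ?  S  10:00  0:00 /opt/python/run/venv/bin/celery beat -A avtotest"
pids=$(echo "$sample" | grep 'celery beat' | awk '{print $2}')
echo "$pids"    # the second whitespace-separated field: 4321
```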

Comments:

I also had to (1) add --pidfile=/tmp/celerybeat.pid to the celerybeat command in celerybeat.conf and (2) add ignoreErrors: true to 04_killotherbeats to get this working on an Elastic Beanstalk worker instance.

@HakanB. Could you elaborate? My celerybeat daemon gets killed at random and then refuses to start, complaining that supervisord cannot be found.

Everything works, but celerybeat gets launched on every instance. The result is duplicated periodic tasks…

@MatthieuDELMAIRE That shouldn't happen if you have leader_only: true. The catch with that setting is that if you use auto-scaling, your leader instance may get killed.

@jonzlin95 I know, that's why it's strange. When I SSH into the instances, I can see a celery beat process on every one of them.
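Folding the first comment's two fixes into the answer's snippet, the worker-instance variant might look like this. This is a sketch under the same path assumptions as above; `ignoreErrors` and `leader_only` are standard Elastic Beanstalk container_commands options, and the --pidfile flag goes on the `celery beat` command line inside celerybeat.conf:

```yaml
container_commands:
  04_killotherbeats:
    command: "ps auxww | grep 'celery beat' | awk '{print $2}' | sudo xargs kill -9 || true"
    ignoreErrors: true
  05_restartbeat:
    command: "supervisorctl -c /opt/python/etc/supervisord.conf restart celerybeat"
    leader_only: true
```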
