Prometheus+Altermanager钉钉报警

Posted dev0ps

tags:

篇首语:本文由小常识网(cha138.com)小编为大家整理,主要介绍了Prometheus+Altermanager钉钉报警相关的知识,希望对你有一定的参考价值。

Prometheus+Altermanager钉钉报警

一、添加钉钉机器人

参考钉钉官方文档:https://ding-doc.dingtalk.com/doc#/serverapi2/qf2nxq

技术图片

 

 

 二、在k8s上部署钉钉告警deployment,这里我们引用第三方的插件。

[root@cn-hongkong webhook-dingtalk]# cat webhook-dingtalk.yaml
apiVersion: apps/v1beta2
kind: Deployment
metadata:
  labels:
    app: webhook-dingtalk
  name: webhook-dingtalk
  namespace: monitoring
  #需要和alertmanager在同一个namespace
spec:
  replicas: 1
  selector:
    matchLabels:
      app: webhook-dingtalk
  template:
    metadata:
      labels:
        app: webhook-dingtalk
    spec:
      containers:
      - image: billy98/webhook-dingtalk:latest
        name: webhook-dingtalk
        args:
        - "https://oapi.dingtalk.com/robot/send?access_token=1fd59067ab85bea575122a5e4f05cefd6609d9d3e41a725e46a90c2fad9b3"
        #上面创建的钉钉机器人hook
        ports:
        - containerPort: 8080
          protocol: TCP
        resources:
          requests:
            cpu: 100m
            memory: 100Mi
          limits:
            cpu: 500m
            memory: 500Mi
        livenessProbe:
          failureThreshold: 3
          initialDelaySeconds: 30
          periodSeconds: 10
          successThreshold: 1
          timeoutSeconds: 1
          tcpSocket:
            port: 8080
        readinessProbe:
          failureThreshold: 3
          initialDelaySeconds: 30
          periodSeconds: 10
          successThreshold: 1
          timeoutSeconds: 1
          httpGet:
            port: 8080
            path: /
      imagePullSecrets:
        - name: IfNotPresent
---
apiVersion: v1
kind: Service
metadata:
  labels:
    app: webhook-dingtalk
  name: webhook-dingtalk
  namespace: monitoring
  #需要和alertmanager在同一个namespace
spec:
  ports:
  - name: http
    port: 80
    protocol: TCP
    targetPort: 8080
  selector:
    app: webhook-dingtalk
  type: ClusterIP 

 三、alertmanager添加钉钉告警类型

  config:
    global:
      resolve_timeout: 5m
    route:
      group_by: [‘job‘,‘severity‘]
      group_wait: 30s
      group_interval: 5m
      repeat_interval: 12h
      receiver: default
      receiver: webhook
      routes:
      - match:
          filesystem: node
        receiver: webhook
    receivers:
    - name: webhook
      webhook_configs:
      - url: http://webhook-dingtalk/dingtalk/send/
        send_resolved: true

 四、查看告警信息

技术图片

 

 

以上是关于Prometheus+Altermanager钉钉报警的主要内容,如果未能解决你的问题,请参考以下文章

prometheus altermanager邮件报警

alertmanager和altermanager一样吗

Prometheus基于Alertmanager实现钉钉告警

三分钟实现Prometheus电话短信邮件钉钉飞书企业微信报警

Prometheus+Grafana+alertmanager+ 邮件 +钉钉告警

Prometheus 通过钉钉告警