prometheus-operator监控traefik-Ingress组件状态

Posted lvcisco

tags:

篇首语:本文由小常识网(cha138.com)小编为大家整理,主要介绍了prometheus-operator监控traefik-Ingress组件状态相关的知识,希望对你有一定的参考价值。

系统环境:

Prometheus Operator版本: 0.29 Kubernetes 版本: 1.14.0

一、Traefik 配置文件设置 Prometheus

要监控 Traefik 控制器,首先要控制 Traeik 将 Metrics 数据暴露出来,这需要在配置文件中加入下面配置:

[metrics]
  [metrics.prometheus]
    entryPoint = "traefik"
    buckets = [0.1,0.3,1.2,5.0]

安装 Traefik 时候已经将配置文件外挂到 Kubernetes ConfigMap 中,详情可以参考 Kubernetes 部署 Traefik Ingress 一文。

例如,集群中将 Traefik 配置文件挂载到 Kubernetes ConfigMap 中,可以用 “kubectl etid” 命令编辑 Traefik 配置文件,加上 Prometheus 配置,这里提供本人完整配置如下:

$ kubectl edit ConfigMap traefik-config -n kube-system

apiVersion: v1
data:
  traefik.toml: |
    # traefik.toml
    debug = true
    InsecureSkipVerify = true
    defaultEntryPoints = ["http","https"]
    [entryPoints]
      [entryPoints.http]
        address = ":80"
        compress = true
      [entryPoints.https]
        address = ":443"
        compress = true
        [entryPoints.https.tls]
          [[entryPoints.https.tls.certificates]]
            CertFile = "/ssl/tls.crt"
            KeyFile = "/ssl/tls.key"
      [entryPoints.traefik]
        address = ":8080"
    [kubernetes]
    [traefikLog]
      format = "json"
      #filePath = "/data/traefik.log"
    [accessLog]
      #filePath = "/data/access.log"
      format = "json"
      [accessLog.filters]
        retryAttempts = true
        minDuration = "10ms"
      [accessLog.fields]
        defaultMode = "keep"
        [accessLog.fields.names]
        "ClientUsername" = "drop"
        [accessLog.fields.headers]
          defaultMode = "keep"
          [accessLog.fields.headers.names]
            "User-Agent" = "redact"
            "Authorization" = "drop"
            "Content-Type" = "keep"
    [api]
      entryPoint = "traefik"
      dashboard = true
    [metrics]
      [metrics.prometheus]
        entryPoint = "traefik"
        buckets = [0.1,0.3,1.2,5.0]

二、Traefik Service 设置标签

Prometheus Operator 是通过 Label 匹配的,需要提前设置 Service 贴上“k8s-app: traefik-ingress”标签

1、查看 Traefik Service

$ kubectl get service -n kube-system

kube-dns                  ClusterIP   10.10.0.10      <none>        53/UDP,53/TCP,9153/TCP    79d
kubelet                   ClusterIP   None            <none>        10250/TCP                 35d
traefik-ingress-service   ClusterIP   10.10.114.105   <none>        80/TCP,443/TCP,8080/TCP   56d

2、编辑该 Service 设置 Label

编辑 Traefik Service

$ kubectl edit service traefik-ingress-service -n kube-system

设置 Label “k8s-app: traefik-ingress”

apiVersion: v1
kind: Service
metadata:
  creationTimestamp: "2019-04-15T05:06:41Z"
  name: traefik-ingress-service
  namespace: kube-system
  resourceVersion: "85575"
  selfLink: /api/v1/namespaces/kube-system/services/traefik-ingress-service
  uid: 4172b4df-5f3c-11e9-9287-000c29d98697
  labels:                       
    k8s-app: traefik-ingress        #---增加标签 “k8s-app: traefik-ingress”
spec:
  clusterIP: 10.10.114.105
  ports:
  - name: http
    port: 80
    protocol: TCP
    targetPort: 80
  - name: https
    port: 443
    protocol: TCP
    targetPort: 443
  - name: admin      #---Prometheus metrics 数据是通过8080端口暴露的
    port: 8080
    protocol: TCP
    targetPort: 8080
  selector:
    k8s-app: traefik-ingress-lb
  sessionAffinity: None
  type: ClusterIP
status:
  loadBalancer: {}

三、Prometheus Operator 配置监控规则

配置服务监控资源,用于监控 Traefik 控制器:

traefik-monitor.yaml

apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
  name: traefik-ingress
  namespace: monitoring
  labels:
    k8s-app: traefik-ingress
spec:
  jobLabel: k8s-app
  endpoints:
  - port: admin              #---设置为traefik 8080端口名称 admin
    interval: 30s
  selector:
    matchLabels:
      k8s-app: traefik-ingress
  namespaceSelector:
    matchNames:
    - kube-system

创建该Service Monitor

$ kubectl apply -f traefik-monitor.yaml

四、查看 Prometheus 规则

打开 Prometheus UI,查看 Prometheus 规则,可以看到 traefik 数据已经存在。

技术图片

五、Grafana 引入仪表盘

打开 Grafana,在其中引入编号“4475”的仪表盘

技术图片

 

 技术图片

然后就可以看到仪表盘

如果没有数据,请提前通过 Traefik Ingress 访问其配置的域名,刷新出一些数据,然后调整小时间范围。

技术图片

以上是关于prometheus-operator监控traefik-Ingress组件状态的主要内容,如果未能解决你的问题,请参考以下文章

prometheus-operator监控traefik-Ingress组件状态

kubeadm构建k8s之Prometheus-operated监控

kubernetes1.23.1部署prometheus-operator

Kubernetes集群监控解决方案kube-prometheus-stack(prometheus-operator)helm安装

Kubernetes集群监控解决方案kube-prometheus-stack(prometheus-operator)helm安装

Prometheus-operator架构详解