(11) Ceph warning: 1 slow ops, oldest one blocked for

(1) Ceph reports the warning: 1 slow ops, oldest one blocked for
[root@node143 ~]# ceph -s
  cluster:
    id:     58a12719-a5ed-4f95-b312-6efd6e34e558
    health: HEALTH_WARN
            1 slow ops, oldest one blocked for 1416 sec, mon.node142 has slow ops

  services:
    mon: 2 daemon, quorum node140,node142 (age 8d)
    mgr: admin(active, since 8d), standbys: node140
    mds: cephfs:1 0=node140=up:active 1 up:standby
    osd: 22 osds: 22 up (since 23m), 18 in (since 29m)

  data:
    pools:   5 pools, 768 pgs
    objects: 2.65k objects, 9.9 GiB
    usage:   53 GiB used, 12 TiB / 12 TiB avail
    pgs:     768 active+clean
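
When this warning appears, it helps to pull out which daemon is affected and how long the oldest op has been blocked. The following is a minimal sketch, not an official tool: `parse_slow_ops` is a hypothetical helper name, and the pattern only covers the single-daemon message format shown in the output above.

```shell
#!/usr/bin/env bash
# Hypothetical helper: extract the daemon name and the blocked time from a
# "slow ops" health line in the format printed by `ceph -s` above.
# Assumption: one daemon per line, e.g. "... mon.node142 has slow ops".
parse_slow_ops() {
    # Reads health text on stdin; prints "<daemon> <seconds>" for each match.
    sed -n 's/.*oldest one blocked for \([0-9][0-9]*\) sec, \([^ ]*\) has slow ops.*/\2 \1/p'
}

# Usage on a live cluster (assumes the ceph CLI and an admin keyring):
#   ceph -s | parse_slow_ops
```

For the output above this would report `mon.node142` and `1416`, pointing at the monitor on node142 as the daemon to look at.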

(2) Check the ntpd service

[root@node143 ~]# systemctl status  ntpd
[root@node143 ~]# systemctl start ntpd     # ntpd had not been started on the newly added node
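
Ceph monitors are sensitive to clock skew between nodes, which is presumably why ntpd is checked here. Once ntpd is running, it can also be worth confirming that it has actually selected a time source. The sketch below is an assumption-laden helper, not part of the original fix: `ntp_selected_offset_ms` is a hypothetical name, and it relies on the standard `ntpq -pn` table layout, where the selected peer is marked with `*` and the offset (in milliseconds) is the ninth column.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print the offset (ms) of the peer ntpd has selected.
# Reads `ntpq -pn` output on stdin. The selected system peer is the line
# that starts with "*"; column 9 of that line is the offset.
# Returns non-zero when no peer has been selected yet.
ntp_selected_offset_ms() {
    awk 'substr($0, 1, 1) == "*" { print $9; found = 1; exit }
         END { exit !found }'
}

# Usage on a node where ntpd is running:
#   ntpq -pn | ntp_selected_offset_ms || echo "ntpd has no selected peer yet"
```

A non-zero exit right after starting ntpd is normal; it can take a few polling intervals before a peer is selected.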

(3) Restart the monitor service on node140 and node142

[root@node140 ceph]# systemctl  restart  ceph-mon.target
[root@node140 ceph]# systemctl  status ceph-mon.target
[root@node142 ceph]# systemctl  restart  ceph-mon.target
[root@node142 ceph]# systemctl  status ceph-mon.target
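
The same restart can be scripted so the monitors are handled one node at a time. This is only a sketch under assumptions: `restart_mons` and the `RUN` variable are hypothetical names, passwordless ssh as root is assumed, and `systemctl is-active` is used as a script-friendly stand-in for the `systemctl status` check above. Note that with only two monitors, quorum requires both of them, so each restart briefly interrupts quorum.

```shell
#!/usr/bin/env bash
# Hypothetical helper: restart ceph-mon.target on each given node in turn
# and stop at the first node where the restart or the check fails.
# RUN is an assumed hook: it defaults to ssh; set RUN=echo for a dry run
# that only prints what would be executed.
restart_mons() {
    local run="${RUN:-ssh}" node
    for node in "$@"; do
        $run "$node" systemctl restart ceph-mon.target || return 1
        $run "$node" systemctl is-active ceph-mon.target || return 1
    done
}

# Dry run:   RUN=echo restart_mons node140 node142
# Real run:  restart_mons node140 node142
```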

(4) Verify that the cluster has recovered

[root@node140 ceph]# ceph -s 
  cluster:
    id:     58a12719-a5ed-4f95-b312-6efd6e34e558
    health: HEALTH_OK

  services:
    mon: 2 daemon, quorum node140,node142 (age 3m)
    mgr: admin(active, since 8d), standbys: node140
    mds: cephfs:1 0=node140=up:active 1 up:standby
    osd: 22 osds: 22 up (since 31m), 18 in (since 36m)

  data:
    pools:   5 pools, 768 pgs
    objects: 2.65k objects, 9.9 GiB
    usage:   53 GiB used, 12 TiB / 12 TiB avail
    pgs:     768 active+clean
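
To check the result from a script rather than by eye, the health field can be read out of the `ceph -s` output. A minimal sketch, assuming the plain-text layout shown above; `cluster_health` is a hypothetical helper name.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print the value of the "health:" field
# (e.g. HEALTH_OK or HEALTH_WARN) from `ceph -s` output read on stdin.
cluster_health() {
    awk '$1 == "health:" { print $2; exit }'
}

# Usage on a live cluster (assumes the ceph CLI and an admin keyring):
#   [ "$(ceph -s | cluster_health)" = "HEALTH_OK" ] && echo "cluster recovered"
```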

Reference: https://blog.csdn.net/genglei1022/article/details/82461053
