This morning Zabbix fired an "OSD Down" alert along with an "OSD nearfull" warning. Logging in to the Ceph servers, I found that osd.7 was reporting nearfull and osd.19 was no longer in the cluster. From the symptoms, the disk behind osd.19 had apparently died, its data had been migrated to the other OSDs, and that migration pushed osd.7 into the nearfull state. I tried marking it back in with ceph osd in osd.19 and bringing the daemon back up, but neither had any effect. Clearly the problem was not that simple and had to be worked through step by step. I am writing up the whole process in the hope that it helps people who are just starting out with Ceph operations.
# Check the Ceph cluster status
root@OB001:~# ceph status
cluster:
id: b1cf0cd3-28f9-4fc4-a495-d5bafb3195ec
health: HEALTH_WARN
1 nearfull osd(s)
1 pool(s) nearfull
services:
mon: 3 daemons, quorum OB001,OB002,OB003
mgr: OB002(active), standbys: OB001, OB003
# Here you can see that one OSD is already down
osd: 20 osds: 19 up, 19 in
data:
pools: 1 pools, 1024 pgs
objects: 830.61k objects, 3.09TiB
usage: 6.17TiB used, 2.13TiB / 8.29TiB avail
pgs: 1024 active+clean
io:
client: 12.5KiB/s rd, 6.43MiB/s wr, 11op/s rd, 247op/s wr
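Before touching anything, it is worth seeing exactly which OSD is nearfull and how full each OSD is. The following standard Ceph commands (not part of the original session) show the detail behind the warning and the per-OSD utilization.
# Show the health warning detail and per-OSD usage
ceph health detail
ceph osd df tree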
# Check the OSD status
root@OB001:~# ceph osd status
+----+-------+-------+-------+--------+---------+--------+---------+--------------------+
| id | host | used | avail | wr ops | wr data | rd ops | rd data | state |
+----+-------+-------+-------+--------+---------+--------+---------+--------------------+
| 0 | OB001 | 356G | 90.1G | 30 | 79.2k | 0 | 0 | exists,up |
| 1 | OB001 | 342G | 104G | 17 | 86.8k | 1 | 0 | exists,up |
| 2 | OB001 | 295G | 151G | 4 | 85.6k | 0 | 0 | exists,up |
| 3 | OB001 | 321G | 125G | 2 | 8396 | 0 | 0 | exists,up |
| 4 | OB001 | 334G | 112G | 7 | 50.4k | 0 | 0 | exists,up |
| 5 | OB002 | 238G | 208G | 4 | 27.3k | 0 | 0 | exists,up |
| 6 | OB002 | 320G | 126G | 8 | 260k | 0 | 0 | exists,up |
| 7 | OB002 | 383G | 63.4G | 2 | 11.8k | 0 | 0 | exists,nearfull,up | # the nearfull OSD
| 8 | OB002 | 361G | 85.1G | 15 | 678k | 0 | 0 | exists,up |
| 9 | OB002 | 291G | 155G | 7 | 28.0k | 0 | 0 | exists,up |
| 10 | OB003 | 368G | 78.2G | 22 | 127k | 0 | 0 | exists,up |
| 11 | OB003 | 323G | 123G | 3 | 52.9k | 0 | 0 | exists,up |
| 12 | OB003 | 319G | 127G | 2 | 93.0k | 1 | 0 | exists,up |
| 13 | OB003 | 374G | 72.9G | 2 | 18.1k | 1 | 0 | exists,up |
| 14 | OB003 | 345G | 101G | 23 | 169k | 1 | 0 | exists,up |
| 15 | OB004 | 305G | 141G | 4 | 47.2k | 0 | 0 | exists,up |
| 16 | OB004 | 360G | 86.5G | 7 | 39.2k | 1 | 0 | exists,up |
| 17 | OB004 | 358G | 88.7G | 4 | 48.8k | 0 | 0 | exists,up |
| 18 | OB004 | 313G | 133G | 13 | 196k | 0 | 0 | exists,up |
| 19 | OB004 | 0 | 0 | 0 | 0 | 0 | 0 | exists | # the down OSD
+----+-------+-------+-------+--------+---------+--------+---------+--------------------+
# Locate the physical node that hosts the OSD
root@OB001:~# ceph osd find 19
{
"osd": 19,
"ip": "172.16.0.208:6816/3245",
"osd_fsid": "75c6907d-942a-43b4-ba2a-ed8d90a416e2",
"crush_location": {
"host": "OB004", # node hosting the OSD
"root": "default"
}
}
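As an optional cross-check (not part of the original procedure), ceph osd metadata reports the host, device and backend details the monitors last recorded for an OSD; the exact fields shown depend on the backend and Ceph version.
# Show the recorded host/device metadata for osd.19
ceph osd metadata 19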
# Connect to the corresponding server, node OB004, and use dmesg to check for hardware detection or disconnection messages; sdf is clearly throwing I/O errors.
root@OB004:~# dmesg -T
[Sat Apr 11 09:06:51 2020] sd 5:0:0:0: [sdf] tag#16 CDB: Read(10) 28 00 37 e4 36 00 00 00 08 00
[Sat Apr 11 09:06:51 2020] print_req_error: I/O error, dev sdf, sector 937702912
[Sat Apr 11 09:06:51 2020] sd 5:0:0:0: [sdf] tag#17 FAILED Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
[Sat Apr 11 09:06:51 2020] sd 5:0:0:0: [sdf] tag#17 CDB: Read(10) 28 00 37 e4 36 a0 00 00 08 00
[Sat Apr 11 09:06:51 2020] print_req_error: I/O error, dev sdf, sector 937703072
[Sat Apr 11 09:06:51 2020] sd 5:0:0:0: [sdf] tag#18 FAILED Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
[Sat Apr 11 09:06:51 2020] sd 5:0:0:0: [sdf] tag#18 CDB: Read(10) 28 00 00 00 00 00 00 00 08 00
[Sat Apr 11 09:06:51 2020] print_req_error: I/O error, dev sdf, sector 0
[Sat Apr 11 09:29:50 2020] XFS (sdf1): metadata I/O error: block 0x80 ("xfs_trans_read_buf_map") error 5 numblks 128
[Sat Apr 11 09:29:50 2020] XFS (sdf1): xfs_imap_to_bp: xfs_trans_read_buf() returned error -5.
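On a busy node the dmesg buffer can be long; filtering for the suspect device keeps only the relevant lines (a small convenience, assuming the device is sdf as above).
# Show only recent kernel messages mentioning sdf or I/O errors
dmesg -T | grep -iE 'sdf|i/o error' | tail -n 20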
# The corresponding OSD log shows the same Input/output error.
root@OB004:~# tail -n 10 /var/log/ceph/ceph-osd.19.log
2020-04-11 09:30:48.726470 7f8e96983e00 -1 ** ERROR: unable to open OSD superblock on /var/lib/ceph/osd/ceph-19: (5) Input/output error
2020-04-11 09:31:08.962079 7ff8f2065e00 0 set uid:gid to 64045:64045 (ceph:ceph)
2020-04-11 09:31:08.962096 7ff8f2065e00 0 ceph version 12.2.11 (c96e82ac735a75ae99d4847983711e1f2dbf12e5) luminous (stable), process ceph-osd, pid 3613380
2020-04-11 09:31:08.963319 7ff8f2065e00 -1 ** ERROR: unable to open OSD superblock on /var/lib/ceph/osd/ceph-19: (5) Input/output error
2020-04-11 09:31:29.218118 7f480fc15e00 0 set uid:gid to 64045:64045 (ceph:ceph)
2020-04-11 09:31:29.218142 7f480fc15e00 0 ceph version 12.2.11 (c96e82ac735a75ae99d4847983711e1f2dbf12e5) luminous (stable), process ceph-osd, pid 3613437
2020-04-11 09:31:29.219493 7f480fc15e00 -1 ** ERROR: unable to open OSD superblock on /var/lib/ceph/osd/ceph-19: (5) Input/output error
2020-04-11 09:31:49.458303 7f863b073e00 0 set uid:gid to 64045:64045 (ceph:ceph)
2020-04-11 09:31:49.458321 7f863b073e00 0 ceph version 12.2.11 (c96e82ac735a75ae99d4847983711e1f2dbf12e5) luminous (stable), process ceph-osd, pid 3613502
2020-04-11 09:31:49.459655 7f863b073e00 -1 ** ERROR: unable to open OSD superblock on /var/lib/ceph/osd/ceph-19: (5) Input/output error
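Since this is a systemd-based node (Proxmox VE), you can also confirm that systemd keeps trying and failing to start the daemon; a quick check, assuming the standard ceph-osd@ unit naming.
# Check the unit state and the latest startup attempts for osd.19
systemctl status ceph-osd@19 --no-pager
journalctl -u ceph-osd@19 -n 20 --no-pager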
# Both vgs and lvs report I/O errors as well
root@OB004:~# vgs
/dev/sdf: read failed after 0 of 4096 at 0: Input/output error
/dev/sdf: read failed after 0 of 4096 at 480103890944: Input/output error
/dev/sdf: read failed after 0 of 4096 at 480103972864: Input/output error
/dev/sdf: read failed after 0 of 4096 at 4096: Input/output error
/dev/sdf1: read failed after 0 of 512 at 104792064: Input/output error
/dev/sdf1: read failed after 0 of 512 at 104849408: Input/output error
/dev/sdf1: read failed after 0 of 512 at 0: Input/output error
/dev/sdf1: read failed after 0 of 512 at 4096: Input/output error
/dev/sdf1: read failed after 0 of 2048 at 0: Input/output error
/dev/sdf2: read failed after 0 of 512 at 479997984768: Input/output error
/dev/sdf2: read failed after 0 of 512 at 479998046208: Input/output error
/dev/sdf2: read failed after 0 of 512 at 0: Input/output error
/dev/sdf2: read failed after 0 of 512 at 4096: Input/output error
/dev/sdf2: read failed after 0 of 2048 at 0: Input/output error
root@OB004:~# lvs
/dev/sdf: read failed after 0 of 4096 at 0: Input/output error
/dev/sdf: read failed after 0 of 4096 at 480103890944: Input/output error
/dev/sdf: read failed after 0 of 4096 at 480103972864: Input/output error
/dev/sdf: read failed after 0 of 4096 at 4096: Input/output error
/dev/sdf1: read failed after 0 of 512 at 104792064: Input/output error
/dev/sdf1: read failed after 0 of 512 at 104849408: Input/output error
/dev/sdf1: read failed after 0 of 512 at 0: Input/output error
/dev/sdf1: read failed after 0 of 512 at 4096: Input/output error
/dev/sdf1: read failed after 0 of 2048 at 0: Input/output error
/dev/sdf2: read failed after 0 of 512 at 479997984768: Input/output error
/dev/sdf2: read failed after 0 of 512 at 479998046208: Input/output error
/dev/sdf2: read failed after 0 of 512 at 0: Input/output error
/dev/sdf2: read failed after 0 of 512 at 4096: Input/output error
/dev/sdf2: read failed after 0 of 2048 at 0: Input/output error
Putting all of the above together, the OSD is clearly dead. The most likely causes are either a failure of the disk hardware itself or corruption of the XFS file system on it. Let's work through these possibilities one by one.
Since the disks in my setup are directly attached over SATA, I used smartctl to query the disk; if your disks sit behind a RAID card, MegaCli can be used instead. Normally SMART values can be read from this server's disks, but this one no longer returns anything, so the only option left was to go to the server room and check whether the drive bay was showing a fault light. The bay had no red fault LED, just a rapidly blinking green activity light, which suggests the XFS partition is corrupted rather than the disk itself having died. The spare disk I brought along was not needed: rebuilding the XFS file system and adding the disk back as a Ceph OSD should be enough to fix it.
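For reference, if the disks sat behind an LSI/Avago MegaRAID controller instead of plain SATA, the checks would look roughly like this. This is only a sketch: the MegaCli binary path and the megaraid device ID vary per system and are assumptions here.
# List the physical drives seen by the RAID controller
/opt/MegaRAID/MegaCli/MegaCli64 -PDList -aALL | grep -E 'Slot|Firmware state|Media Error'
# Read SMART data through the controller; replace 8 with the Device Id reported by -PDList
smartctl -a -d megaraid,8 /dev/sda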
# Check the disk's SMART status
root@OB004:~# smartctl -a /dev/sdf
smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.15.18-12-pve] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Vendor:
Product:
User Capacity: 1,254,013,891,915,610,980 bytes [1254 PB]
Logical block size: 1390128140 bytes
scsiModePageOffset: raw_curr too small, offset=246 resp_len=54 bd_len=242
scsiModePageOffset: raw_curr too small, offset=246 resp_len=54 bd_len=242
>> Terminate command early due to bad response to IEC mode page
A mandatory SMART command failed: exiting. To continue, add one or more '-T permissive' options.
# List the block devices on this server to confirm that osd.19 is backed by /dev/sdf
root@OB004:~# lsblk
NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT
sdb 8:16 0 447.1G 0 disk
├─sdb1 8:17 0 100M 0 part /var/lib/ceph/osd/ceph-15
└─sdb2 8:18 0 447G 0 part
sdc 8:32 0 447.1G 0 disk
├─sdc1 8:33 0 100M 0 part /var/lib/ceph/osd/ceph-16
└─sdc2 8:34 0 447G 0 part
sdd 8:48 0 447.1G 0 disk
├─sdd1 8:49 0 100M 0 part /var/lib/ceph/osd/ceph-17
└─sdd2 8:50 0 447G 0 part
sde 8:64 0 447.1G 0 disk
├─sde1 8:65 0 100M 0 part /var/lib/ceph/osd/ceph-18
└─sde2 8:66 0 447G 0 part
sdf 8:80 0 447.1G 0 disk
├─sdf1 8:81 0 100M 0 part /var/lib/ceph/osd/ceph-19
└─sdf2 8:82 0 447G 0 part
# Unmount the OSD directory
umount /var/lib/ceph/osd/ceph-19
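If the unmount hangs or reports that the target is busy, which is common when the backing disk has died, a lazy unmount usually gets the mount point detached (an optional fallback, not needed in this case).
# Detach the mount point even if something still holds it open
umount -l /var/lib/ceph/osd/ceph-19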
# Before removal an OSD is usually up and in, so you first mark it out of the cluster so that Ceph starts rebalancing and copies its data to other OSDs. Since this disk is already dead it should normally have been marked out automatically, but we still follow the standard procedure.
ceph osd out osd.19
# Once the OSD is marked out, Ceph starts rebalancing the cluster and migrating placement groups off the OSD that is being removed. You can watch this process with the ceph tool.
ceph -w
# You will see placement group states change from active+clean to active, some degraded objects, and, once migration completes, back to active+clean.
# After being marked out the OSD may still be running, i.e. its state is up and out. Stop the OSD daemon before removing it.
systemctl stop ceph-osd@19
# Remove the OSD from the CRUSH map so that it no longer receives data. Alternatively, you can decompile the CRUSH map, remove the OSD from the device list and from its host bucket, then recompile and apply the map.
ceph osd crush remove osd.19
# Delete the OSD's authentication key
ceph auth del osd.19
# Remove the OSD from the cluster
ceph osd rm 19
# Remove the corresponding mount directory
rm -rf /var/lib/ceph/osd/ceph-19
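At this point osd.19 should be completely gone from the cluster. An optional sanity check is to confirm that neither the CRUSH map, the OSD map, nor the auth database still knows about it.
# All of these should return nothing (or an ENOENT error) for osd.19
ceph osd tree | grep osd.19
ceph osd dump | grep osd.19
ceph auth get osd.19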
# Wipe the disk and create a fresh XFS file system on it
root@OB004:~# mkfs.xfs -f /dev/sdf
meta-data=/dev/sdf isize=512 agcount=4, agsize=29303222 blks
= sectsz=512 attr=2, projid32bit=1
= crc=1 finobt=1, sparse=0, rmapbt=0, reflink=0
data = bsize=4096 blocks=117212886, imaxpct=25
= sunit=0 swidth=0 blks
naming =version 2 bsize=4096 ascii-ci=0 ftype=1
log =internal log bsize=4096 blocks=57232, version=2
= sectsz=512 sunit=0 blks, lazy-count=1
realtime =none extsz=4096 blocks=0, rtextents=0
# Re-read the partition table
partprobe
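If the upcoming pveceph createosd step refuses the disk because it still carries old partitions or signatures, the disk can be zapped first. This extra step was not needed here; ceph-disk zap still exists on Luminous, and sgdisk is an alternative.
# Wipe old partition tables and signatures before re-creating the OSD
ceph-disk zap /dev/sdf
# or, equivalently
sgdisk --zap-all /dev/sdf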
# Create the OSD
pveceph createosd /dev/sdf
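pveceph createosd is a Proxmox VE wrapper. On a plain Luminous node without Proxmox, the rough equivalent would be ceph-volume; shown here only as an alternative, not what was run in this session.
# Create a new OSD on the wiped disk with ceph-volume
ceph-volume lvm create --data /dev/sdf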
# With osd.19 now back in the cluster, data begins migrating onto it and the misplaced percentage starts to fall.
root@OB004:~# ceph status
cluster:
id: b1cf0cd3-28f9-4fc4-a495-d5bafb3195ec
health: HEALTH_WARN
1 nearfull osd(s)
1 pool(s) nearfull
136483/1661222 objects misplaced (8.216%) # misplaced is currently 8.216%
services:
mon: 3 daemons, quorum OB001,OB002,OB003
mgr: OB002(active), standbys: OB001, OB003
osd: 20 osds: 20 up, 20 in; 159 remapped pgs
data:
pools: 1 pools, 1024 pgs
objects: 830.61k objects, 3.09TiB
usage: 6.20TiB used, 2.53TiB / 8.73TiB avail
pgs: 136483/1661222 objects misplaced (8.216%)
865 active+clean
151 active+remapped+backfill_wait
8 active+remapped+backfilling
io:
client: 1.03MiB/s wr, 0op/s rd, 129op/s wr
recovery: 297MiB/s, 0keys/s, 74objects/s
root@OB004:~# ceph status
cluster:
id: b1cf0cd3-28f9-4fc4-a495-d5bafb3195ec
health: HEALTH_WARN
1 nearfull osd(s)
1 pool(s) nearfull
136308/1661222 objects misplaced (8.205%) # now down to 8.205%; the backfill is finished when this reaches 0
services:
mon: 3 daemons, quorum OB001,OB002,OB003
mgr: OB002(active), standbys: OB001, OB003
osd: 20 osds: 20 up, 20 in; 159 remapped pgs
data:
pools: 1 pools, 1024 pgs
objects: 830.61k objects, 3.09TiB
usage: 6.20TiB used, 2.53TiB / 8.73TiB avail
pgs: 136308/1661222 objects misplaced (8.205%)
865 active+clean
151 active+remapped+backfill_wait
8 active+remapped+backfilling
io:
client: 1.18MiB/s wr, 0op/s rd, 160op/s wr
recovery: 304MiB/s, 76objects/s
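Rather than re-running ceph status by hand, you can watch the misplaced percentage until it drops to zero (a small convenience, assuming the watch utility is installed).
# Refresh the relevant status lines every 30 seconds
watch -n 30 "ceph -s | grep -E 'misplaced|backfill'"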
With the steps above, the OSD failure is essentially resolved. The last thing to do is verify the result; the full recovery cycle can take quite a while and varies between environments. Once recovery completes normally, the status looks like this.
# Ceph cluster status
root@OB004:~# ceph status
cluster:
id: b1cf0cd3-28f9-4fc4-a495-d5bafb3195ec
health: HEALTH_OK
services:
mon: 3 daemons, quorum OB001,OB002,OB003
mgr: OB002(active), standbys: OB001, OB003
osd: 20 osds: 20 up, 20 in
data:
pools: 1 pools, 1024 pgs
objects: 820.63k objects, 3.02TiB
usage: 6.03TiB used, 2.70TiB / 8.73TiB avail
pgs: 1024 active+clean
io:
client: 52.1KiB/s rd, 6.79MiB/s wr, 0op/s rd, 684op/s wr
# OSD status
root@OB004:~# ceph osd status
+----+-------+-------+-------+--------+---------+--------+---------+-----------+
| id | host | used | avail | wr ops | wr data | rd ops | rd data | state |
+----+-------+-------+-------+--------+---------+--------+---------+-----------+
| 0 | OB001 | 342G | 104G | 71 | 155k | 0 | 0 | exists,up |
| 1 | OB001 | 322G | 124G | 9 | 36.0k | 1 | 819 | exists,up |
| 2 | OB001 | 270G | 176G | 0 | 4915 | 0 | 819 | exists,up |
| 3 | OB001 | 301G | 145G | 6 | 122k | 0 | 0 | exists,up |
| 4 | OB001 | 308G | 138G | 8 | 51.3k | 0 | 0 | exists,up |
| 5 | OB002 | 226G | 220G | 0 | 0 | 0 | 0 | exists,up |
| 6 | OB002 | 306G | 140G | 29 | 288k | 0 | 0 | exists,up |
| 7 | OB002 | 358G | 88.4G | 5 | 25.7k | 0 | 0 | exists,up |
| 8 | OB002 | 339G | 107G | 1 | 8192 | 0 | 0 | exists,up |
| 9 | OB002 | 272G | 174G | 1 | 7270 | 0 | 0 | exists,up |
| 10 | OB003 | 344G | 102G | 35 | 65.8k | 0 | 0 | exists,up |
| 11 | OB003 | 309G | 137G | 0 | 1945 | 0 | 0 | exists,up |
| 12 | OB003 | 299G | 147G | 8 | 196k | 0 | 0 | exists,up |
| 13 | OB003 | 356G | 91.0G | 1 | 7577 | 1 | 0 | exists,up |
| 14 | OB003 | 322G | 124G | 8 | 72.9k | 1 | 0 | exists,up |
| 15 | OB004 | 261G | 185G | 10 | 153k | 0 | 0 | exists,up |
| 16 | OB004 | 337G | 109G | 0 | 5734 | 1 | 0 | exists,up |
| 17 | OB004 | 307G | 139G | 0 | 3276 | 0 | 0 | exists,up |
| 18 | OB004 | 286G | 160G | 160 | 835k | 0 | 0 | exists,up |
| 19 | OB004 | 297G | 149G | 308 | 1468k | 0 | 0 | exists,up |
+----+-------+-------+-------+--------+---------+--------+---------+-----------+
root@OB004:~# ceph osd tree
ID CLASS WEIGHT TYPE NAME STATUS REWEIGHT PRI-AFF
-1 8.73199 root default
-3 2.18300 host OB001
0 ssd 0.43660 osd.0 up 1.00000 1.00000
1 ssd 0.43660 osd.1 up 1.00000 1.00000
2 ssd 0.43660 osd.2 up 1.00000 1.00000
3 ssd 0.43660 osd.3 up 1.00000 1.00000
4 ssd 0.43660 osd.4 up 1.00000 1.00000
-5 2.18300 host OB002
5 ssd 0.43660 osd.5 up 1.00000 1.00000
6 ssd 0.43660 osd.6 up 1.00000 1.00000
7 ssd 0.43660 osd.7 up 1.00000 1.00000
8 ssd 0.43660 osd.8 up 1.00000 1.00000
9 ssd 0.43660 osd.9 up 1.00000 1.00000
-7 2.18300 host OB003
10 ssd 0.43660 osd.10 up 1.00000 1.00000
11 ssd 0.43660 osd.11 up 1.00000 1.00000
12 ssd 0.43660 osd.12 up 1.00000 1.00000
13 ssd 0.43660 osd.13 up 1.00000 1.00000
14 ssd 0.43660 osd.14 up 1.00000 1.00000
-9 2.18300 host OB004
15 ssd 0.43660 osd.15 up 1.00000 1.00000
16 ssd 0.43660 osd.16 up 1.00000 1.00000
17 ssd 0.43660 osd.17 up 1.00000 1.00000
18 ssd 0.43660 osd.18 up 1.00000 1.00000
19 ssd 0.43660 osd.19 up 1.00000 1.00000
# SMART data for the disk can now be read normally
root@OB004:~# smartctl -a /dev/sdf
smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.15.18-12-pve] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Device Model: Q200 EX.
Serial Number: 29CS1024TQYT
LU WWN Device Id: 5 00080d 911346382
Firmware Version: JYRA0102
User Capacity: 480,103,981,056 bytes [480 GB]
Sector Size: 512 bytes logical/physical
Rotation Rate: Solid State Device
Form Factor: 2.5 inches
Device is: Not in smartctl database [for details use: -P showall]
ATA Version is: ACS-2 (minor revision not indicated)
SATA Version is: SATA 3.1, 6.0 Gb/s (current: 3.0 Gb/s)
Local Time is: Mon Apr 13 10:52:33 2020 CST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x00) Offline data collection activity
was never started.
Auto Offline Data Collection: Disabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 120) seconds.
Offline data collection
capabilities: (0x5b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
No Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 19) minutes.
SCT capabilities: (0x003d) SCT Status supported.
SCT Error Recovery Control supported.
SCT Feature Control supported.
SCT Data Table supported.
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000a 100 100 000 Old_age Always - 0
2 Throughput_Performance 0x0005 100 100 050 Pre-fail Offline - 0
3 Spin_Up_Time 0x0007 100 100 050 Pre-fail Always - 0
5 Reallocated_Sector_Ct 0x0013 100 100 050 Pre-fail Always - 0
7 Unknown_SSD_Attribute 0x000b 100 100 050 Pre-fail Always - 0
8 Unknown_SSD_Attribute 0x0005 100 100 050 Pre-fail Offline - 0
9 Power_On_Hours 0x0012 100 100 000 Old_age Always - 8778
10 Unknown_SSD_Attribute 0x0013 100 100 050 Pre-fail Always - 0
12 Power_Cycle_Count 0x0012 100 100 000 Old_age Always - 9
167 Unknown_Attribute 0x0022 100 100 000 Old_age Always - 0
168 Unknown_Attribute 0x0012 100 100 000 Old_age Always - 0
169 Unknown_Attribute 0x0013 100 100 010 Pre-fail Always - 100
173 Unknown_Attribute 0x0012 189 189 000 Old_age Always - 0
175 Program_Fail_Count_Chip 0x0013 100 100 010 Pre-fail Always - 0
192 Power-Off_Retract_Count 0x0012 100 100 000 Old_age Always - 6
194 Temperature_Celsius 0x0023 079 067 020 Pre-fail Always - 21 (Min/Max 14/33)
197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0
240 Unknown_SSD_Attribute 0x0013 100 100 050 Pre-fail Always - 0
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
No self-tests have been logged. [To run self-tests, use: smartctl -t]
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
That concludes the recovery of this Ceph OSD failure. One thing to keep in mind: if your cluster is large, the disks are big, and a lot of data has to be backfilled, you should throttle the backfill rate so that it does not impact normal client traffic. Doing so will of course stretch out the time it takes for the cluster to return to HEALTH_OK, so tune it according to the state of your own cluster.
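To throttle backfill while the cluster recovers, the usual knobs are osd_max_backfills, osd_recovery_max_active and osd_recovery_sleep. A conservative setting might look like the following; the values are illustrative, injected at runtime, and not persisted across OSD restarts.
# Slow recovery down so that client I/O keeps priority
ceph tell osd.* injectargs '--osd-max-backfills 1 --osd-recovery-max-active 1 --osd-recovery-sleep 0.1'
# Once client load allows, speed recovery back up
ceph tell osd.* injectargs '--osd-max-backfills 2 --osd-recovery-max-active 3 --osd-recovery-sleep 0'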
That is all from Muzi on Ceph OSD failure recovery; likes, comments, and shares are all welcome.