美文网首页
ceph backfill

ceph backfill

作者: little_wang | 来源:发表于2023-11-23 13:38 被阅读0次

第一次backfill 循环

step1: 
     new_backfill: 1
     get_backfill_targets(): 2
     last_backfill_started: MIN
     local pg_info_t: 1.0( v 20"20000 (20"9900,20"20000"] local-lis/les=32/33 n=20000 ec=3/3 lis/c=32/19 les/c/f=33/20/0 sis=32)
     earliest_backfill(): MIN

step2: before update_range 
     backfill_info: BackfillInfo(MIN-MIN 0 version: 0"0 objects)"
     pending_backfill_updates: {}
     backfills_in_flight: 
     peer: 2
         peer last_backfill: MIN
         pbi: BackfillInfo(MIN-MIN 0 version: 0"0 objects)"
         peer pg_info_t: 1.0( v 20"20000 (20"10000,20"20000"] lb MIN local-lis/les=32/33 n=0 ec=3/3 lis/c=32/19 les/c/f=33/20/0 sis=32)
step3: after update_range 
     local backfill_info: BackfillInfo(MIN-1:067f54e0:::test19063.txt:head 511 version: 20"20000 objects 
     {1:00001e5d:::test4054.txt:head=20"4054,
     1:0005c6e0:::test18392.txt:head=20"18392,
     1:000a2176:::test11801.txt:head=20"11801,
     1:000d9c27:::test1033.txt:head=20"1033,
     ,,,,,,,,,1:06738052:::test4708.txt:head=20"4708,1:067e20fb:::test10941.txt:head=20"10941})              
step4: befor while (ops < max)
     local backfill_info: BackfillInfo(1:00001e5d:::test4054.txt:head-1:067f54e0:::test19063.txt:head 511 version: 20"20000 objects {1:00001e5d:::test4054.txt:head=20"4054,1:0005c6e0:::test18392.txt:head=20"18392,1:000a2176:::test11801.txt:head=20"11801,1:000d9c27:::test1033.txt:head=20"1033,,,,,,,,,,1:06738052:::test4708.txt:head=20"4708,1:067e20fb:::test10941.txt:head=20"10941})
     earliest_peer_backfill(): MIN     
     peer 2
         peer last_backfill: MIN
         pbi: BackfillInfo(MIN-MIN 0 version: 0"0 objects)"

step8 in while send to peer scan
     peer: 2
     pbi: BackfillInfo(MIN-MIN 0 version: 0"0 objects)"

step 10:  out while:  before update peer last_backfill:
     backfills_in_flight: 
     backfill_info: BackfillInfo(1:00001e5d:::test4054.txt:head-1:067f54e0:::test19063.txt:head 511 version: 20"20000 objects {1:00001e5d:::test4054.txt:head=20"4054,1:0005c6e0:::test18392.txt:head=20"18392,1:000a2176:::test11801.txt:head=20"11801,1:000d9c27:::test1033.txt:head=20"1033,,,,,,,,,,1:06738052:::test4708.txt:head=20"4708,1:067e20fb:::test10941.txt:head=20"10941})
     pending_backfill_updates: {}
     last_backfill_started: MIN
     add_to_stat: 
     next_backfill_to_complete: MIN
     new_last_backfill: MIN
     backfill_pos MIN
     peer: 2
         peer last_backfill: MIN
         pbi: BackfillInfo(MIN-MIN 0 version: 0"0 objects)
         peer pg_info_t: 1.0( v 20"20000 (20"10000,20"20000] lb MIN local-lis/les=32/33 n=0 ec=3/3 lis/c=32/19 les/c/f=33/20/0 sis=32)
step : the end compute new_last_backfill
     new_last_backfill MIN         
     peer: 2
         peer last_backfill: MIN
         new_last_backfill > pinfo.last_backfill : 0
         pbi: BackfillInfo(MIN-MIN 0 version: 0"0 objects)"

do_scan 返回:
return in OP_SCAN_DIGEST BackfillInfo(MIN-MAX 0 version: 0"0 objects) from 2

第二次循环:

(PG::finish_recovery_op(hobject_t const&, bool)+0x13c)
void PrimaryLogPG::do_scan(OpRequestRef op,ThreadPool::TPHandle &handle)
step1: 
     new_backfill: 0
     get_backfill_targets(): 2
     last_backfill_started: MIN
     local pg_info_t: 1.0( v 20"20000 (20"9900,20"20000] local-lis/les=32/33 n=20000 ec=3/3 lis/c=32/19 les/c/f=33/20/0 sis=32)
     earliest_backfill(): MIN         
step2: before update_range 
     backfill_info: BackfillInfo(MIN-1:067f54e0:::test19063.txt:head 511 version: 20"20000 objects {1:00001e5d:::test4054.txt:head=20"4054,1:0005c6e0:::test18392.txt:head=20"18392,1:000a2176:::test11801.txt:head=20"11801,1:000d9c27:::test1033.txt:head=20"1033,,,,,,,,,,1:06738052:::test4708.txt:head=20"4708,1:067e20fb:::test10941.txt:head=20"10941})
     pending_backfill_updates: {}
     backfills_in_flight: 
    peer: 2
         peer last_backfill: MIN
         pbi: BackfillInfo(MIN-MAX 0 version: 0"0 objects)
         peer pg_info_t: 1.0( v 20"20000 (20"10000,20"20000] lb MIN local-lis/les=32/33 n=0 ec=3/3 lis/c=32/19 les/c/f=33/20/0 sis=32)          
step3: after update_range 
     local backfill_info: BackfillInfo(MIN-1:067f54e0:::test19063.txt:head 511 version: 20"20000 objects {1:00001e5d:::test4054.txt:head=20"4054,1:0005c6e0:::test18392.txt:head=20"18392,1:000a2176:::test11801.txt:head=20"11801,1:000d9c27:::test1033.txt:head=20"1033,,,,,,,,,,1:06738052:::test4708.txt:head=20"4708,1:067e20fb:::test10941.txt:head=20"10941})
step4: befor while (ops < max)
     local backfill_info: BackfillInfo(1:00001e5d:::test4054.txt:head-1:067f54e0:::test19063.txt:head 511 version: 20"20000 objects {1:00001e5d:::test4054.txt:head=20"4054,1:0005c6e0:::test18392.txt:head=20"18392,1:000a2176:::test11801.txt:head=20"11801,1:000d9c27:::test1033.txt:head=20"1033,,,,,,,,,,1:06738052:::test4708.txt:head=20"4708,1:067e20fb:::test10941.txt:head=20"10941})
     earliest_peer_backfill(): MAX         
     peer 2
         peer last_backfill: MIN
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)     
step inwhile:
     process file backfill_info.begin 1:00001e5d:::test4054.txt:head
     check MAX
     need_ver_targs= []
     keep_ver_targs= []
     missing_targs= [2]
     skip_targs= []         
step 9: in while after process one file
     lcoal backfill_info: BackfillInfo(1:0005c6e0:::test18392.txt:head-1:067f54e0:::test19063.txt:head 510 version: 20"20000 objects {1:0005c6e0:::test18392.txt:head=20"18392,1:000a2176:::test11801.txt:head=20"11801,1:000d9c27:::test1033.txt:head=20"1033,,,,,,,,,,1:06738052:::test4708.txt:head=20"4708,1:067e20fb:::test10941.txt:head=20"10941})
     last_backfill_started 1:00001e5d:::test4054.txt:head     
     peer 2
         peer last_backfill: MIN
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)
step 10:  out while:  before update peer last_backfill:
     backfills_in_flight: 1:00001e5d:::test4054.txt:head
     backfill_info: BackfillInfo(1:0005c6e0:::test18392.txt:head-1:067f54e0:::test19063.txt:head 510 version: 20"20000 objects {1:0005c6e0:::test18392.txt:head=20"18392,1:000a2176:::test11801.txt:head=20"11801,1:000d9c27:::test1033.txt:head=20"1033,,,,,,,,,,1:06738052:::test4708.txt:head=20"4708,1:067e20fb:::test10941.txt:head=20"10941})
     pending_backfill_updates: {1:00001e5d:::test4054.txt:head=0"0}
     last_backfill_started: 1:00001e5d:::test4054.txt:head
     add_to_stat: 1:00001e5d:::test4054.txt:head
     next_backfill_to_complete: 1:00001e5d:::test4054.txt:head
     new_last_backfill: MIN
     backfill_pos 1:0005c6e0:::test18392.txt:head
     peer: 2
         peer last_backfill: MIN
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)
         peer pg_info_t: 1.0( v 20"20000 (20"10000,20"20000] lb MIN local-lis/les=32/33 n=0 ec=3/3 lis/c=32/19 les/c/f=33/20/0 sis=32)         

    pending_backfill_updates < next_backfill_to_complete 
    才会更新 new_last_backfill 为pending_backfill_updates的元素

step : the end compute new_last_backfill
     new_last_backfill MIN
     peer: 2
         peer last_backfill: MIN
         new_last_backfill > pinfo.last_backfill : 0
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)         

处理了一个文件之后,开始第三次循环:

5: (PG::queue_recovery()+0x304) 
6: (PG::finish_recovery_op(hobject_t const&, bool)+0x13c) 
7: (PrimaryLogPG::on_global_recover(hobject_t const&, object_stat_sum_t const&, bool)+0x264)
8: (ReplicatedBackend::handle_push_reply(pg_shard_t, PushReplyOp const&, PushOp*)+0x660)
9: (ReplicatedBackend::do_push_reply(boost::intrusive_ptr<OpRequest>)+0xe8) 
10: (ReplicatedBackend::_handle_message(boost::intrusive_ptr<OpRequest>)+0x1a4)

如果文件比较大,数据和omap的kv实际大小,如果过大会分多批次进行PushOP,如果最终处理完成,就触发下一个文件
osd_recovery_max_chunk 8M
osd_recovery_max_omap_entries_per_chunk 8096

step1: 
     new_backfill: 0
     get_backfill_targets(): 2
     last_backfill_started: 1:00001e5d:::test4054.txt:head
     local pg_info_t: 1.0( v 20"20000 (20"9900,20"20000] local-lis/les=32/33 n=20000 ec=3/3 lis/c=32/19 les/c/f=33/20/0 sis=32)
     earliest_backfill(): MIN
step2: before update_range 
     backfill_info: BackfillInfo(1:00001e5d:::test4054.txt:head-1:067f54e0:::test19063.txt:head 510 version: 20"20000 objects {1:0005c6e0:::test18392.txt:head=20"18392,1:000a2176:::test11801.txt:head=20"11801,1:000d9c27:::test1033.txt:head=20"1033,,,,,,,,,,1:06738052:::test4708.txt:head=20"4708,1:067e20fb:::test10941.txt:head=20"10941})
     pending_backfill_updates: {1:00001e5d:::test4054.txt:head=0"0}
     backfills_in_flight: 
     peer: 2
         peer last_backfill: MIN
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)
         peer pg_info_t: 1.0( v 20"20000 (20"10000,20"20000] lb MIN local-lis/les=32/33 n=0 ec=3/3 lis/c=32/19 les/c/f=33/20/0 sis=32)     
step3: after update_range 
     local backfill_info: BackfillInfo(1:00001e5d:::test4054.txt:head-1:067f54e0:::test19063.txt:head 510 version: 20"20000 objects {1:0005c6e0:::test18392.txt:head=20"18392,1:000a2176:::test11801.txt:head=20"11801,1:000d9c27:::test1033.txt:head=20"1033,,,,,,,,,,1:06738052:::test4708.txt:head=20"4708,1:067e20fb:::test10941.txt:head=20"10941})
step4: befor while (ops < max)
     local backfill_info: BackfillInfo(1:0005c6e0:::test18392.txt:head-1:067f54e0:::test19063.txt:head 510 version: 20"20000 objects {1:0005c6e0:::test18392.txt:head=20"18392,1:000a2176:::test11801.txt:head=20"11801,1:000d9c27:::test1033.txt:head=20"1033,,,,,,,,,,1:06738052:::test4708.txt:head=20"4708,1:067e20fb:::test10941.txt:head=20"10941})
     earliest_peer_backfill(): MAX         
     peer 2
         peer last_backfill: MIN
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)
step inwhile:
     process file backfill_info.begin 1:0005c6e0:::test18392.txt:head
     check MAX
     need_ver_targs= []
     keep_ver_targs= []
     missing_targs= [2]
     skip_targs= []
step 9: in while after process one file
     lcoal backfill_info: BackfillInfo(1:000a2176:::test11801.txt:head-1:067f54e0:::test19063.txt:head 509 version: 20"20000 objects {1:000a2176:::test11801.txt:head=20"11801,1:000d9c27:::test1033.txt:head=20"1033,,,,,,,,,,1:06738052:::test4708.txt:head=20"4708,1:067e20fb:::test10941.txt:head=20"10941})
     last_backfill_started 1:0005c6e0:::test18392.txt:head
     peer 2
         peer last_backfill: MIN
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)         
step 10:  out while:  before update peer last_backfill:
     backfills_in_flight: 1:0005c6e0:::test18392.txt:head
     backfill_info: BackfillInfo(1:000a2176:::test11801.txt:head-1:067f54e0:::test19063.txt:head 509 version: 20"20000 objects {1:000a2176:::test11801.txt:head=20"11801,1:000d9c27:::test1033.txt:head=20"1033,,,,,,,,,,1:06738052:::test4708.txt:head=20"4708,1:067e20fb:::test10941.txt:head=20"10941})
     pending_backfill_updates: {1:00001e5d:::test4054.txt:head=0"0,1:0005c6e0:::test18392.txt:head=0"0}
     last_backfill_started: 1:0005c6e0:::test18392.txt:head
     add_to_stat: 1:0005c6e0:::test18392.txt:head
     next_backfill_to_complete: 1:0005c6e0:::test18392.txt:head
     new_last_backfill: MIN
     backfill_pos 1:000a2176:::test11801.txt:head
     peer: 2
         peer last_backfill: MIN
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)
         peer pg_info_t: 1.0( v 20"20000 (20"10000,20"20000] lb MIN local-lis/les=32/33 n=0 ec=3/3 lis/c=32/19 les/c/f=33/20/0 sis=32)
step : the end compute new_last_backfill
     new_last_backfill 1:00001e5d:::test4054.txt:head
     peer: 2
         peer last_backfill: MIN
         new_last_backfill > pinfo.last_backfill : 1
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)

第四次循环: 关注 peer last_backfill

step1: 
     new_backfill: 0
     get_backfill_targets(): 2
     last_backfill_started: 1:0005c6e0:::test18392.txt:head
     local pg_info_t: 1.0( v 20"20000 (20"9900,20"20000] local-lis/les=32/33 n=20000 ec=3/3 lis/c=32/19 les/c/f=33/20/0 sis=32)
     earliest_backfill(): 1:00001e5d:::test4054.txt:head

step2: before update_range 
     backfill_info: BackfillInfo(1:0005c6e0:::test18392.txt:head-1:067f54e0:::test19063.txt:head 509 version: 20"20000 objects {1:000a2176:::test11801.txt:head=20"11801,1:000d9c27:::test1033.txt:head=20"1033,,,,,,,,,,1:06738052:::test4708.txt:head=20"4708,1:067e20fb:::test10941.txt:head=20"10941})
     pending_backfill_updates: {1:0005c6e0:::test18392.txt:head=0"0}
     backfills_in_flight: 
     peer: 2
         peer last_backfill: 1:00001e5d:::test4054.txt:head
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)
         peer pg_info_t: 1.0( v 20"20000 (20"10000,20"20000] lb 1:00001e5d:::test4054.txt:head local-lis/les=32/33 n=1 ec=3/3 lis/c=32/19 les/c/f=33/20/0 sis=32)
step3: after update_range 
     local backfill_info: BackfillInfo(1:0005c6e0:::test18392.txt:head-1:067f54e0:::test19063.txt:head 509 version: 20"20000 objects {1:000a2176:::test11801.txt:head=20"11801,1:000d9c27:::test1033.txt:head=20"1033,,,,,,,,,,1:06738052:::test4708.txt:head=20"4708,1:067e20fb:::test10941.txt:head=20"10941})

step4: befor while (ops < max)
     local backfill_info: BackfillInfo(1:000a2176:::test11801.txt:head-1:067f54e0:::test19063.txt:head 509 version: 20"20000 objects {1:000a2176:::test11801.txt:head=20"11801,1:000d9c27:::test1033.txt:head=20"1033,,,,,,,,,,1:06738052:::test4708.txt:head=20"4708,1:067e20fb:::test10941.txt:head=20"10941})
     earliest_peer_backfill(): MAX         
     peer 2
         peer last_backfill: 1:00001e5d:::test4054.txt:head
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)
step inwhile:
     process file backfill_info.begin 1:000a2176:::test11801.txt:head
     check MAX
     need_ver_targs= []
     keep_ver_targs= []
     missing_targs= [2]
     skip_targs= []

step 9: in while after process one file
     lcoal backfill_info: BackfillInfo(1:000d9c27:::test1033.txt:head-1:067f54e0:::test19063.txt:head 508 version: 20"20000 objects {1:000d9c27:::test1033.txt:head=20"1033,,,,,,,,,,1:06738052:::test4708.txt:head=20"4708,1:067e20fb:::test10941.txt:head=20"10941})
     last_backfill_started 1:000a2176:::test11801.txt:head
     peer 2
         peer last_backfill: 1:00001e5d:::test4054.txt:head
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)         
step 10:  out while:  before update peer last_backfill:
     backfills_in_flight: 1:000a2176:::test11801.txt:head
     backfill_info: BackfillInfo(1:000d9c27:::test1033.txt:head-1:067f54e0:::test19063.txt:head 508 version: 20"20000 objects {1:000d9c27:::test1033.txt:head=20"1033,,,,,,,,,1:06738052:::test4708.txt:head=20"4708,1:067e20fb:::test10941.txt:head=20"10941})
     pending_backfill_updates: {1:0005c6e0:::test18392.txt:head=0"0,1:000a2176:::test11801.txt:head=0"0}
     last_backfill_started: 1:000a2176:::test11801.txt:head
     add_to_stat: 1:000a2176:::test11801.txt:head
     next_backfill_to_complete: 1:000a2176:::test11801.txt:head
     new_last_backfill: 1:00001e5d:::test4054.txt:head
     backfill_pos 1:000d9c27:::test1033.txt:head
     peer: 2
         peer last_backfill: 1:00001e5d:::test4054.txt:head
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)
         peer pg_info_t: 1.0( v 20"20000 (20"10000,20"20000] lb 1:00001e5d:::test4054.txt:head local-lis/les=32/33 n=1 ec=3/3 lis/c=32/19 les/c/f=33/20/0 sis=32)
step : the end compute new_last_backfill
     new_last_backfill 1:0005c6e0:::test18392.txt:head
     peer: 2
         peer last_backfill: 1:00001e5d:::test4054.txt:head
         new_last_backfill > pinfo.last_backfill : 1
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)

511 个数据,已经遍历空了 : bi 为空之后的循环:

step1: 
     new_backfill: 0
     get_backfill_targets(): 2
     last_backfill_started: 1:067e20fb:::test10941.txt:head
     local pg_info_t: 1.0( v 20"20000 (20"9900,20"20000] local-lis/les=32/33 n=20000 ec=3/3 lis/c=32/19 les/c/f=33/20/0 sis=32)
     earliest_backfill(): 1:06738052:::test4708.txt:head

step2: before update_range 
     backfill_info: BackfillInfo(1:067e20fb:::test10941.txt:head-1:067f54e0:::test19063.txt:head 0 version: 20"20000 objects)
     pending_backfill_updates: {1:067e20fb:::test10941.txt:head=0"0}
     backfills_in_flight: 
     peer: 2
         peer last_backfill: 1:06738052:::test4708.txt:head
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)
         peer pg_info_t: 1.0( v 20"20000 (20"10000,20"20000] lb 1:06738052:::test4708.txt:head local-lis/les=32/33 n=510 ec=3/3 lis/c=32/19 les/c/f=33/20/0 sis=32)
step3: after update_range 
     local backfill_info: BackfillInfo(1:067e20fb:::test10941.txt:head-1:067f54e0:::test19063.txt:head 0 version: 20"20000 objects)

step4: befor while (ops < max)
     local backfill_info: BackfillInfo(1:067f54e0:::test19063.txt:head-1:067f54e0:::test19063.txt:head 0 version: 20"20000 objects)
     earliest_peer_backfill(): MAX
     peer 2
         peer last_backfill: 1:06738052:::test4708.txt:head
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)

step5: in while befor update_range 1
    local backfill_info: BackfillInfo(1:067f54e0:::test19063.txt:head-1:067f54e0:::test19063.txt:head 0 version: 20"20000 objects)

step5: in while befor update_range 2
     local backfill_info: BackfillInfo(1:067f54e0:::test19063.txt:head-MAX 0 version: 0"0 objects)
step7: in while after update_range 3
     local backfill_info: BackfillInfo(1:067f54e0:::test19063.txt:head-1:0c924fff:::test14189.txt:head 512 version: 20"20000 objects {1:067f54e0:::test19063.txt:head=20"19063,1:0680e042:::test10674.txt:head=20"10674,,,,,,,,,,1:0c8ef72a:::test19518.txt:head=20"19518})

step inwhile:
     process file backfill_info.begin 1:067f54e0:::test19063.txt:head
     check MAX
     need_ver_targs= []
     keep_ver_targs= []
     missing_targs= [2]
     skip_targs= []

step 9: in while after process one file
     lcoal backfill_info: BackfillInfo(1:0680e042:::test10674.txt:head-1:0c924fff:::test14189.txt:head 511 version: 20"20000 objects {1:0680e042:::test10674.txt:head=20"10674,,,,,,,,,,1:0c8ef72a:::test19518.txt:head=20"19518})
     last_backfill_started 1:067f54e0:::test19063.txt:head
     peer 2
         peer last_backfill: 1:06738052:::test4708.txt:head
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)

step 10:  out while:  before update peer last_backfill:
     backfills_in_flight: 1:067f54e0:::test19063.txt:head
     backfill_info: BackfillInfo(1:0680e042:::test10674.txt:head-1:0c924fff:::test14189.txt:head 511 version: 20"20000 objects {1:0680e042:::test10674.txt:head=20"10674,,,,,,,,,,1:0c8ef72a:::test19518.txt:head=20"19518})
     pending_backfill_updates: {1:067e20fb:::test10941.txt:head=0"0,1:067f54e0:::test19063.txt:head=0"0}
     last_backfill_started: 1:067f54e0:::test19063.txt:head
     add_to_stat: 1:067f54e0:::test19063.txt:head
     next_backfill_to_complete: 1:067f54e0:::test19063.txt:head
     new_last_backfill: 1:06738052:::test4708.txt:head
     backfill_pos 1:0680e042:::test10674.txt:head
     peer: 2
         peer last_backfill: 1:06738052:::test4708.txt:head
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)
         peer pg_info_t: 1.0( v 20"20000 (20"10000,20"20000] lb 1:06738052:::test4708.txt:head local-lis/les=32/33 n=510 ec=3/3 lis/c=32/19 les/c/f=33/20/0 sis=32)
step : the end compute new_last_backfill
     new_last_backfill 1:067e20fb:::test10941.txt:head
     peer: 2
         peer last_backfill: 1:06738052:::test4708.txt:head
         new_last_backfill > pinfo.last_backfill : 1
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)

在backfill的过程中,写入新的数据,查看双边osd的行为,如何保障backfill的一致性

每次写入一个文件 projected_last_update 版本就更新为 epoch + version_t, 39"20001,39"20002; update_range 需要和最新更新的文件ptlog 进行状态merge
实际写入aaa.txt.1-aaa.txt.7 7个文件, 只有aaa.txt.3 在osd2中落盘了, 因为osd 2 last_backfill 是: 1:18b5b223:::test10094.txt:head
按照hash计算, 只有aaa.txt.3 的hash reverse小于 osd2的last_backfill,只有它落盘了。

aaa.txt.1 0x6a06655f  0xfaa66056
aaa.txt.2 0xeea16c69  0x96368577   
aaa.txt.3 0xbc5572b0  0x0d4eaa3d   
aaa.txt.4 0x361a48ac  0x3512586c   
aaa.txt.5 0xc1a4b22a  0x544d2583   
aaa.txt.6 0xbdc7c15e  0x7a83e3bd   
aaa.txt.7 0x6e517bcb  0xd3de8a76   

test10094.txt 0xc44dad18  0x18b5b223   

./bin/ceph-kvstore-tool bluestore-kv ./dev/osd2 list O
./bin/ceph-kvstore-tool bluestore-kv ./dev/osd2 dump O
使用工具查看osd2的数据 过滤aaa.txt

O       %7f%80%00%00%00%00%00%00%01%0dN%aa%3d%21aaa.txt.3%21%3d%ff%ff%ff%ff%ff%ff%ff%fe%ff%ff%ff%ff%ff%ff%ff%ffo

当恢复的数据量比较多时,停止backfill,然后再次写入7个bbb.txt.1-7 查看osd2的落盘情况:
new_last_backfill 1:f85b51eb:::test6800.txt:head

bbb.txt.1 0x49065466  0x662a6092   
bbb.txt.2 0x39ae3912  0x489c759c   
bbb.txt.3 0xba7464b2  0x4d262e5d   
bbb.txt.4 0xc99ba421  0x8425d993   
bbb.txt.5 0x5fbf94b7  0xed29fdfa   
bbb.txt.6 0x55dd7d8b  0xd1bebbaa   
bbb.txt.7 0xd828a671  0x8e65141b   

osd.2 再次过滤aaa.txt 除了aaa.txt.1 不存在,其他都已经存在
osd.2 过滤bbb.txt 7个文件都存在, 说明了last_backfill 在backfill中的作用, 有点过于严谨了

step1: 
     new_backfill: 0
     get_backfill_targets(): 2
     last_backfill_started: 1:18b725a9:::test10814.txt:head
     local pg_info_t: 1.0( v 39"20001 (20"10000,39"20001] local-lis/les=32/33 n=20001 ec=3/3 lis/c=32/19 les/c/f=33/20/0 sis=32)
     earliest_backfill(): 1:18b5b223:::test10094.txt:head

step2: before update_range 
     backfill_info: BackfillInfo(1:18b725a9:::test10814.txt:head-1:19fba18e:::test12743.txt:head 88 version: 20"20000 objects {1:18b7eb0c:::test15864.txt:head=20"15864,,,,,,,,,,1:19f875c7:::test17364.txt:head=20"17364})
     pending_backfill_updates: {1:18b725a9:::test10814.txt:head=0"0}
     backfills_in_flight: 
     peer: 2
         peer last_backfill: 1:18b5b223:::test10094.txt:head
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)
         peer pg_info_t: 1.0( v 39"20001 (20"10000,39"20001] lb 1:18b5b223:::test10094.txt:head local-lis/les=32/33 n=1958 ec=3/3 lis/c=32/19 les/c/f=33/20/0 sis=32)

2023-11-23T18:30:37.812+0800 ffff73bde9c0 10 update_range: bi is old, (20"20000) can be updated with log to projected_last_update 39"20001
2023-11-23T18:30:37.812+0800 ffff73bde9c0 10 scanning pg log first
2023-11-23T18:30:37.812+0800 ffff73bde9c0 10 operator(): updating from version 39"20001
2023-11-23T18:30:37.812+0800 ffff73bde9c0 10 scanning projected log
2023-11-23T18:30:37.812+0800 ffff73bde9c0 10 
每次有更新操作, bi的 version 会比 log_tail 大,但是比projected_last_update 要小,需要进行更新
这里的更新只是从pg log中,从后往前查找到 上次bi version之后的操作,然后再把所有的操作应用到 bi的object中

step3: after update_range 
     local backfill_info: BackfillInfo(1:18b725a9:::test10814.txt:head-1:19fba18e:::test12743.txt:head 88 version: 39"20001 objects {1:18b7eb0c:::test15864.txt:head=20"15864,,,,,,,,,,1:19f875c7:::test17364.txt:head=20"17364})

step4: befor while (ops < max)
     local backfill_info: BackfillInfo(1:18b7eb0c:::test15864.txt:head-1:19fba18e:::test12743.txt:head 88 version: 39"20001 objects {1:18b7eb0c:::test15864.txt:head=20"15864,,,,,,,,,,1:19f875c7:::test17364.txt:head=20"17364})
     earliest_peer_backfill(): MAX
     peer 2
         peer last_backfill: 1:18b5b223:::test10094.txt:head
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)

倒数第二次

step1: 
     new_backfill: 0
     get_backfill_targets(): 2
     last_backfill_started: 1:fffc981f:::test18600.txt:head
     local pg_info_t: 1.0( v 64"20014 (20"10000,64"20014] local-lis/les=69/70 n=20014 ec=3/3 lis/c=69/19 les/c/f=70/20/0 sis=69)
     earliest_backfill(): 1:fffbd9ba:::test8865.txt:head
step2: before update_range 
     backfill_info: BackfillInfo(1:fffc981f:::test18600.txt:head-MAX 0 version: 64"20014 objects)
     pending_backfill_updates: {1:fffc981f:::test18600.txt:head=0"0}
     backfills_in_flight: 
     peer: 2
         peer last_backfill: 1:fffbd9ba:::test8865.txt:head
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)
         peer pg_info_t: 1.0( v 64"20014 (20"10000,64"20014] lb 1:fffbd9ba:::test8865.txt:head local-lis/les=69/70 n=20013 ec=3/3 lis/c=69/19 les/c/f=70/20/0 sis=69)
2023-11-24T10:34:05.935+0800 ffff7c2e99c0 10 update_range: bi is current 64"20014 project_last_update 64"20014
step3: after update_range 
     local backfill_info: BackfillInfo(1:fffc981f:::test18600.txt:head-MAX 0 version: 64"20014 objects)

step4: befor while (ops < max)
     local backfill_info: BackfillInfo(MAX-MAX 0 version: 64"20014 objects)
     earliest_peer_backfill(): MAX
     peer 2
         peer last_backfill: 1:fffbd9ba:::test8865.txt:head
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)
2023-11-24T10:34:05.935+0800 ffff7c2e99c0 10  reached end for both local and all peers

step 10:  out while:  before update peer last_backfill:
     backfills_in_flight: 
     backfill_info: BackfillInfo(MAX-MAX 0 version: 64"20014 objects)
     pending_backfill_updates: {1:fffc981f:::test18600.txt:head=0"0}
     last_backfill_started: 1:fffc981f:::test18600.txt:head
     add_to_stat: 
     next_backfill_to_complete: MAX
     new_last_backfill: 1:fffbd9ba:::test8865.txt:head
     backfill_pos MAX
     peer: 2
         peer last_backfill: 1:fffbd9ba:::test8865.txt:head
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)
         peer pg_info_t: 1.0( v 64"20014 (20"10000,64"20014] lb 1:fffbd9ba:::test8865.txt:head local-lis/les=69/70 n=20013 ec=3/3 lis/c=69/19 les/c/f=70/20/0 sis=69)

step : the end compute new_last_backfill
     new_last_backfill MAX
     peer: 2
         peer last_backfill: 1:fffbd9ba:::test8865.txt:head
         new_last_backfill > pinfo.last_backfill : 1
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)

2023-11-24T10:34:05.935+0800 ffff7c2e99c0 10 update peer lastbackfill MAX
发送pg_backfill(finish)

最终backfill结束:

5: (PG::queue_recovery()+0x304) [0xaaaaec7934d4]
6: (PG::finish_recovery_op(hobject_t const&, bool)+0x13c) [0xaaaaec794170]
7: (PrimaryLogPG::do_backfill(boost::intrusive_ptr<OpRequest>)+0x3e4) [0xaaaaec842de8]
step1: 
     new_backfill: 0
     get_backfill_targets(): 2
     last_backfill_started: MAX
     local pg_info_t: 1.0( v 64"20014 (20"10000,64"20014] local-lis/les=69/70 n=20014 ec=3/3 lis/c=69/19 les/c/f=70/20/0 sis=69)
     earliest_backfill(): MAX

step2: before update_range 
     backfill_info: BackfillInfo(MAX-MAX 0 version: 64"20014 objects)
     pending_backfill_updates: {}
     backfills_in_flight: 
     peer: 2
         peer last_backfill: MAX
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)
         peer pg_info_t: 1.0( v 64"20014 (20"10000,64"20014] local-lis/les=69/70 n=20014 ec=3/3 lis/c=69/19 les/c/f=70/20/0 sis=69)

step3: after update_range 
     local backfill_info: BackfillInfo(MAX-MAX 0 version: 64"20014 objects)

step4: befor while (ops < max)
     local backfill_info: BackfillInfo(MAX-MAX 0 version: 64"20014 objects)
     earliest_peer_backfill(): MAX
     peer 2
         peer last_backfill: MAX
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)
2023-11-24T10:34:06.025+0800 ffff782e19c0 10  reached end for both local and all peers

step 10:  out while:  before update peer last_backfill:
     backfills_in_flight: 
     backfill_info: BackfillInfo(MAX-MAX 0 version: 64"20014 objects)
     pending_backfill_updates: {}
     last_backfill_started: MAX
     add_to_stat: 
     next_backfill_to_complete: MAX
     new_last_backfill: MAX
     backfill_pos MAX
     peer: 2
         peer last_backfill: MAX
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)
         peer pg_info_t: 1.0( v 64"20014 (20"10000,64"20014] local-lis/les=69/70 n=20014 ec=3/3 lis/c=69/19 les/c/f=70/20/0 sis=69)

step : the end compute new_last_backfill
     new_last_backfill MAX
     peer: 2
         peer last_backfill: MAX
         new_last_backfill > pinfo.last_backfill : 0
         pbi: BackfillInfo(MAX-MAX 0 version: 0"0 objects)

相关文章

网友评论

      本文标题:ceph backfill

      本文链接:https://www.haomeiwen.com/subject/vucgwdtx.html