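Background: business forecasts indicate that the data volume and I/O of several topics used by one service will become very large, so more brokers need to be added to carry the load. At that point there are two options.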
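The migration steps are as follows:

1. First check the broker information of the Kafka cluster and confirm that the brokers you are migrating to have already joined the cluster. This test environment has seven brokers, 0 through 6; nodes 1, 2 and 3 also run ZooKeeper.

    /data/zookeeper/bin/zkCli.sh
    [zk: localhost:2181(CONNECTED) 0] ls /brokers/ids
    [0, 1, 2, 3, 4, 5, 6]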
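The check in step 1 can also be scripted. The following is a minimal bash sketch, assuming the ID list has the bracketed form printed by `ls /brokers/ids` above; `brokers_registered` is a hypothetical helper name, not part of Kafka or ZooKeeper.

```shell
#!/usr/bin/env bash
# brokers_registered IDS_LINE ID...
# IDS_LINE is the list printed by `ls /brokers/ids`, e.g. "[0, 1, 2, 3, 4, 5, 6]".
# Returns 0 only if every requested broker ID appears in that list.
brokers_registered() {
  local ids_line="$1"
  shift
  # Keep only digits and commas, then wrap in commas so each ID is matched whole
  # (so that 6 does not match 16).
  local ids=",${ids_line//[^0-9,]/},"
  local id
  for id in "$@"; do
    case "$ids" in
      *",$id,"*) ;;
      *) echo "broker $id is not registered" >&2; return 1 ;;
    esac
  done
  return 0
}
```

For example, `brokers_registered "[0, 1, 2, 3, 4, 5, 6]" 4 5 6` succeeds in this environment, and it would fail if broker 5 had not joined yet.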
2. Check the ISR of the topic's partitions. Make sure the Leader, Replicas and Isr values look normal and that every replica is in the in-sync set.

    root@kafka-2:~# /data/kafka/bin/kafka-topics.sh --describe --topic my-topic --bootstrap-server 192.168.1.231:9092
    Topic: my-topic TopicId: 1FtXs5m8TMyjC18Qjs3ttg PartitionCount: 3 ReplicationFactor: 3 Configs: segment.bytes=1073741824
    Topic: my-topic Partition: 0 Leader: 0 Replicas: 0,1,2 Isr: 0,1,2
    Topic: my-topic Partition: 1 Leader: 1 Replicas: 1,2,0 Isr: 0,1,2
    Topic: my-topic Partition: 2 Leader: 1 Replicas: 2,1,0 Isr: 1,2,0
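With many partitions, comparing Replicas and Isr by eye gets tedious. The sketch below is one way to automate it, assuming the describe output keeps the `Partition: ... Replicas: ... Isr: ...` layout shown above; `isr_healthy` is a hypothetical helper name.

```shell
#!/usr/bin/env bash
# isr_healthy: reads `kafka-topics.sh --describe` output on stdin.
# Exits 0 if every partition line lists as many ISR members as replicas,
# otherwise prints the offending partitions and exits 1.
isr_healthy() {
  awk '
    /Partition:/ && /Replicas:/ && /Isr:/ {
      for (i = 1; i < NF; i++) {
        if ($i == "Partition:") p = $(i + 1)
        if ($i == "Replicas:")  r = $(i + 1)
        if ($i == "Isr:")       s = $(i + 1)
      }
      # Compare the number of comma-separated broker IDs in each list.
      if (split(r, a, ",") != split(s, b, ",")) {
        printf "partition %s: ISR %s does not match replicas %s\n", p, s, r
        bad = 1
      }
    }
    END { exit bad + 0 }
  '
}
```

It could be used by piping the describe command from step 2 into it, for example `/data/kafka/bin/kafka-topics.sh --describe --topic my-topic --bootstrap-server 192.168.1.231:9092 | isr_healthy`.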
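3. If the data volume is very large, lower the topic's retention time in advance so that expired segments are deleted and less data has to be copied, which speeds up the migration. The example sets retention.ms to 3600000 (one hour).

    /data/kafka/bin/kafka-configs.sh --bootstrap-server 192.168.1.231:9092 --alter --entity-type topics --entity-name my-topic --add-config retention.ms=3600000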
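Because retention.ms is expressed in milliseconds, it is easy to slip by a factor of ten when typing the value. A small arithmetic sketch; `hours_to_ms` is a hypothetical helper name.

```shell
#!/usr/bin/env bash
# hours_to_ms HOURS: print the retention.ms value for a retention of HOURS hours.
hours_to_ms() {
  echo $(( $1 * 60 * 60 * 1000 ))
}
```

`hours_to_ms 1` gives the 3600000 used above, and `hours_to_ms 100` gives 360000000.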
4. Create the move file that lists the topics to migrate (it is referenced as topics-to-move.json in the next step).

    {
      "topics": [
        { "topic": "my-topic" }
      ]
    }
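When several topics have to move, the file can be generated instead of hand-written. The sketch below is an illustration only: `topics_to_move_json` is a hypothetical helper, and it adds the `"version":1` field that the Kafka documentation's example file carries, which the file above omits.

```shell
#!/usr/bin/env bash
# topics_to_move_json TOPIC...: print a topics-to-move document for the given topics.
# Kafka topic names may only contain letters, digits, '.', '_' and '-',
# so no JSON escaping is attempted here.
topics_to_move_json() {
  local sep="" t
  printf '{"topics":['
  for t in "$@"; do
    printf '%s{"topic":"%s"}' "$sep" "$t"
    sep=","
  done
  printf '],"version":1}\n'
}
```

For the single topic in this walkthrough: `topics_to_move_json my-topic > topics-to-move.json`.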
5. Generate the reassignment plan with --generate, naming the target brokers in --broker-list. The command prints two sections, the current assignment and the proposed plan; piping it through grep Proposed -A1 | grep -v Proposed drops the Current section and keeps only the proposed plan.

    /data/kafka/bin/kafka-reassign-partitions.sh --bootstrap-server 192.168.1.231:9092 --topics-to-move-json-file topics-to-move.json --broker-list "4,5,6" --generate | grep Proposed -A1 | grep -v Proposed > move.json
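The grep pipeline in step 5 discards the current assignment, which is worth keeping as a rollback file. A sketch of an alternative that can extract either section, assuming each section is a heading line that starts with "Current" or "Proposed" followed by a single line of JSON; `section_json` is a hypothetical helper name.

```shell
#!/usr/bin/env bash
# section_json HEADING_PREFIX: reads --generate output on stdin and prints the
# first non-empty line after the heading that starts with HEADING_PREFIX.
section_json() {
  awk -v h="$1" '
    index($0, h) == 1 { want = 1; next }   # heading line found
    want && NF        { print; exit }      # first non-empty line after it
  '
}
```

Saving the --generate output to a file first (say generate.out, a name chosen here for illustration) allows both `section_json Proposed < generate.out > move.json` and `section_json Current < generate.out > rollback.json`.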
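6. Execute the partition reassignment by running the tool in --execute mode with the plan generated in step 5. The tool first prints the current assignment; keep a copy, because it is the file to pass back if the move has to be rolled back.

    root@kafka-2:~# /data/kafka/bin/kafka-reassign-partitions.sh
    Current partition replica assignment
    {"version":1,"partitions":[{"topic":"my-topic","partition":0,"replicas":[0,1,2],"log_dirs":["any","any","any"]},{"topic":"my-topic","partition":1,"replicas":[1,2,0],"log_dirs":["any","any","any"]},{"topic":"my-topic","partition":2,"replicas":[2,1,0],"log_dirs":["any","any","any"]}]}
    Save this to use as the --reassignment-json-file option during rollback
    Successfully started partition reassignments for my-topic-0,my-topic-1,my-topic-2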
7. Verify the reassignment status.

    root@kafka-2:~# /data/kafka/bin/kafka-reassign-partitions.sh --bootstrap-server 192.168.1.231:9092 --reassignment-json-file mytopic-reassignment-plan.json --verify
    Status of partition reassignment:
    There is no active reassignment of partition my-topic-0, but replica set is 5,4,6 rather than 0,1,2.
    There is no active reassignment of partition my-topic-1, but replica set is 6,5,4 rather than 1,2,0.
    There is no active reassignment of partition my-topic-2, but replica set is 4,6,5 rather than 2,1,0.
    Clearing broker-level throttles on brokers 0,5,1,6,2,3,4
    Clearing topic-level throttles on topic my-topic

No reassignment is active any more, and the replicas now sit on brokers 4, 5 and 6. The "rather than 0,1,2" wording suggests that mytopic-reassignment-plan.json holds the original assignment rather than the new plan, so the tool reports a mismatch with the file instead of a plain completion message.
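On large topics the move takes a while, so --verify usually has to be repeated. The sketch below polls any command until its output stops reporting partitions in progress. It assumes the tool prints the phrase "still in progress" for unfinished partitions, which may differ between Kafka versions; `wait_until_done` is a hypothetical helper name.

```shell
#!/usr/bin/env bash
# wait_until_done INTERVAL CMD [ARG...]
# Runs CMD repeatedly, sleeping INTERVAL seconds between attempts, until its
# output no longer contains "still in progress"; then prints the final output.
# Returns 1 if CMD itself fails.
wait_until_done() {
  local interval="$1" out
  shift
  while :; do
    out=$("$@") || return 1
    case "$out" in
      *"still in progress"*) sleep "$interval" ;;
      *) printf '%s\n' "$out"; return 0 ;;
    esac
  done
}
```

A possible invocation, reusing the command from step 7: `wait_until_done 10 /data/kafka/bin/kafka-reassign-partitions.sh --bootstrap-server 192.168.1.231:9092 --reassignment-json-file mytopic-reassignment-plan.json --verify`.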
8. View the topic details. All leaders and replicas are now on brokers 4, 5 and 6, and every replica is in the ISR.

    root@kafka-2:~# /data/kafka/bin/kafka-topics.sh --describe --topic my-topic --bootstrap-server 192.168.1.231:9092
    Topic: my-topic TopicId: 1FtXs5m8TMyjC18Qjs3ttg PartitionCount: 3 ReplicationFactor: 3 Configs: segment.bytes=1073741824
    Topic: my-topic Partition: 0 Leader: 5 Replicas: 5,4,6 Isr: 5,6,4
    Topic: my-topic Partition: 1 Leader: 6 Replicas: 6,5,4 Isr: 5,6,4
    Topic: my-topic Partition: 2 Leader: 4 Replicas: 4,6,5 Isr: 5,6,4
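9. Restore the topic's retention time (only if it was lowered in step 3). kafka-configs.sh has no --entity-default-config option and rejects it with "entity-default-config is not a recognized option"; select the topic with --entity-type topics --entity-name and set the value with --add-config.

    root@kafka-2:~# /data/kafka/bin/kafka-configs.sh --bootstrap-server 192.168.1.231:9092 --alter --entity-type topics --entity-name my-topic --add-config retention.ms=360000000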