当前位置：首页 > news >正文

消息队列-kafka-消息发送流程(源码跟踪) 与消息可靠性

news 2026/2/9 5:03:15

官方网址

源码：https://kafka.apache.org/downloads
快速开始：https://kafka.apache.org/documentation/#gettingStarted
springcloud整合

发送消息流程

在这里插入图片描述
主线程：主线程只负责组织消息，如果是同步发送会阻塞，如果是异步发送需要传入一个回调函数。
Map集合：存储了主线程的消息。
Sender线程：真正的发送其实是sender去发送到broker中。

源码阅读

1 首先打开Producer.send()可以看到里面的内容

// 返回值是一个 Future 参数为ProducerRecord
Future<RecordMetadata> send(ProducerRecord<K, V> record);

// ProducerRecord定义了这些信息
// 主题
private final String topic;
// 分区
private final Integer partition;
// header
private final Headers headers;
private final K key;
private final V value;
// 时间戳
private final Long timestamp;

2 发送之前的前置处理

public Future<RecordMetadata> send(ProducerRecord<K, V> record, Callback callback) {// intercept the record, which can be potentially modified; this method does not throw exceptions// 这里给开发者提供了前置处理的勾子ProducerRecord<K, V> interceptedRecord = this.interceptors.onSend(record);// 我们最终发送的是经过处理后的消息 并且如果是异步发送会有callback 这个是用户定义的return doSend(interceptedRecord, callback);}

3 进入真正的发送逻辑Future doSend()

由于是网络通信，所以我们要序列化，在这个函数里面就做了序列化的内容。

try {serializedKey = keySerializer.serialize(record.topic(), record.headers(), record.key());} catch (ClassCastException cce) {throw new SerializationException("Can't convert key of class " + record.key().getClass().getName() +" to class " + producerConfig.getClass(ProducerConfig.KEY_SERIALIZER_CLASS_CONFIG).getName() +" specified in key.serializer", cce);}byte[] serializedValue;try {serializedValue = valueSerializer.serialize(record.topic(), record.headers(), record.value());} catch (ClassCastException cce) {throw new SerializationException("Can't convert value of class " + record.value().getClass().getName() +" to class " + producerConfig.getClass(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG).getName() +" specified in value.serializer", cce);}

然后我们获取分区

// 然后这里又是一个策略者模式 也是由用户可以配置的  DefaultPartitioner UniformStickyPartitioner RoundRobinPartitioner 提供了这样三个分区器
private int partition(ProducerRecord<K, V> record, byte[] serializedKey, byte[] serializedValue, Cluster cluster) {Integer partition = record.partition();return partition != null ?partition :partitioner.partition(record.topic(), record.key(), serializedKey, record.value(), serializedValue, cluster);
}

4 到了我们的RecordAccumulator，也就是先由主线程发送到了RecordAccumulator

// 也就是对图中的Map集合
RecordAccumulator.RecordAppendResult result = accumulator.append(tp, timestamp, serializedKey,serializedValue, headers, interceptCallback, remainingWaitMs, true, nowMs);

我们发现里面是用一个MAP存储的一个分区和ProducerBatch 是讲这个消息写到内存里面MemoryRecordsBuilder 通过这个进行写入

// 可以看到是一个链表实现的双向队列，也就是消息会按append的顺序写到 内存记录中去
private final ConcurrentMap<TopicPartition, Deque<ProducerBatch>> batches;

5 接着我们看,我们append了以后，会有一个判断去唤醒sender线程，见下面的注释

// 如果说哦我们当前的 这个batch满了或者 我们创建了一个新的batch 这个时候唤醒 sender线程去发送数据
if (result.batchIsFull || result.newBatchCreated) {log.trace("Waking up the sender since topic {} partition {} is either full or getting a new batch", record.topic(), partition);// 唤醒sender 去发送数据this.sender.wakeup();}

// 实现了Runnable 所以我们去看一下RUN方法的逻辑
public class Sender implements Runnable

好上来就是一个循环

while (running) {try {runOnce();} catch (Exception e) {log.error("Uncaught error in kafka producer I/O thread: ", e);}
}

接着进入runOnece方法，直接看核心逻辑

// 从RecordAccumulator 拿数据 然后发送
Map<Integer, List<ProducerBatch>> batches = this.accumulator.drain(cluster, result.readyNodes, this.maxRequestSize, now);addToInflightBatches(batches);
// 中间省去了非核心逻辑
sendProduceRequests(batches, now);

如果继续跟踪的话最终是走到了selector.send()里面：

Send send = request.toSend(destination, header);InFlightRequest inFlightRequest = new InFlightRequest(clientRequest,header,isInternalRequest,request,send,now);this.inFlightRequests.add(inFlightRequest);selector.send(send);

6 接着我们就要看返回逻辑了,可以看到在sendRequest里面sendProduceRequest方法是通过传入了一个回调函数处理返回的。

RequestCompletionHandler callback = new RequestCompletionHandler() {public void onComplete(ClientResponse response) {handleProduceResponse(response, recordsByPartition, time.milliseconds());}};

// 如果有返回
if (response.hasResponse()) {ProduceResponse produceResponse = (ProduceResponse) response.responseBody();for (Map.Entry<TopicPartition, ProduceResponse.PartitionResponse> entry : produceResponse.responses().entrySet()) {TopicPartition tp = entry.getKey();ProduceResponse.PartitionResponse partResp = entry.getValue();ProducerBatch batch = batches.get(tp);completeBatch(batch, partResp, correlationId, now, receivedTimeMs + produceResponse.throttleTimeMs());}this.sensors.recordLatency(response.destination(), response.requestLatencyMs());}

追踪到ProducerBatch

if (this.finalState.compareAndSet(null, tryFinalState)) {completeFutureAndFireCallbacks(baseOffset, logAppendTime, exception);return true;}

private void completeFutureAndFireCallbacks(long baseOffset, long logAppendTime, RuntimeException exception) {// Set the future before invoking the callbacks as we rely on its state for the `onCompletion` callproduceFuture.set(baseOffset, logAppendTime, exception);// execute callbacksfor (Thunk thunk : thunks) {try {if (exception == null) {RecordMetadata metadata = thunk.future.value();if (thunk.callback != null)thunk.callback.onCompletion(metadata, null);} else {if (thunk.callback != null)thunk.callback.onCompletion(null, exception);}} catch (Exception e) {log.error("Error executing user-provided callback on message for topic-partition '{}'", topicPartition, e);}}produceFuture.done();}

Thunk 这个其实就是我们在Append的时候的回调：
在这里插入图片描述
至此整个流程就完成了，从发送消息，到响应后回调我们的函数。

消息可靠性

// 所有消费者的配置都在ProducerConfig 里面
public static final String ACKS_CONFIG = "acks";

acks = 0：异步形式，单向发送，不会等待 broker 的响应
acks = 1：主分区保存成功，然后就响应了客户端，并不保证所有的副本分区保存成功
acks = all 或 -1：等待 broker 的响应，然后 broker 等待副本分区的响应，总之数据落地到所有的分区后，才能给到producer 一个响应

在可靠性的保证下，假设一些故障：

Broker 收到消息后，同步 ISR 异常：只要在 -1 的情况下，其实不会造成消息的丢失，因为有重试机制
Broker 收到消息，并同步 ISR 成功，但是响应超时：只要在 -1 的情况下，其实不会造成消息的丢失，因为有重试机制

可靠性能保证哪些，不能保障哪些？

保证了消息不会丢失
不保证消息一定不会重复（消息有重复的概率，需要消费者有幂等性控制机制）

消息队列-kafka-消息发送流程(源码跟踪) 与消息可靠性

官方网址

发送消息流程

源码阅读

消息可靠性

相关文章：

消息队列-kafka-消息发送流程(源码跟踪) 与消息可靠性

机器学习笔记计算机视觉中的测距任务常见技术路线

云计算 3月8号（wordpress的搭建）

【CSS】(浮动定位)易忘知识点汇总

Vitual Box虚拟机打开后，键盘鼠标失效

宠物空气净化器值得入手吗？选购宠物空气净化器关注哪些方面？

前端发起请求，后端模型需处理很久，怎样设置前端直接完成请求响应，后端计算完在返回结果给前端?

DDD领域驱动设计

网络编程第1天

Springboot--整合Logback 日志框架（Maven）

【考研数学】李林《880》vs 李永乐《660》完美使用搭配

Java面试之消息中间件

网工学习 DHCP配置-接口模式

【GO】语言特点 | Go和Java的对比

USB2.0设备检测过程信号分析

Go语言物联网开发安科瑞ADW300/4G电能表数据上传mqtt平台-电表接线到传输数据完整流程

LabVIEW质谱仪开发与升级

SwiftUI之DragGesture

主网NFT的发布合约

分享2024年在家轻松兼职赚钱的5个副业

日语AI面试高效通关秘籍：专业解读与青柚面试智能助攻

调用支付宝接口响应40004 SYSTEM_ERROR问题排查

Unity3D中Gfx.WaitForPresent优化方案

鸿蒙中用HarmonyOS SDK应用服务 HarmonyOS5开发一个医院挂号小程序

STM32标准库-DMA直接存储器存取

数据链路层的主要功能是什么

.Net Framework 4/C# 关键字（非常用，持续更新...）

关键领域软件测试的突围之路：如何破解安全与效率的平衡难题

Redis的发布订阅模式与专业的 MQ（如 Kafka, RabbitMQ）相比，优缺点是什么？适用于哪些场景？

JAVA后端开发——多租户