Nested List Operations in MongoDB

news 2026/5/21 12:00:09

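Introduction

One thing that sets MongoDB apart from MySQL is its support for nested documents. In a recent project, for example, we built a dialogue feature on top of audio transcription results: one audio file corresponds to multiple rounds of dialogue, and both the audio data and the dialogue records are stored in MongoDB documents. The documents in the collection look roughly like this: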

    {
        "_id": 23424234234324234,
        "audioId": 2689944,
        "contextId": "cht000d24ab@dx187d1168a449a4b540",
        "dialogues": [
            {"ask": "Is today Sunday?", "answer": "Yes", "createTime": 1697356990966},
            {"ask": "You keep it up too", "answer": "Let's go!", "createTime": 1697378011483},
            {"ask": "See you next week", "answer": "Bye!", "createTime": 1697378072063}
        ]
    }
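The Java snippets in this article read and write this structure through two classes, AITextResult and AIDialoguesResult, that are never shown. The following is a minimal sketch of what they might look like, inferred only from the getters and setters those snippets call. The field types are assumptions, and the real classes may carry more fields, annotations, or a builder such as buildAiTextResult.

```java
import java.util.ArrayList;
import java.util.List;

/** One round of dialogue; fields mirror the keys of each element in "dialogues". */
class AIDialoguesResult {
    private String ask;
    private String answer;
    private Long createTime; // epoch milliseconds (assumed from the sample values)

    public String getAsk() { return ask; }
    public void setAsk(String ask) { this.ask = ask; }
    public String getAnswer() { return answer; }
    public void setAnswer(String answer) { this.answer = answer; }
    public Long getCreateTime() { return createTime; }
    public void setCreateTime(Long createTime) { this.createTime = createTime; }
}

/** The top-level document: one audio file plus its nested list of dialogue rounds. */
class AITextResult {
    private Long audioId;
    private String contextId;
    private List<AIDialoguesResult> dialogues = new ArrayList<>();

    public Long getAudioId() { return audioId; }
    public void setAudioId(Long audioId) { this.audioId = audioId; }
    public String getContextId() { return contextId; }
    public void setContextId(String contextId) { this.contextId = contextId; }
    public List<AIDialoguesResult> getDialogues() { return dialogues; }
    public void setDialogues(List<AIDialoguesResult> dialogues) { this.dialogues = dialogues; }
}
```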

The sections below walk through a few simple operations used in that project.

Querying the Length of a Nested List

    // Uses org.bson.Document, java.util.Arrays and java.util.List
    public Integer getDialoguesSize(Long audioId) {
        List<Document> pipeline = Arrays.asList(
                // Select the document for this audio; skip it if it has no dialogues field
                new Document("$match", new Document("audioId", audioId)
                        .append("dialogues", new Document("$exists", true))),
                // Project only the length of the nested array
                new Document("$project",
                        new Document("datasSize", new Document("$size", "$dialogues"))));
        Document document = generalCollection.aggregate(pipeline).first();
        // No matching document means there are no dialogues
        return document == null ? 0 : document.getInteger("datasSize", 0);
    }

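Querying by a Property of the Nested List Elements

The code below queries the entries in the dialogues array of a given audioId whose createTime is earlier than the given value, and pages through them with limit. It uses the Aggregates builders and the $unwind stage of MongoDB to run an aggregation query; see the official MongoDB documentation for details.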

    // Uses com.mongodb.client.model.Aggregates, Filters and Sorts
    public AIDialoguesResultDTO queryAiResult(Long audioId, Long createTime, Integer limit) {
        List<Bson> pipeline = Arrays.asList(
                Aggregates.match(Filters.eq("audioId", audioId)),
                // Emit one document per element of the dialogues array
                Aggregates.unwind("$dialogues"),
                Aggregates.match(Filters.lt("dialogues.createTime", createTime)),
                // Newest first, so that limit keeps the entries closest to createTime
                Aggregates.sort(Sorts.descending("dialogues.createTime")),
                Aggregates.limit(limit));
        List<AIDialoguesResult> aiDialoguesResultList = new ArrayList<>();
        String contextId = Constant.EMPTY_STR;
        for (Document document : generalCollection.aggregate(pipeline)) {
            // After $unwind, "dialogues" holds a single sub-document
            Document dialogue = document.get("dialogues", Document.class);
            AIDialoguesResult aiDialoguesResult = new AIDialoguesResult();
            aiDialoguesResult.setAsk(dialogue.getString("ask"));
            aiDialoguesResult.setAnswer(dialogue.getString("answer"));
            aiDialoguesResult.setCreateTime(dialogue.getLong("createTime"));
            aiDialoguesResultList.add(aiDialoguesResult);
            contextId = document.getString("contextId");
        }
        // Return the page in chronological order
        aiDialoguesResultList.sort(Comparator.comparingLong(AIDialoguesResult::getCreateTime));
        AIDialoguesResultDTO aiDialoguesResultDTO = new AIDialoguesResultDTO();
        aiDialoguesResultDTO.setCount(aiDialoguesResultList.size());
        aiDialoguesResultDTO.setContextId(contextId);
        aiDialoguesResultDTO.setResult(aiDialoguesResultList);
        return aiDialoguesResultDTO;
    }
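To make the paging semantics of this pipeline easier to follow, here is a small sketch that mirrors the same stages on an in-memory list using only the JDK. It does not talk to MongoDB. The class DialoguePagingSketch, its nested Dialogue type, and the walkBackwards helper are hypothetical names introduced for illustration. The helper shows one plausible way a caller might use createTime as a cursor: pass the createTime of the oldest entry on the current page to fetch the page before it.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.Comparator;
import java.util.List;
import java.util.stream.Collectors;

final class DialoguePagingSketch {

    /** Hypothetical stand-in for one element of the dialogues array. */
    static final class Dialogue {
        final String ask;
        final long createTime;

        Dialogue(String ask, long createTime) {
            this.ask = ask;
            this.createTime = createTime;
        }
    }

    /**
     * Mirrors the pipeline stages in memory: keep entries older than the cursor,
     * take the newest `limit` of them, then return that page oldest-first.
     */
    static List<Dialogue> pageBefore(List<Dialogue> dialogues, long cursor, int limit) {
        List<Dialogue> page = dialogues.stream()
                .filter(d -> d.createTime < cursor)                  // $match with $lt
                .sorted(Comparator.comparingLong((Dialogue d) -> d.createTime)
                        .reversed())                                 // $sort descending
                .limit(limit)                                        // $limit
                .collect(Collectors.toCollection(ArrayList::new));
        Collections.reverse(page); // back to chronological order for display
        return page;
    }

    /** Walks the whole history backwards, one page per call, starting from `now`. */
    static List<List<Dialogue>> walkBackwards(List<Dialogue> dialogues, long now, int limit) {
        List<List<Dialogue>> pages = new ArrayList<>();
        long cursor = now;
        while (true) {
            List<Dialogue> page = pageBefore(dialogues, cursor, limit);
            if (page.isEmpty()) {
                break;
            }
            pages.add(page);
            // The oldest entry of this page becomes the cursor for the next one
            cursor = page.get(0).createTime;
        }
        return pages;
    }
}
```

One consequence of the strict less-than comparison, in the pipeline as well as in this sketch: entries whose createTime equals the cursor are never returned, so two entries that share the same millisecond at a page boundary could be missed.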

Of course, there is also a simpler way to write this:

    public AIDialoguesResultDTO queryAiResultBackupVersion(Long audioId, Long createTime, Integer limit) {
        AITextResult aiTextResult = mongoDao.findSingle(Filters.eq("audioId", audioId), AITextResult.class);
        AIDialoguesResultDTO aiDialoguesResultDTO = new AIDialoguesResultDTO();
        aiDialoguesResultDTO.setResult(Collections.emptyList());
        aiDialoguesResultDTO.setCount(0);
        aiDialoguesResultDTO.setContextId("");
        // Nothing stored for this audio: return the empty result
        if (Objects.isNull(aiTextResult)) {
            return aiDialoguesResultDTO;
        }
        aiDialoguesResultDTO.setContextId(aiTextResult.getContextId());
        // The whole dialogues array is now in memory
        List<AIDialoguesResult> aiDialoguesResultList = aiTextResult.getDialogues();
        if (CollectionUtils.isEmpty(aiDialoguesResultList)) {
            return aiDialoguesResultDTO;
        }
        List<AIDialoguesResult> afterFilterAiDialoguesResultList = aiDialoguesResultList.stream()
                .filter(t -> t.getCreateTime() < createTime)
                // Newest first, so that limit keeps the entries closest to createTime
                .sorted(Comparator.comparingLong(AIDialoguesResult::getCreateTime).reversed())
                .limit(limit)
                // Return the page in chronological order
                .sorted(Comparator.comparingLong(AIDialoguesResult::getCreateTime))
                .collect(Collectors.toList());
        aiDialoguesResultDTO.setCount(afterFilterAiDialoguesResultList.size());
        aiDialoguesResultDTO.setResult(afterFilterAiDialoguesResultList);
        return aiDialoguesResultDTO;
    }

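This version is more direct: it matches on audioId, loads the entire dialogues array of that document into memory, and then sorts and pages it in memory. Clearly, if the dialogues array is long, memory usage will be high.

Appending Incrementally to a Nested List

To append an element to the dialogues array, we can fetch all dialogues for the audioId, add one element to the end of the List, and save the document back. The code looks roughly like this: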

    public void saveAiResult(SaveAIResultDTO saveAIResultDTO) {
        Bson filter = Filters.eq("audioId", saveAIResultDTO.getAudioId());
        AITextResult aiTextResult = mongoDao.findSingle(filter, AITextResult.class);
        // First dialogue for this audio: create the document
        if (Objects.isNull(aiTextResult)) {
            mongoDao.saveOrUpdate(AITextResult.buildAiTextResult(saveAIResultDTO));
            return;
        }
        AIDialoguesResult aiDialoguesResult = new AIDialoguesResult();
        aiDialoguesResult.setCreateTime(System.currentTimeMillis());
        aiDialoguesResult.setAsk(saveAIResultDTO.getAsk());
        aiDialoguesResult.setAnswer(saveAIResultDTO.getAnswer());
        // Load the full array, append in memory, and write the whole document back
        List<AIDialoguesResult> aiDialoguesResults = aiTextResult.getDialogues();
        if (aiDialoguesResults == null) {
            aiDialoguesResults = new ArrayList<>();
        }
        aiDialoguesResults.add(aiDialoguesResult);
        aiTextResult.setDialogues(aiDialoguesResults);
        mongoDao.saveOrUpdate(aiTextResult);
    }

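There is nothing wrong with this approach in itself, but if the dialogues array is large, fetching all of it for every append can use a lot of memory. Instead, we can use the $push operator of MongoDB to append the element directly: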

    // Uses com.mongodb.client.model.Filters, Projections and Updates
    public void saveAiResultIncremental(SaveAIResultDTO saveAIResultDTO) {
        Bson filter = Filters.eq("audioId", saveAIResultDTO.getAudioId());
        // Existence check only: fetch contextId instead of the whole dialogues array
        Bson projection = Projections.fields(Projections.include("contextId"), Projections.excludeId());
        if (generalCollection.find(filter).projection(projection).first() == null) {
            mongoDao.saveOrUpdate(AITextResult.buildAiTextResult(saveAIResultDTO));
            return;
        }
        // Build the new element as a Document so that no POJO codec is required
        Document dialogue = new Document("ask", saveAIResultDTO.getAsk())
                .append("answer", saveAIResultDTO.getAnswer())
                .append("createTime", System.currentTimeMillis());
        // $push appends on the server; the existing array is never loaded
        generalCollection.updateOne(filter, Updates.push("dialogues", dialogue));
    }

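Summary

Having chosen MongoDB, we should not keep writing queries in the MySQL style. Learn to use the features MongoDB provides; otherwise the results often fall short of expectations.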
