当前位置：首页 > news >正文

[AI Mem0] 源码解读，带你了解 Mem0 的实现

news 2025/7/7 23:26:37

Mem0 的 CRUD 到底是如何实现的？我们来看下源码。

使用

先来看下，如何使用 Mem0

import os
os.environ["OPENAI_API_KEY"] = "sk-xxx"from mem0 import Memorym = Memory()# 1. Add: Store a memory from any unstructured text
result = m.add("I am working on improving my tennis skills. Suggest some online courses.", user_id="alice", metadata={"category": "hobbies"})# Created memory --> 'Improving her tennis skills.' and 'Looking for online suggestions.'# 2. Update: update the memory
result = m.update(memory_id=<memory_id_1>, data="Likes to play tennis on weekends")# Updated memory --> 'Likes to play tennis on weekends.' and 'Looking for online suggestions.'# 3. Search: search related memories
related_memories = m.search(query="What are Alice's hobbies?", user_id="alice")# Retrieved memory --> 'Likes to play tennis on weekends'# 4. Get all memories
all_memories = m.get_all()
memory_id = all_memories[0]["id"] # get a memory_id# All memory items --> 'Likes to play tennis on weekends.' and 'Looking for online suggestions.'# 5. Get memory history for a particular memory_id
history = m.history(memory_id=<memory_id_1>)# Logs corresponding to memory_id_1 --> {'prev_value': 'Working on improving tennis skills and interested in online courses for tennis.', 'new_value': 'Likes to play tennis on weekends' }

MemoryBase

MemoryBase 是一个抽象类，定义了一些接口方法

get
get_all
update
delete
history

class MemoryBase(ABC):@abstractmethoddef get(self, memory_id):"""Retrieve a memory by ID.Args:memory_id (str): ID of the memory to retrieve.Returns:dict: Retrieved memory."""pass@abstractmethoddef get_all(self):"""List all memories.Returns:list: List of all memories."""pass@abstractmethoddef update(self, memory_id, data):"""Update a memory by ID.Args:memory_id (str): ID of the memory to update.data (dict): Data to update the memory with.Returns:dict: Updated memory."""pass@abstractmethoddef delete(self, memory_id):"""Delete a memory by ID.Args:memory_id (str): ID of the memory to delete."""pass@abstractmethoddef history(self, memory_id):"""Get the history of changes for a memory by ID.Args:memory_id (str): ID of the memory to get history for.Returns:list: List of changes for the memory."""pass

Memory

Memory 实现 MemoryBase 接口

class Memory(MemoryBase):

init

    def __init__(self, config: MemoryConfig = MemoryConfig()):self.config = configself.embedding_model = EmbedderFactory.create(self.config.embedder.provider)# Initialize the appropriate vector store based on the configurationvector_store_config = self.config.vector_store.configif self.config.vector_store.provider == "qdrant":self.vector_store = Qdrant(host=vector_store_config.host,port=vector_store_config.port,path=vector_store_config.path,url=vector_store_config.url,api_key=vector_store_config.api_key,)else:raise ValueError(f"Unsupported vector store type: {self.config.vector_store_type}")self.llm = LlmFactory.create(self.config.llm.provider, self.config.llm.config)self.db = SQLiteManager(self.config.history_db_path)self.collection_name = self.config.collection_nameself.vector_store.create_col(name=self.collection_name, vector_size=self.embedding_model.dims)self.vector_store.create_col(name=self.collection_name, vector_size=self.embedding_model.dims)capture_event("mem0.init", self)

初始化 embedding_model, vector_store(这里只能是 Qdrant), llm, db, collection_name

add

    def add(self,data,user_id=None,agent_id=None,run_id=None,metadata=None,filters=None,prompt=None,):"""Create a new memory.Args:data (str): Data to store in the memory.user_id (str, optional): ID of the user creating the memory. Defaults to None.agent_id (str, optional): ID of the agent creating the memory. Defaults to None.run_id (str, optional): ID of the run creating the memory. Defaults to None.metadata (dict, optional): Metadata to store with the memory. Defaults to None.filters (dict, optional): Filters to apply to the search. Defaults to None.Returns:str: ID of the created memory."""

将用户 data 发给 llm ，得到 extracted_memories
将用户 data 转成 embeddings
vector_store 根据 embeddings search 得到 existing_memories
将新，老 memory 发给 llm 来 merge
调用函数 _create_memory_tool 进行实际操作
- vector_store insert
- db add_history

get

    def get(self, memory_id):"""Retrieve a memory by ID.Args:memory_id (str): ID of the memory to retrieve.Returns:dict: Retrieved memory."""

vector_store 根据 memory_id 去 get

get_all

    def get_all(self, user_id=None, agent_id=None, run_id=None, limit=100):"""List all memories.Returns:list: List of all memories."""

vector_store 根据 collection_name, filters, limit 调用 list 接口

search

    def search(self, query, user_id=None, agent_id=None, run_id=None, limit=100, filters=None):"""Search for memories.Args:query (str): Query to search for.user_id (str, optional): ID of the user to search for. Defaults to None.agent_id (str, optional): ID of the agent to search for. Defaults to None.run_id (str, optional): ID of the run to search for. Defaults to None.limit (int, optional): Limit the number of results. Defaults to 100.filters (dict, optional): Filters to apply to the search. Defaults to None.Returns:list: List of search results."""

embedding_model 将 query 转 embeddings
vector_store 根据 embeddings search

update

    def update(self, memory_id, data):"""Update a memory by ID.Args:memory_id (str): ID of the memory to update.data (dict): Data to update the memory with.Returns:dict: Updated memory."""

调用 _update_memory_tool
- existing_memory = self.vector_store.get
- embeddings = self.embedding_model.embed(data)
- self.vector_store.update
- self.db.add_history

delete

    def delete(self, memory_id):"""Delete a memory by ID.Args:memory_id (str): ID of the memory to delete."""

调用 _delete_memory_tool
- existing_memory = self.vector_store.get
- self.vector_store.delete
- self.db.add_history

delete_all

    def delete_all(self, user_id=None, agent_id=None, run_id=None):"""Delete all memories.Args:user_id (str, optional): ID of the user to delete memories for. Defaults to None.agent_id (str, optional): ID of the agent to delete memories for. Defaults to None.run_id (str, optional): ID of the run to delete memories for. Defaults to None."""

memories = self.vector_store.list
foreach memories
- _delete_memory_tool

history

    def history(self, memory_id):"""Get the history of changes for a memory by ID.Args:memory_id (str): ID of the memory to get history for.Returns:list: List of changes for the memory."""

self.db.get_history

reset

    def reset(self):"""Reset the memory store."""

self.vector_store.delete_col
self.db.reset()

AnonymousTelemetry

capture_event 收集信息
telemetry 用的是 Posthog(https://us.i.posthog.com)

SQLiteManager

db 用的是 sqlite3
一个记录历史的表

CREATE TABLE IF NOT EXISTS history (id TEXT PRIMARY KEY,memory_id TEXT,prev_value TEXT,new_value TEXT,event TEXT,timestamp DATETIME,is_deleted INTEGER
)

MemoryClient

class MemoryClient:"""Client for interacting with the Mem0 API.This class provides methods to create, retrieve, search, and delete memoriesusing the Mem0 API.Attributes:api_key (str): The API key for authenticating with the Mem0 API.host (str): The base URL for the Mem0 API.client (httpx.Client): The HTTP client used for making API requests."""

主要用于跟平台(https://api.mem0.ai/v1)交互
接口
- add
- get
- get_all
- search
- delete
- delete_all
- history
- reset

Embedding

class EmbeddingBase(ABC):@abstractmethoddef embed(self, text):"""Get the embedding for the given text.Args:text (str): The text to embed.Returns:list: The embedding vector."""pass

HuggingFaceEmbedding(model_name=“multi-qa-MiniLM-L6-cos-v1”)
Ollama(model=“nomic-embed-text”)
OpenAI(model=“text-embedding-3-small”)

LLM

class LLMBase(ABC):def __init__(self, config: Optional[BaseLlmConfig] = None):"""Initialize a base LLM class:param config: LLM configuration option class, defaults to None:type config: Optional[BaseLlmConfig], optional"""if config is None:self.config = BaseLlmConfig()else:self.config = config@abstractmethoddef generate_response(self, messages):"""Generate a response based on the given messages.Args:messages (list): List of message dicts containing 'role' and 'content'.Returns:str: The generated response."""pass

AWSBedrockLLM(anthropic.claude-3-5-sonnet-20240620-v1:0)
GroqLLM(llama3-70b-8192)
LiteLLM(gpt-4o)
OllamaLLM(llama3)
OpenAILLM(gpt-4o)
TogetherLLM(mistralai/Mixtral-8x7B-Instruct-v0.1)

VectorStore

class VectorStoreBase(ABC):@abstractmethoddef create_col(self, name, vector_size, distance):"""Create a new collection."""pass@abstractmethoddef insert(self, name, vectors, payloads=None, ids=None):"""Insert vectors into a collection."""pass@abstractmethoddef search(self, name, query, limit=5, filters=None):"""Search for similar vectors."""pass@abstractmethoddef delete(self, name, vector_id):"""Delete a vector by ID."""pass@abstractmethoddef update(self, name, vector_id, vector=None, payload=None):"""Update a vector and its payload."""pass@abstractmethoddef get(self, name, vector_id):"""Retrieve a vector by ID."""pass@abstractmethoddef list_cols(self):"""List all collections."""pass@abstractmethoddef delete_col(self, name):"""Delete a collection."""pass@abstractmethoddef col_info(self, name):"""Get information about a collection."""pass

只有 Qdrant 一个实现

总结

核心就是 Memory 类，实现了 MemoryBase 接口
通过 embedding_model 来处理文本
通过 vector_store 存储 embedding
通过 llm 处理数据
通过 db 记录 Memory 的历史

github
doc
AI 博客 - 从零开始学AI
公众号 - 从零开始学AI

[AI Mem0] 源码解读，带你了解 Mem0 的实现

Mem0 的 CRUD 到底是如何实现的？我们来看下源码。使用先来看下，如何使用 Mem0 import os os.environ["OPENAI_API_KEY"] "sk-xxx"from mem0 import Memorym Memory()# 1. Add: Store a memory from any unstructured text re…...

编程日记 2024/7/31 8:49:43

【LLM】-10-部署llama-3-chinese-8b-instruct-v3 大模型

目录 1、模型下载 2、下载项目代码 3、启动模型 4、模型调用 4.1、completion接口 4.2、聊天（chat completion） 4.3、多轮对话 4.4、文本嵌入向量 5、Java代码实现调用由于在【LLM】-09-搭建问答系统-对输入Prompt检查-CSDN博客关于提示词注入…...

编程日记 2024/7/31 8:46:40

C语言之理解指针（4）

文章目录 1. 字符指针变量2. 数组指针变量2.1 对数组指针变量的理解2.2 数组指针变量的初始化 3. 二维数组传参的本质4. 函数指针变量4.1 函数指针变量的创建4.2 函数指针变量的使用 5. 函数指针数组 1. 字符指针变量我们在前面使用的主要是整形指针变量，现在要学…...

编程日记 2024/7/31 8:40:33

Java设计模式—单例模式（Singleton Pattern）

目录一、定义二、应用场景三、具体实现示例一示例二四、懒汉与饿汉饿汉模式懒汉模式五、总结六、说明一、定义二、应用场景 ‌单例模式的应用场景主要包括以下几个方面： ‌日志系统：在应用程序中，通常只需要一个日…...

编程日记 2024/7/31 8:38:32

AV1帧间预测（二）：运动补偿

运动补偿(Motion Compensation,MC)是帧间预测最基础的工具，AV1支持两种运动补偿方式，一种是传统的平移运动补偿，另一种是仿射运动补偿。下面分别介绍这两种运动补偿方法。平移运动补偿平移运动补偿是最传统的运动补偿方式，H.26…...

编程日记 2024/7/31 8:37:31

一、二分类 import numpy as np import matplotlib.pyplot as plt from sklearn import datasets from sklearn.model_selection import train_test_split from sklearn.preprocessing import StandardScaler from sklearn.linear_model import LogisticRegression from sklea…...

编程日记 2024/7/31 8:34:28

【C++高阶】：深入探索C++11

✨ 心似白云常自在，意如流水任东西 🌏 📃个人主页：island1314 🔥个人专栏：C学习 🚀 欢迎关注：👍点赞 &#x1f4…...

编程日记 2024/7/31 8:32:25

6. 自定义Docker镜像

如何自定义Docker镜像：从基础到实践 Docker作为一个容器化平台，使得应用的打包、分发和运行变得更加高效和便捷。本文将详细介绍如何自定义一个Docker镜像，包括镜像的构成、分层原理、创建自定义镜像的具体步骤，并演示如何打包和…...

编程日记 2024/7/31 8:30:22

「12月·长沙」人工智能与网络安全国际学术会议（ISAICS 2024）

人工智能与网络安全国际学术会议(ISAICS 2024)将于2024年12月20日-2024年12月22日在湖南长沙召开。会议中发表的文章将会被收录,并于见刊后提交EI核心索引。会议旨在在为国内与国际学者搭建交流平台,推进不同学科领域的融合发展，就当今人工智能与网络安全范畴内各学…...

编程日记 2024/7/31 8:29:21

【技术支持案例】使用S32K144+NSD8381驱动电子膨胀阀

文章目录 1. 前言2. 问题描述3. 理论分析3.1 NSD8381如何连接电机3.2 S32K144和NSD8381的软件配置 4.测试验证4.1 测试环境4.2 测试效果4.3 测试记录 1. 前言最近有客户在使用S32K144NSD8381驱动电子膨胀阀时，遇到无法正常驱动电子膨胀阀的情况。因为笔者也是刚开…...

编程日记 2024/7/31 8:28:19

第二期：集成电路（IC）——智能世界的微观建筑大师

嘿，小伙伴们！👋 我是你们的老朋友小竹笋，一名热爱创作和技术的工程师。上一期我们聊了聊AI芯片，这次我们要深入到更微观的层面，来探究集成电路（IC）的世界。准备好一起探索了吗&#…...

编程日记 2024/7/31 8:21:12

基于物联网的区块链算力网络，ＩＧＰ／ＢＧＰ协议

目录基于物联网的区块链算力网络ＩＧＰ／ＢＧＰ协议 IGP（内部网关协议） BGP（边界网关协议）内部使用ISP的外部使用BGP的原因一、网络规模和复杂性二、路由协议的特性三、满足业务需求四、结论基于物联网的区块链算力网络通过多个物联网传感器将本地计算…...

编程日记 2024/7/31 8:18:08

每日一题~960 div2 A+B+C(简单奇偶博弈，构造，观察性质算贡献)

A题意： N 长的数组。一次操作： 最开始的mx 为零。选出一个数（使得这个数>mx) ,之后将mx 更新为这个数，将这个数置为零。不能做这个操作的，输。问是否有先手赢的策略。有的话，输出yes 否则no 当时一…...

编程日记 2024/7/31 8:15:02

音视频入门基础：H.264专题（17）——FFmpeg源码获取H.264裸流文件信息（视频压缩编码格式、色彩格式、视频分辨率、帧率）的总流程

音视频入门基础：H.264专题系列文章： 音视频入门基础：H.264专题（1）——H.264官方文档下载音视频入门基础：H.264专题（2）——使用FFmpeg命令生成H.264裸流文件音视频入门基础&…...

编程日记 2024/7/31 8:14:01

Aboboo一些操作

常用快捷键⌨ 快捷键/操作方式功能鼠标中键/Esc 进入/退出全屏空格/Tab 暂停/恢复播放左/右箭头快退/快进 Ctrl-左/右箭头 30秒快退/快进 Alt-左/右箭头 60秒快退/快进 Ctrl-Alt-左/右箭头播放速率调节 PageUp/PageDown 上一句/下一句上下箭头/滚轮 …...

编程日记 2024/7/31 8:10:57

获取行号LineNumberReader

(每日持续更新）jdk api之LineNumberReader基础、应用、实战-CSDN博客...

编程日记 2024/7/31 8:09:55

python数据结构与算法

0.时间复杂度和空间复杂度快速判断算法时间复杂度：算法运行时间 1.确定问题规模n 2.循环减半 logn 3.k层关于n的循环 n^k 空间复杂度：评估算法内存占用大小使用几个变量 O（1） 使用长度为n的一维列表 O（n&#xff09…...

编程日记 2024/7/31 8:08:54

大数据学习之Flink基础（补充）

Flink基础 1、系统时间与事件时间系统时间（处理时间） 在Sparksreaming的任务计算时，使用的是系统时间。假设所用窗口为滚动窗口，大小为5分钟。那么每五分钟，都会对接收的数据进行提交任务. 但是，这里有…...

编程日记 2024/7/31 8:06:52

C++基础语法:友元

前言 "打牢基础,万事不愁" .C的基础语法的学习."学以致用,边学边用",编程是实践性很强的技术,在运用中理解,总结. 以<C Prime Plus> 6th Edition(以下称"本书")的内容开展学习引入友元提供了一种特别的方式,访问对象私有数据. 友元有三…...

编程日记 2024/7/31 8:05:51

【大模型系列】Video-LaVIT(2024.06)

Paper：https://arxiv.org/abs/2402.03161Github：https://video-lavit.github.io/Title：Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional TokenizationAuthor：Yang Jin， 北大&#x…...

编程日记 2024/7/31 8:00:45

wordpress后台更新后前端没变化的解决方法

使用siteground主机的wordpress网站，会出现更新了网站内容和修改了php模板文件、js文件、css文件、图片文件后，网站没有变化的情况。不熟悉siteground主机的新手，遇到这个问题，就很抓狂，明明是哪都没操作错误&#x…...

编程新知 2025/7/7 5:57:21

深入剖析AI大模型：大模型时代的 Prompt 工程全解析

今天聊的内容，我认为是AI开发里面非常重要的内容。它在AI开发里无处不在，当你对 AI 助手说 "用李白的风格写一首关于人工智能的诗"，或者让翻译模型 "将这段合同翻译成商务日语" 时，输入的这句话就是 Prompt。…...

编程新知 2025/7/1 6:09:18

相机Camera日志实例分析之二：相机Camx【专业模式开启直方图拍照】单帧流程日志详解

【关注我，后续持续新增专题博文，谢谢！！！】上一篇我们讲了： 这一篇我们开始讲： 目录一、场景操作步骤二、日志基础关键字分级如下三、场景日志如下： 一、场景操作步骤操作步…...

编程新知 2025/7/6 8:45:08

Java如何权衡是使用无序的数组还是有序的数组

在 Java 中，选择有序数组还是无序数组取决于具体场景的性能需求与操作特点。以下是关键权衡因素及决策指南： ⚖️ 核心权衡维度维度有序数组无序数组查询性能二分查找 O(log n) ✅线性扫描 O(n) ❌插入/删除需移位维护顺序 O(n) ❌直接操作尾部 O(1) ✅内存开销与无序数组相…...

编程新知 2025/7/4 21:35:34

ABAP设计模式之---“简单设计原则(Simple Design)”

“Simple Design”（简单设计）是软件开发中的一个重要理念，倡导以最简单的方式实现软件功能，以确保代码清晰易懂、易维护，并在项目需求变化时能够快速适应。其核心目标是避免复杂和过度设计，遵循“让事情保…...

编程新知 2025/7/7 0:36:08

Reasoning over Uncertain Text by Generative Large Language Models

https://ojs.aaai.org/index.php/AAAI/article/view/34674/36829https://ojs.aaai.org/index.php/AAAI/article/view/34674/36829 1. 概述文本中的不确定性在许多语境中传达，从日常对话到特定领域的文档（例如医学文档）（Heritage 2013；Landmark、Gulbrandsen 和 Svenevei…...

编程新知 2025/7/7 2:29:38