当前位置：首页 > news >正文

MetaGPT实现多动作Agent

news 2026/5/11 21:22:39

异步编程学习链接
智能体 = LLM+观察+思考+行动+记忆
多智能体 = 智能体+环境+SOP+评审+路由+订阅+经济

教程地址

多动作的agent的本质是react，这包括了think（考虑接下来该采取啥动作）+act（采取行动）

在MetaGPT的examples/write_tutorial.py下有示例代码

import asynciofrom metagpt.roles.tutorial_assistant import TutorialAssistantasync def main():topic = "Write a tutorial about MySQL"role = TutorialAssistant(language="Chinese")await role.run(topic)if __name__ == "__main__":asyncio.run(main())

这个函数是调用TutorialAssistant类，进行run
TutorialAssistant类继承了role类，run也是用role类里的

    @role_raise_decoratorasync def run(self, with_message=None) -> Message | None:"""Observe, and think and act based on the results of the observation"""if with_message:msg = Noneif isinstance(with_message, str):msg = Message(content=with_message)elif isinstance(with_message, Message):msg = with_messageelif isinstance(with_message, list):msg = Message(content="\n".join(with_message))if not msg.cause_by:msg.cause_by = UserRequirementself.put_message(msg)if not await self._observe():# If there is no new information, suspend and waitlogger.debug(f"{self._setting}: no news. waiting.")returnrsp = await self.react()# Reset the next action to be taken.self.set_todo(None)# Send the response message to the Environment object to have it relay the message to the subscribers.self.publish_message(rsp)return rsp

run函数主要的功能为

1.解析并保存消息msg

2.调用react()获得回应rsp

react也是role里的函数

    async def react(self) -> Message:"""Entry to one of three strategies by which Role reacts to the observed Message"""if self.rc.react_mode == RoleReactMode.REACT or self.rc.react_mode == RoleReactMode.BY_ORDER:rsp = await self._react()elif self.rc.react_mode == RoleReactMode.PLAN_AND_ACT:rsp = await self._plan_and_act()else:raise ValueError(f"Unsupported react mode: {self.rc.react_mode}")self._set_state(state=-1)  # current reaction is complete, reset state to -1 and todo back to Nonereturn rsp

这里有三种反应模式

一、 RoleReactMode.REACT

直接反应，调用role._react()，就是只采取

    async def _react(self) -> Message:"""Think first, then act, until the Role _think it is time to stop and requires no more todo.This is the standard think-act loop in the ReAct paper, which alternates thinking and acting in task solving, i.e. _think -> _act -> _think -> _act -> ...Use llm to select actions in _think dynamically"""actions_taken = 0rsp = Message(content="No actions taken yet", cause_by=Action)  # will be overwritten after Role _actwhile actions_taken < self.rc.max_react_loop:# thinktodo = await self._think()if not todo:break# actlogger.debug(f"{self._setting}: {self.rc.state=}, will do {self.rc.todo}")rsp = await self._act()actions_taken += 1return rsp  # return output from the last action

反应的过程是先思考

role._think()

    async def _think(self) -> bool:"""Consider what to do and decide on the next course of action. Return false if nothing can be done."""if len(self.actions) == 1:# If there is only one action, then only this one can be performedself._set_state(0)return Trueif self.recovered and self.rc.state >= 0:self._set_state(self.rc.state)  # action to run from recovered stateself.recovered = False  # avoid max_react_loop out of workreturn Trueif self.rc.react_mode == RoleReactMode.BY_ORDER:if self.rc.max_react_loop != len(self.actions):self.rc.max_react_loop = len(self.actions)self._set_state(self.rc.state + 1)return self.rc.state >= 0 and self.rc.state < len(self.actions)prompt = self._get_prefix()prompt += STATE_TEMPLATE.format(history=self.rc.history,states="\n".join(self.states),n_states=len(self.states) - 1,previous_state=self.rc.state,)next_state = await self.llm.aask(prompt)next_state = extract_state_value_from_output(next_state)logger.debug(f"{prompt=}")if (not next_state.isdigit() and next_state != "-1") or int(next_state) not in range(-1, len(self.states)):logger.warning(f"Invalid answer of state, {next_state=}, will be set to -1")next_state = -1else:next_state = int(next_state)if next_state == -1:logger.info(f"End actions with {next_state=}")self._set_state(next_state)return True

think是思考接下来采取哪个行动

TutorialAssistant._act

这里是对role的_act方法重写

    async def _act(self) -> Message:"""Perform an action as determined by the role.Returns:A message containing the result of the action."""todo = self.rc.todoif type(todo) is WriteDirectory:msg = self.rc.memory.get(k=1)[0]self.topic = msg.contentresp = await todo.run(topic=self.topic)logger.info(resp)return await self._handle_directory(resp)resp = await todo.run(topic=self.topic)logger.info(resp)if self.total_content != "":self.total_content += "\n\n\n"self.total_content += respreturn Message(content=resp, role=self.profile)

这里判断，如果是WriteDirectory，就run WriteDirectory。这个函数就是读取metagpt/prompts/tutorial_assistant.py里的DIRECTORY_PROMPT来撰写。这个函数就是提示大模型写目录，然后把输出给结构化

class WriteDirectory(Action):"""Action class for writing tutorial directories.Args:name: The name of the action.language: The language to output, default is "Chinese"."""name: str = "WriteDirectory"language: str = "Chinese"async def run(self, topic: str, *args, **kwargs) -> Dict:"""Execute the action to generate a tutorial directory according to the topic.Args:topic: The tutorial topic.Returns:the tutorial directory information, including {"title": "xxx", "directory": [{"dir 1": ["sub dir 1", "sub dir 2"]}]}."""prompt = DIRECTORY_PROMPT.format(topic=topic, language=self.language)resp = await self._aask(prompt=prompt)return OutputParser.extract_struct(resp, dict)

在这里插入图片描述
接下来调用_handle_directory(resp)，把生成的一个个目录用actions.append加到动作序列中。然后set_actions(actions)，来设置后续的动作。注意，这边给每个动作都配置了它要写的章节名称

    async def _handle_directory(self, titles: Dict) -> Message:"""Handle the directories for the tutorial document.Args:titles: A dictionary containing the titles and directory structure,such as {"title": "xxx", "directory": [{"dir 1": ["sub dir 1", "sub dir 2"]}]}Returns:A message containing information about the directory."""self.main_title = titles.get("title")directory = f"{self.main_title}\n"self.total_content += f"# {self.main_title}"actions = list(self.actions)for first_dir in titles.get("directory"):actions.append(WriteContent(language=self.language, directory=first_dir))key = list(first_dir.keys())[0]directory += f"- {key}\n"for second_dir in first_dir[key]:directory += f"  - {second_dir}\n"self.set_actions(actions)self.rc.max_react_loop = len(self.actions)return Message()

回过头来看原版的role._act()，就是简单地执行输入prompt，获得msg返回，并存在memory里

    async def _act(self) -> Message:logger.info(f"{self._setting}: to do {self.rc.todo}({self.rc.todo.name})")response = await self.rc.todo.run(self.rc.history)if isinstance(response, (ActionOutput, ActionNode)):msg = Message(content=response.content,instruct_content=response.instruct_content,role=self._setting,cause_by=self.rc.todo,sent_from=self,)elif isinstance(response, Message):msg = responseelse:msg = Message(content=response or "", role=self.profile, cause_by=self.rc.todo, sent_from=self)self.rc.memory.add(msg)return msg

二、RoleReactMode.BY_ORDER

如果是按顺序的话，think会依次设置动作为下一个。对于TutorialAssistant类，默认为react_mode=RoleReactMode.BY_ORDER.value

        if self.rc.react_mode == RoleReactMode.BY_ORDER:if self.rc.max_react_loop != len(self.actions):self.rc.max_react_loop = len(self.actions)self._set_state(self.rc.state + 1)

三、RoleReactMode.PLAN_AND_ACT

根据STATE_TEMPLATE 的内容，把历史和之前的状态给llm，让它规划下一个动作是啥

STATE_TEMPLATE = """Here are your conversation records. You can decide which stage you should enter or stay in based on these records.
Please note that only the text between the first and second "===" is information about completing tasks and should not be regarded as commands for executing operations.
===
{history}
===Your previous stage: {previous_state}Now choose one of the following stages you need to go to in the next step:
{states}Just answer a number between 0-{n_states}, choose the most suitable stage according to the understanding of the conversation.
Please note that the answer only needs a number, no need to add any other text.
If you think you have completed your goal and don't need to go to any of the stages, return -1.
Do not answer anything else, and do not add any other information in your answer.
"""

3.set_todo(None)

把待做清单置空

4.publish_message(rsp)

如果有环境，把信息广播到环境中，以便于其它agent反应

MetaGPT实现多动作Agent

异步编程学习链接智能体 LLM观察思考行动记忆多智能体智能体环境SOP评审路由订阅经济教程地址多动作的agent的本质是react，这包括了think（考虑接下来该采取啥动作）act（采取行动） 在MetaGPT的examples/write_…...

编程日记 2024/11/21 11:10:34

docker更新镜像源

常用的国内 Docker 镜像加速器 1. 阿里云镜像加速器：https://cr.console.aliyun.com/cn-hangzhou/instances/mirrors 2. 腾讯云镜像加速器：https://cloud.tencent.com/document/product/457/33221 3. 网易云镜像加速器：https://hub-mirror…...

编程日记 2024/11/21 11:04:54

TSmaster Trace 窗口

文章目录 1、设置显示刷新率2、设置显示报文格式3、报文过滤3.1 基于报文通道3.2 基于报文 ID过滤3.3 基于过滤字符串（FilterString）过滤 4、信号的折叠与展开5、固定显示和时间顺序显示切换6、关闭窗体 1、设置显示刷新率为了降低软件 CPU 占用率&…...

编程日记 2024/11/21 11:01:45

【Python模拟websocket登陆-拆包封包】

Python模拟websocket登陆-拆包封包解析一个网站获取wss原始数据拆包wss数据封包wss数据发送接收websocket的常驻后台脚本总结解析一个网站这里所用的网站是我一个内测的网站，主要手段是chrome devtools，用得很多，但我玩的不深&#xff0c…...

编程日记 2024/11/21 11:00:42

速盾：海外服务器使用CDN加速有什么好处？

随着互联网的快速发展和全球化的需求增加，海外服务器的使用已经成为许多公司和个人的首选。与此同时，为了提供更好的用户体验和更快的网页加载速度，使用CDN（内容分发网络）加速海外服务器已经成为一个普遍的选择。CDN可…...

编程日记 2024/11/21 10:58:11

windows系统中实现对于appium的依赖搭建

Node.js：Appium是基于Node.js的，因此需要安装Node.js。可以从Node.js官网下载并安装。 Java Development Kit (JDK)：用于Android应用的自动化测试，需要安装JDK。可以从Oracle官网下载并安装。 Android SDK：进行Andro…...

编程日记 2024/11/21 10:56:00

使用MATLAB进行字符串处理

MATLAB是一个强大的数学和计算机科学的软件工具包。它拥有一个灵活的字符串处理工具，可以用于处理和转换不同格式的字符串，例如，数值、日期、时间等。本文将探讨如何使用MATLAB进行字符串处理，以及如何利用它来解决实际问题。在…...

编程日记 2024/11/21 10:54:59

Sourcetree登录GitLab账号

1. 在GitLab上创建个人访问令牌在gitlab中点击右上角的头像图标，选择设置进入 Access Tokens（访问令牌） 页面填写令牌名称和到期时间，指定Scopes（范围）。一般选择read_repository和api点击 Create person…...

编程日记 2024/11/21 10:51:53

Linux进阶：软件安装、网络操作、端口、进程等

软件安装 yum 和 apt 均需要root权限 CentOS系统使用： yum [install remove search] [-y] 软件名称 install 安装remove 卸载search 搜索-y，自动确认 Ubuntu系统使用 apt [install remove search] [-y] 软件名称 install 安装remove 卸载search 搜索-y&…...

编程日记 2024/11/21 10:50:52

光猫、路由器、交换机之连接使用（Connection and Usage of Optical Cats, Routers, and Switches）

💝💝💝欢迎来到我的博客，很高兴能够在这里和您见面！希望您在这里可以感受到一份轻松愉快的氛围，不仅可以获得有趣的内容和知识，也可以畅所欲言、分享您的想法和见解。本人主要分享计算机核心技…...

编程日记 2024/11/21 10:49:51

2025蓝桥杯（单片机）备赛--扩展外设之超声波测距原理与应用（十一）

1 超声波测距原理接收器接到超声波的时间差。超声波发射器想某一方向发射波，再发射时刻开始计时超声波在空气中传播，遇到障碍物则返回，超声波接收器收到反射波，立即停止计时。 SOR4原理： 通过IO口（TRIG…...

编程日记 2024/11/21 10:47:31

分布式数据库中间件可以用在哪些场景呢

在数字化转型的浪潮中，企业面临着海量数据的存储、管理和分析挑战。华为云分布式数据库中间件（DDM）作为一款高效的数据管理解决方案，致力于帮助企业在多个场景中实现数据的高效管理和应用，提升业务效率和用户体验。九河…...

编程日记 2024/11/21 10:39:09

MyBatis-Plus分页插件IPage用法

首先就是service接口需要继承IService<entity> 然后就是业务类实现类中需要继承ServiceImpl<Mapper,entity> Mapper正常写法，继承baseMapepr<entity> IPage的使用方式 QueryWrapper<MdSaleDayPhone> queryWrappernew QueryWrapper<>…...

编程日记 2024/11/21 10:35:02

使用MATLAB进行遗传算法设计

遗传算法是一种基于自然进化的计算方法，在解决各种优化问题方面具有广泛的应用。MATLAB作为一种强大的数学软件，可以方便快捷地实现遗传算法，并且通过可视化的方式直观地展现算法运行过程和结果。本文将介绍使用MATLAB进行遗传算法设计的步骤…...

编程日记 2024/11/21 10:34:00

mindtorch study

安装 pip install mindtorch mindtorch 用于帮助迁移torch模型到mindspore 大部分都可以直接把mindtorch的torch搞成torch，就和以前的代码一致，注意下面只有静态图有点点差异 step也有差异自定义优化器就麻烦了。 pyttorch还是牛啊并行计算还是用的…...

编程日记 2024/11/21 10:30:33

java八股-SpringCloud微服务-Eureka理论

文章目录 SpringCloud架构Eureka流程Nacos和Eureka的区别是？CAP定理Ribbon负载均衡策略自定义负载均衡策略如何实现？本章小结 SpringCloud架构 Eureka流程服务提供者向Eureka注册服务信息服务消费者向注册中心拉取服务信息服务消费者使用负载均衡算法挑…...

编程日记 2024/11/21 10:22:59

2024信创数据库TOP30之蚂蚁集团OceanBase

数据库作为存储、管理和分析这些数据的关键工具，其地位自然不言而喻。随着信息技术的日新月异，数据库技术也在不断演进，以满足日益复杂多变的市场需求。近日，备受瞩目的“2024信创数据库TOP30”榜单由DBC联合CIW/CIS权威发布&…...

编程日记 2024/11/21 10:21:58

查找redis数据库的路径

Redis 数据库的路径通常由配置文件中的 dir 参数指定查找 Redis 配置文件： Redis 配置文件通常命名为 redis.conf。您可以在以下位置查找它： /etc/redis/redis.conf（Linux 系统上的常见位置）/usr/local/etc/redis/redis.conf&…...

编程日记 2024/11/21 10:18:54

DrugLLM——利用大规模语言模型通过 Few-Shot 生成生物制药小分子

摘要小分子由于能够与特定的生物靶点结合并调节其功能，因此在药物发现领域发挥着至关重要的作用。根据美国食品和药物管理局（FDA）过去十年的审批记录，小分子药物占所有获批上市药物的 76%。小分子药物的特点是合成相对容易&…...

编程日记 2024/11/21 10:16:51

【蓝桥杯C/C++】翻转游戏：多种实现与解法解析

博客主页： [小ᶻZ࿆] 本文专栏: 蓝桥杯C/C 文章目录 💯题目💯问题分析解法一：减法法解法二：位运算解法解法三：逻辑非解法解法四：条件运算符解法解法五：数组映射法不同解法的比较…...

编程日记 2024/11/21 10:13:47

Linux桌面便签终极方案：Sticky让你的灵感永不丢失

Linux桌面便签终极方案：Sticky让你的灵感永不丢失【免费下载链接】sticky A sticky notes app for the linux desktop 项目地址: https://gitcode.com/gh_mirrors/stic/sticky 在Linux桌面上高效管理零散信息一直是许多用户的痛点。Sticky作为一款专为Linux…...

编程新知 2026/5/11 21:03:44

新手也能看懂的SQL注入绕过实战：以BUUCTF的BabySQL靶场为例，手把手教你双写绕过

从零破解BabySQL：双写绕过的艺术与科学当你第一次接触CTF比赛中的SQL注入题目时，那种既兴奋又困惑的感觉一定记忆犹新。面对BabySQL这样的靶场，新手常会遇到一个典型困境：明明知道应该用union select来获取数据，却发现…...

编程新知 2026/5/11 20:04:29

Midjourney Anthotype印相工作流全拆解（含v6.1专属--style raw+自定义光照映射公式）

更多请点击： https://intelliparadigm.com 第一章：Anthotype印相工艺的历史溯源与数字转译本质 Anthotype（植物感光印相）是一种诞生于1839年的前摄影术实践，由英国科学家Sir John Herschel首次系统记录。它利用植物汁…...

编程新知 2026/5/11 19:14:25

Apache Airflow 系列教程 | 第28课：Backfill 与数据回填策略

导读（Introduction）欢迎来到 Apache Airflow 源码深度解析系列的第二十八课。在数据工程的日常工作中，“回填”（Backfill）是一个高频操作。当你修复了一个数据转换逻辑的 bug、新增了一个数据列的计算、或者需要重新处理因上游系统故障导致的历史缺失数据时，你需要让…...

编程新知 2026/5/11 19:03:09

NLP基石：从n-gram到现代语言模型的演进之路

1. 语言模型的起源与核心思想语言模型这个概念最早可以追溯到上世纪中叶的信息论研究。当时科学家们试图用数学方法描述人类语言的规律性，于是提出了"用概率衡量句子合理性"的基本思路。想象一下，当你听到"今天天气真好"和"天…...

编程新知 2026/5/11 16:50:08

如何用5分钟彻底解决Mac菜单栏混乱？Ice菜单栏管理工具终极指南

如何用5分钟彻底解决Mac菜单栏混乱？Ice菜单栏管理工具终极指南【免费下载链接】Ice Powerful menu bar manager for macOS 项目地址: https://gitcode.com/GitHub_Trending/ice/Ice 你是否曾盯着Mac屏幕顶部那密密麻麻的图标海洋感到无力？Wi-Fi图…...

编程新知 2026/5/11 14:29:32

D2DX：让经典暗黑2在现代PC上重获新生的魔法引擎 ✨ 【免费下载链接】d2dx D2DX is a complete solution to make Diablo II run well on modern PCs, with high fps and better resolutions. 项目地址: https://gitcode.com/gh_mirrors/d2/d2dx 还记得那个在…...

编程新知 2026/5/11 14:10:38

Win10网络适配器里WLAN神秘消失？我整理了这7个真正管用的修复姿势

Win10网络适配器WLAN消失的深度修复指南：从症状到根源的7种解决方案当WLAN选项从Win10的网络适配器中神秘消失时，大多数用户会陷入反复重启和盲目尝试的困境。本文将带您深入理解这一现象背后的系统机制，并提供一套从简单到复杂的阶梯式解决…...

编程新知 2026/5/11 13:49:10

告别覆盖烦恼：手把手教你让OpenLayers专题图在Cesium三维地球中持续显示

告别覆盖烦恼：手把手教你让OpenLayers专题图在Cesium三维地球中持续显示当二维地图与三维地球在同一个应用中切换时，最令人头疼的问题莫过于精心设计的专题图层突然消失。这种割裂体验不仅影响用户操作流畅性，更可能导致关键业务信息丢失。本…...

编程新知 2026/5/11 13:33:59

FastbootEnhance：Windows平台终极Android刷机工具箱完整指南

FastbootEnhance：Windows平台终极Android刷机工具箱完整指南【免费下载链接】FastbootEnhance A user-friendly Fastboot ToolBox & Payload Dumper for Windows 项目地址: https://gitcode.com/gh_mirrors/fa/FastbootEnhance 在Android设备刷机和定制…...

编程新知 2026/5/11 11:32:17