
Multi-Agent Systems: From Classical Paradigms to a Future Driven by Large Foundation Models

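Information Technology 2026-04-20 East China University of Science and Technology & Nantong University & Fudan University & Swinburne University of Technology caddie💞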

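Key Points

This article examines the evolution of multi-agent systems (MASs) from classical paradigms toward future architectures built on large foundation models (LFMs). It first reviews research progress on classical MASs (CMASs) along four fundamental dimensions: perception, communication, decision-making, and control. It then analyzes the core modules, interaction mechanisms, optimization methods, and emergent collective intelligence of LFM-based MASs (LMASs). Next, it compares CMASs and LMASs along several dimensions, contrasting their architecture, operating mechanisms, adaptability, and applications. Finally, it outlines future research directions for MASs, summarizing open challenges and potential research opportunities.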

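Key Data

The article cites a large body of recent research papers and materials spanning multiple MAS domains, including robotics, social intelligence, and satellite systems.
The article mentions several concrete MAS examples, such as ChatDev, MetaGPT, OpenClaw, RoCo, AgentVerse, MACNET, and SocioVerse.
The article analyzes several core modules of LMASs, including role definition, perception, planning, memory, and execution.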
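
To make the module list above concrete, here is a minimal Python sketch of how role definition, perception, planning, memory, and execution might fit together in a single agent. It is an illustration under stated assumptions, not the architecture of any system named in the article: the `Agent` class, its method names, and `stub_model` (a canned stand-in for an LFM call) are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, List


def stub_model(prompt: str) -> str:
    """Hypothetical stand-in for an LFM call; always returns a canned plan."""
    return "1. read the request\n2. draft a reply"


@dataclass
class Agent:
    role: str                                   # role definition
    model: Callable[[str], str] = stub_model    # cognitive core (assumed interface)
    memory: List[str] = field(default_factory=list)

    def perceive(self, observation: str) -> str:
        # Perception: normalize the raw input and record it in memory.
        cleaned = observation.strip()
        self.memory.append(f"observed: {cleaned}")
        return cleaned

    def plan(self, observation: str) -> List[str]:
        # Planning: ask the model for steps, conditioned on role and recent memory.
        context = "\n".join(self.memory[-5:])
        prompt = f"Role: {self.role}\nMemory:\n{context}\nTask: {observation}"
        reply = self.model(prompt)
        steps = [line.strip() for line in reply.splitlines() if line.strip()]
        self.memory.append(f"planned: {len(steps)} steps")
        return steps

    def execute(self, steps: List[str]) -> List[str]:
        # Execution: each step is only logged here; a real system would call tools.
        results = [f"{self.role} did: {step}" for step in steps]
        self.memory.extend(results)
        return results


def run_agent(role: str, observation: str) -> List[str]:
    """Run one perceive -> plan -> execute cycle for a fresh agent."""
    agent = Agent(role=role)
    return agent.execute(agent.plan(agent.perceive(observation)))
```

Calling `run_agent("reviewer", "check the patch")` runs one cycle against the stub. Replacing `stub_model` with a real model client is the only change the sketch anticipates, and the prompt format shown is likewise an assumption.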

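Research Conclusions

CMASs and LMASs share architectural similarities: both exhibit autonomy, reactivity, proactiveness, and social ability; both pursue system-level intelligent objectives; both support centralized, decentralized, or hybrid organizational structures; both can execute tasks in parallel or in a distributed manner; and both may exhibit system-level behavior that exceeds the capabilities of any single agent.
CMASs and LMASs differ in operating mechanisms, mainly in perception, action, communication, and safety risks. CMASs typically rely on low-dimensional, structured physical or numerical state variables; their actions are usually low-level and fine-grained; their communication uses compact, structured information exchange; and their safety risks stem mainly from model errors, noise, and communication failures. LMASs extend to unstructured multimodal inputs; their actions are high-level and intent-oriented; their communication relies on natural language; and their safety risks take the form of problems inherent to language-based reasoning, such as hallucination, instruction drift, context misinterpretation, and tool misuse.
CMASs and LMASs differ in adaptability. CMASs usually need to be redesigned or retrained to adapt to a new task, whereas LMASs can adapt quickly to new scenarios through zero-shot or few-shot transfer learning.
CMASs and LMASs differ in application scenarios. CMASs suit tasks that are modelable, tightly constrained, and structurally well defined, such as formation control, task scheduling, and energy management. LMASs suit open-world, knowledge-intensive, and unstructured scenarios, such as embodied AI, GUI operation, software engineering, and social simulation.
Future MAS research will focus on cross-paradigm integration, multimodal extension, causality-enhanced reasoning, scalable device-edge-cloud deployment, embodied intelligence, and ethical issues.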
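
The "compact, structured information exchange" attributed to CMASs in the conclusions above can be illustrated with a textbook discrete-time average-consensus update, in which each agent sends its neighbors a single number and moves toward them: x_i(k+1) = x_i(k) + ε Σ_{j∈N_i} (x_j(k) − x_i(k)). The Python sketch below is a generic illustration, not the specific scheme of any work the article cites. The ring topology, the step size of 0.3, and the function names are assumptions chosen for the example.

```python
from typing import Dict, List


def consensus_step(
    states: List[float], neighbors: Dict[int, List[int]], step_size: float
) -> List[float]:
    """One synchronous update: x_i <- x_i + eps * sum_j (x_j - x_i)."""
    return [
        x_i + step_size * sum(states[j] - x_i for j in neighbors[i])
        for i, x_i in enumerate(states)
    ]


def run_consensus(
    states: List[float],
    neighbors: Dict[int, List[int]],
    step_size: float = 0.3,
    iterations: int = 100,
) -> List[float]:
    """Repeat the update; on a connected undirected graph with a small enough
    step size, every state approaches the average of the initial states."""
    for _ in range(iterations):
        states = consensus_step(states, neighbors, step_size)
    return states


# Assumed topology: four agents on an undirected ring (each has two neighbors).
RING = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [2, 0]}

if __name__ == "__main__":
    final = run_consensus([1.0, 3.0, 5.0, 7.0], RING)
    print(final)  # every entry should be close to the initial average, 4.0
```

The step size is kept below the reciprocal of the maximum node degree (here 1/2), a standard sufficient condition for this update to converge on an undirected graph. An LMAS would instead exchange natural-language messages, which have no comparably simple convergence analysis.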

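Future Research Directions

Co-evolution of CMASs and LMASs
Multimodal extension
Causality-enhanced reasoning
Device-edge-cloud deployment
Embodied intelligence
Ethics and safety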

Zixiang Wang, Mengjia Gong, Qiyu Sun, Jing Xu, Senior Member, IEEE, Shuai Mao, Xin Jin, Qing-Long Han, Fellow, IEEE, and Yang Tang, Fellow, IEEE

arXiv:2604.18133v1 [cs.AI] 20 Apr 2026

Abstract—With the rapid advancement of artificial intelligence, multi-agent systems (MASs) are evolving from classical paradigms toward architectures built upon large foundation models (LFMs). This survey provides a systematic review and comparative analysis of classical MASs (CMASs) and LFM-based MASs (LMASs). First, within a closed-loop coordination framework, CMASs are reviewed across four fundamental dimensions: perception, communication, decision-making, and control. Beyond this framework, LMASs integrate LFMs to lift collaboration from low-level state exchanges to semantic-level reasoning, enabling more flexible coordination and improved adaptability across diverse scenarios. Then, a comparative analysis is conducted to contrast CMASs and LMASs across architecture, operating mechanism, adaptability, and application. Finally, future perspectives on MASs are presented, summarizing open challenges and potential research opportunities.

Index Terms—Artificial intelligence, Multi-agent System, Large foundation model, Agentic AI.

I. INTRODUCTION

Multi-agent systems (MASs) have become a core artificial intelligence research paradigm with broad applications in multiple disciplines, including robotics [1], social intelligence [2], and satellite systems [3]. Inspired by biological swarms and functional requirements of complex distributed systems [4], [5], MASs focus on how multiple autonomous agents achieve global coordination or collective intelligence through interaction [6]. Compared with single-agent systems, MASs provide a natural framework for modeling complex interactions and coordination among multiple autonomous entities in real-world environments [7], [8].

In this survey, MASs that do not incorporate large foundation models (LFMs) are collectively referred to as classical MASs (CMASs) [9], [10]. CMASs rely on explicitly designed system models or task-specific learning mechanisms. From a methodological perspective, existing CMASs research can be broadly categorized into model-based and learning-based approaches. Model-based research has gradually established several classical problem domains and theoretical frameworks, including consensus control [11], formation control [12], task scheduling [13], and bio-inspired optimization [14]. These studies typically assume modelable systems and clear objectives to ensure provable stability and performance [9]. However, in scenarios with unmodelable environments, unknown system dynamics and partial observability, reliance on explicit modeling and control design is often limited. As a result, learning-based methods such as multi-agent reinforcement learning (MARL) have emerged as an important alternative, enabling agents to learn coordinated policies through interaction without accurate models [15]. While this paradigm partially mitigates model dependency in complex settings, MARL still suffers from limitations in sample efficiency, stability, interpretability, and generalization [16].

The limitations of CMASs motivate the exploration of more general approaches with reasoning capabilities, leading to the integration of large foundation models (LFMs) into MASs [17]. In the context of MASs, LFMs serve as the cognitive core of agents. They enable agents to interpret unstructured multimodal inputs, maintain contextual understanding, reason over complex tasks, and generate high-level actions or interaction messages [18]. This shifts agent operation away from predefined system models, handcrafted rules, or task-specific policies in CMASs toward semantic-level perception and language-based interaction, enabling more flexible coordination [19]. This evolution marks a fundamental paradigm shift from task-specific, environment-constrained CMASs to more adaptive, general-purpose, and cognitively empowered LFM-based MASs (LMASs). By leveraging the pretrained knowledge and reasoning abilities of LFMs, these systems can perform complex multi-step planning, knowledge retrieval, and high-level decision-making [20], [21]. As illustrated in Fig. 1, unlike CMASs tailored to fixed environments, LMASs generalize well and accumulate experience across tasks, supporting flexible collaboration in open, dynamic scenarios [22], [23].

The existing surveys on LMASs primarily focus on the LFM-based paradigm and summarize system architectures, coordination mechanisms, and applications [8], [20], [24], [19], [25]. Unlike these surveys that examine LMASs in isolation, we propose a unified perspective bridging CMASs and LMASs. LMASs are not a replacement for CMASs,

As shown in Fig. 2, this section introduces CMASs from four aspects: perception, communication, decision-making, and control. The first two address information acquisition and dissemination, while the latter enable distributed reasoning and coordinated actions under given objectives. This four-dimensional framew
