行业研究公司研究宏观策略财报招股书会议纪要海南封关低空经济 DeepSeek AIGC 大模型

智能体与智能体架构导论

信息技术2025-11-25-谷歌洪

AI智能总结

AI 代理是语言模型的自然演进，能够自主完成目标，结合了语言模型的推理能力和实际行动能力。本文档作为五部分系列的第一部分，为开发者、架构师和产品领导提供了从概念验证到生产级代理系统的正式指南。

AI 代理的核心组成部分：

模型（大脑）：核心语言模型或基础模型，作为代理的推理引擎，用于处理信息、评估选项和做出决策。
工具（双手）：连接代理的推理和外部世界，使其能够执行超越文本生成的操作。包括 API 扩展、代码函数和数据存储（如数据库或向量存储）。
编排层（神经系统）：管理代理的操作循环，处理计划、记忆（状态）和推理，并赋予代理记忆能力。

AI 代理的解决问题流程：

获取任务：代理接收一个特定的、高级的目标。
扫描环境：代理感知其环境以收集上下文信息。
思考：代理通过推理模型分析任务和环境，制定计划。
采取行动：编排层执行计划的第一步，选择并调用合适的工具。
观察和迭代：代理观察行动的结果，并将其添加到上下文中，然后重复循环。

AI 代理的分类：

Level 0：核心推理系统：仅包含语言模型，没有工具、记忆或工具调用能力。
Level 1：连接型问题解决者：语言模型连接到外部工具，能够执行简单任务。
Level 2：策略型问题解决者：能够战略性地规划复杂的多步骤目标，并进行上下文工程。
Level 3：协作型多代理系统：由多个专门代理组成的团队，通过协作完成复杂任务。
Level 4：自进化系统：能够识别自身能力差距并动态创建和适应新代理。

AI 代理的架构设计：

模型：选择合适的模型，考虑其推理能力、成本和速度。
工具：定义工具接口，包括信息检索工具和行动执行工具。
编排层：运行“思考、行动、观察”循环，管理代理行为。

AI 代理的设计选择：

自主程度：确定代理的自主程度，从完全确定的工作流到高度自主的代理。
实现方法：选择无代码构建器或代码优先框架。
领域知识和角色：使用系统提示为代理提供领域知识和角色。
上下文增强：使用短期记忆和长期记忆来增强上下文。
多代理系统和设计模式：采用“专家团队”方法，并使用协调器模式、顺序模式等设计模式。

AI 代理的部署和服务：

将代理部署到服务器，并集成必要的服务。
使用专门的部署选项或框架提供的部署命令。
采用新的测试方法，以适应代理的随机性。

AI 代理的运维（Agent Ops）：

衡量重要指标：定义关键绩效指标（KPI）来衡量代理的价值。
质量而非通过/失败：使用语言模型作为裁判来评估代理的输出质量。
指标驱动开发：使用自动化评估来测试开发代理的更改。
使用 OpenTelemetry 跟踪进行调试：通过跟踪来理解代理行为。
珍惜人类反馈：收集和分析人类反馈以改进代理。

AI 代理的互操作性：

代理与人类：通过用户界面进行交互，包括聊天机器人、计算机使用和实时多模态通信。
代理与代理：使用 Agent2Agent (A2A) 协议进行通信。
代理与金钱：使用 Agent Payments Protocol (AP2) 进行交易。

AI 代理的安全性：

单个代理的安全：在效用和安全性之间进行权衡，并采用多层防御策略。
代理身份：为代理创建一个独特的身份，并实施访问控制策略。
策略：限制代理的能力，并实施策略来控制代理的访问。
ADK 代理的安全：使用身份、策略和工具保护来保护 ADK 代理。
动态安全：使用回调和插件来实施动态安全检查。
成本和可靠性：设计可靠且经济高效的代理。

AI 代理的治理：

安全性和隐私：采用纵深防御策略来保护数据和代理。
代理治理：实施控制平面来管理代理生态系统，防止代理蔓延。

AI 代理的成本和可靠性：

基础设施：选择合适的底层基础设施来支持代理的运行。
成本：优化代理的成本，并确保其经济高效。
可靠性：确保代理的可靠性，并提供高可用性。

AI 代理的自学习和自进化：

学习来源：从运行时经验和外部信号中学习。
自进化机制：优化上下文工程、工具优化和创建。

AI 代理的未来：

模拟和代理健身房：提供一个独立的平台，用于模拟和优化代理。
Co-Scientist：一个高级 AI 代理，用于加速科学发现。
AlphaEvolve Agent：一个 AI 代理，用于发现和优化算法。

AI 代理代表了人工智能的重大进步，将人工智能从被动的内容创建工具转变为积极的自主合作伙伴。通过遵循本文档中概述的原则和架构模式，我们可以构建真正协作、强大和适应性强的代理，为企业和个人带来巨大的价值。

Authors: Alan Blount, Antonio Gulli, Shubham Saboo,Michael Zimmermann, and Vladimir Vuskovic Content contributors Enrique ChanMike ClarkDerek Egan Curators and editorsAnant Nawalgaria Designer Table of contents Table of contents The Orchestration Layer22Core Design Choices23Instruct with Domain Knowledge and Persona23Augment with Context24Multi-Agent Systems and Design Patterns24Agent Deployment and Services26Agent Ops: A Structured Approach to the Unpredictable27Measure What Matters: Instrumenting Success Like an A/B Experiment29Quality Instead of Pass/Fail: Using a LM Judge29 Table of contents Agents are the natural evolutionof Language Models, made useful From Predictive AI toAutonomous Agents Artificial intelligence is changing. For years, the focus has been on models that excel atpassive, discrete tasks: answering a question, translating text, or generating an image froma prompt. This paradigm, while powerful, requires constant human direction for every step. This new frontier is built around AI agents. An agent is not simply an AI model in a staticworkflow; it's a complete application, making plans and taking actions to achieve goals. Itcombines a Language Model's (LM) ability toreasonwith the practical ability toact, allowing it to handle complex, multi-step tasks that a model alone cannot. The critical capability is thatagents can work on their own, figuring out the next steps needed to reach a goal without a This document is the first in a five-part series, acting as a formal guide for the developers,architects, and product leaders transitioning from proofs-of-concept to robust,production-grade agentic systems. While building a simple prototype is straightforward, •Core Anatomy:Deconstructing an agent into its three essential components: the •A Taxonomy of Capabilities:Classifying agents from simple, connected problem-solvers •Architectural Design:Diving into the practical design considerations for each •Building for Production:Establishing the Agent Ops discipline needed to evaluate,debug, secure, and scale agentic systems from a single instance to a fleet with Building on the previousAgents whitepaper1andAgent Companion2; this guide providesthe foundational concepts and strategic frameworks you will need to successfully build, Words are insufficient to describe how humans interact with AI. We tend toanthropomorphize and use human terms like “think” and “reason” and “know.” We don'tyet have words for "know with semantic meaning" vs "know with high probability of In the simplest terms, an AI Agent can be defined as the combination of models, tools, anorchestration layer, and runtime services which uses the LM in a loop to accomplish a goal. •The Model (The "Brain"):The core language model (LM) or foundation model that servesas the agent's central reasoning engine to process information, evaluate options, andmake decisions. The type of model (general-purpose, fine-tuned, or multimodal) dictates •Tools (The "Hands"):These mechanisms connect the agent's reasoning to the outsideworld, enabling actions beyond text generation. They include API extensions, codefunctions, and data stores (like databases or vector stores) for accessing real-time, factual •The Orchestration Layer (The "Nervous System"):The governing process thatmanages the agent's operational loop. It handles planning, memory (state), and reasoning Chain-of-Thought4orReAct5) to break down complex goals into steps and decide whento think versus use a tool. This layer is also responsible for giving agents the memory •Deployment (The "Body and Legs"):While building an agent on a laptop is effective forprototyping, production deployment is what makes it a reliable and accessible service.This involves hosting the agent on a secure, scalable server and integrating it with At the end of the day, building a generative AI agent is a new way to develop solutions tosolve tasks. The traditional developer acts as a "bricklayer," precisely defining every logicalstep. The agent developer, in contrast, is more like a director. Instead of writing explicit code You'll quickly find that an LM's greatest strength—its incredible flexibility—is also your biggestheadache. A large language model's capacity to doanythingmakes it difficult to compel it todoone specific thingreliably and perfectly. What we used to call “prompt engineering” andnow call “context engineering” guides LMs to generate the desired output. For any single Debugging becomes essential when issues arise. "Agent Ops" essentially redefines thefamiliar cycle of measurement, analysis, and system optimization. Through traces and logs,you can monitor the agent's "thought process" to identify deviations from the intended critical components: domain expertise, a defined personality, and seamless integrationwith the tools necessary for practical task completion. It's crucial to remember that When an agent is precisely configured with clear instructions, reliable tools, and a

点击免费查看完整报告

你可能感兴趣

智能体与智能体架构导论

你可能感兴趣

面向新型电力系统的安全韧性AI智能体：架构、关键技术与落地实践

强化学习环境与科学中的强化学习：数据铸造厂与多智能体架构

AI智能体领域前沿技术研究报告：架构、挑战与范式演进

传媒行业周报：大模型升级混合推理架构，智能体能力与开源生态持续发展

AI Infra：加速智能体落地的基础架构发展趋势与产业实践