
Global Software: Generative AI 401: Agents (Under the Hood)

Information Technology 2026-03-25 - Bernstein Andy Yang 杨敏

Key Takeaways

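**Generative AI agents (Agentic AI)** are an application-layer technology built on top of foundation large language models (LLMs). They address the limitations of a basic LLM: no memory, stale information, no ability to perform actions, and frequent hallucinations.
Agents make up for these shortcomings by supplying the LLM with memory, domain knowledge, context, data, tools, and rules, so that it can handle complex tasks better.
An agent works by feeding the LLM everything it needs (user information, business context, step-by-step procedures, tool usage instructions, relevant data, and so on) as "context". Because the context window is limited, this information has to be organized into separate modules: short-term memory, long-term memory, tools, and rules/workflows/skills.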
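The idea of packing separate modules into one limited context window can be sketched as follows. This is a minimal illustration rather than the report's method: the `AgentContext` class, the word-count `count_tokens` stand-in for a tokenizer, and the priority order (rules and tools always kept, then long-term knowledge, then the most recent conversation turns) are all assumptions made for the example.

```python
from dataclasses import dataclass, field


def count_tokens(text: str) -> int:
    """Crude stand-in for a tokenizer: one token per whitespace-separated word."""
    return len(text.split())


@dataclass
class AgentContext:
    """Hypothetical container for the context modules named in the text."""
    rules: str                                            # system prompt / workflow rules
    tools: list[str] = field(default_factory=list)        # tool descriptions
    long_term: list[str] = field(default_factory=list)    # retrieved knowledge, most relevant first
    short_term: list[str] = field(default_factory=list)   # conversation turns, oldest first

    def build(self, user_input: str, budget: int) -> str:
        """Pack the modules into one prompt without exceeding `budget` tokens."""
        # Rules, tool descriptions and the new user input are always kept.
        fixed = [self.rules, *self.tools, user_input]
        remaining = budget - sum(count_tokens(part) for part in fixed)
        if remaining < 0:
            raise ValueError("budget too small for rules, tools and user input")

        # Long-term memory: take snippets in relevance order while they fit.
        knowledge = []
        for snippet in self.long_term:
            cost = count_tokens(snippet)
            if cost > remaining:
                break
            knowledge.append(snippet)
            remaining -= cost

        # Short-term memory: keep the most recent turns that still fit.
        history = []
        for turn in reversed(self.short_term):
            cost = count_tokens(turn)
            if cost > remaining:
                break
            history.insert(0, turn)
            remaining -= cost

        return "\n".join([self.rules, *self.tools, *knowledge, *history, user_input])
```

With a small budget, the oldest conversation turns are the first thing dropped, which is one simple way to trade history for room. A real agent would use the model's own tokenizer and a retrieval step to rank the long-term snippets.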

Key Data and Research Conclusions

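Evolution of agent components: Agent components are evolving fast, but the core theme is building around the LLM rather than inside it. Agents can therefore unlock model capability, but that does not imply the model itself has improved.
From general to specialized: Compared with the more general approach of the pretraining scaling era, building agents calls for more specialized methods and a large amount of data curation, design, and infrastructure work, which may lengthen the time between breakthroughs.
Rising CPU usage: Unlike pure LLM compute, which is predominantly GPU-based, many agent steps run on CPUs. As agentic AI adoption grows, CPU consumption will grow with it, a possible tailwind for the hyperscalers.
Importance of data infrastructure: Building effective agents requires significant data infrastructure work, so many companies will need to commit more resources to support agent development.
Challenges of multi-agent systems: As tasks become more complex, they have to be split across multiple agents, which raises new challenges such as the need for agents to coordinate with one another.
Development timeline for custom agents: Because setup is complex, custom agents will take longer to develop than many expect and will require more experts.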

Agent Components in Detail

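Memory: Covers short-term and long-term memory, used to store conversation history and to retrieve relevant knowledge.
Tools and MCP: Tools let the LLM perform actions. MCP (Model Context Protocol) provides a standardized way for LLMs to interact with third-party applications.
Workflows/skills/system prompts: Defining rules, workflows, skills, and system prompts improves an agent's accuracy and efficiency and strikes a balance between flexibility and consistency.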
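To make the tool component concrete, here is a rough sketch of how an agent runtime might describe a tool to the model and then execute the call the model emits. Everything named here is hypothetical: the `lookup_order` tool, its in-memory order data, the `TOOLS` registry layout (loosely modeled on JSON Schema), and the `{"name": ..., "arguments": ...}` call format. It does not implement MCP itself, which standardizes how such descriptions and calls are exchanged with third-party applications.

```python
import json


def lookup_order(order_id: str) -> dict:
    """Hypothetical tool: a real agent would query an order system here."""
    orders = {"A100": {"status": "shipped"}}
    return orders.get(order_id, {"status": "unknown"})


# Tool descriptions shown to the model, plus the function that backs each one.
TOOLS = {
    "lookup_order": {
        "description": "Return the status of an order.",
        "parameters": {
            "type": "object",
            "properties": {"order_id": {"type": "string"}},
            "required": ["order_id"],
        },
        "handler": lookup_order,
    }
}


def dispatch(model_output: str) -> str:
    """Parse a tool call emitted by the model, check it, run it, return JSON."""
    call = json.loads(model_output)
    tool = TOOLS.get(call.get("name"))
    if tool is None:
        # Deterministic guardrail: the model may only call registered tools.
        return json.dumps({"error": f"unknown tool: {call.get('name')}"})
    args = call.get("arguments", {})
    missing = [p for p in tool["parameters"]["required"] if p not in args]
    if missing:
        return json.dumps({"error": f"missing arguments: {missing}"})
    return json.dumps(tool["handler"](**args))


# Example: the model asked for an order lookup; the result is fed back as context.
result = dispatch('{"name": "lookup_order", "arguments": {"order_id": "A100"}}')
```

The checks in `dispatch` are where rules add consistency: the model decides when to call a tool, while the surrounding code decides which calls are allowed to run.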

Outlook

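Long-context challenge: As agentic AI develops, managing and optimizing context space will become a major challenge.
Balancing determinism and non-determinism: Finding the right balance between determinism (consistency) and non-determinism (flexibility) is critical for each type of workflow.
Multi-agent systems: As the technology matures, more complex multi-agent systems will emerge.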

Generative AI 401: Agents (under the hood)

Mark L. Moerdler, Ph.D. +19173448506 mark.moerdler@bernsteinsg.com
Firoz Vallji, CFA +19173448316 firoz.vallji@bernsteinsg.com
Shelly Tang, CFA +19173448342 shelly.tang@bernsteinsg.com

Although the concept of AI Agents started to popularize around late 2023/2024, its potential really came to life recently with the release of products like Claude Code/Cowork, OpenAI Codex/Agent Builder, and OpenClaw. In this primer, given the amount of technicalities, we try to peel off the jargon and packaging of agentic systems and explain what is going on under the hood.

AI agents are built on top of the basic LLM models and are likely to become an important part of the application layer of the AI software tech stack. Compared to a basic LLM that cannot remember, has stale information, cannot perform actions and hallucinates frequently, Agentic AI remedies many of these issues by providing the LLM memory, domain knowledge, context, data, tools, rules and guidelines.

The easiest way to understand how agents work is this: During inference, the basic LLM does not have any knowledge beyond its training data and user input, so how to supply the LLM more context to help it succeed at a task? The answer is to dump everything it needs to know in the input including: user information, business context, how to break down the steps, what tools are required and how to use them, and relevant data such as login, password, customer and order details etc. This is known as the context.

However, the context window space is limited, so the brute force way of ingesting all the information in the input soon feels overwhelming as the variety of tasks and complexity [...] have parsed out parts of the context into different modules. Namely, short-term memory, long-term memory, Tools and rules/Workflows/Skills.

These agentic components are evolving fast and might morph into something else tomorrow. However, the key themes underpinning this era of Gen AI development is that 1) we are moving from building inside an LLM to building around an LLM, which means that agentic AI could unlock model capability but does not imply model improvement; 2) we are [...] task performance. The amount of design and engineering choices that go into building an agent, we believe, signals that step-function breakthroughs might take longer than during the pretraining scaling era.

We are excited about the future of Agentic AI, and believe that by finding the perfect balance between determinism (consistency) and non-determinism (flexibility) for each type of workflow, agentic AI can open up a lot more new use cases for software. At the same time, it is important to recognize the amount of complexity and work that goes into setting [...] long context performance and multi-agent communications, we believe the development timeline for customized agents will take time, longer than many think and will require more experts (thus the need for Forward Deployed Engineers).

INVESTMENT IMPLICATIONS

Agentic AI represents the next wave of innovation in Generative AI technology and is likely to become an important part of the application layer of the AI software tech stack (versus LLM's and agentic development tools which will become part of the PaaS layer). We will see more agents to be rolled out at the application software companies in our coverage. We believe having customer data and domain knowledge expertise are key advantages for these companies, but in some use cases building agentic AI could require a lot more efforts than many imagined so it is likely to take longer.

Unlike pure LLM where compute is predominantly GPU-based, many of the agentic steps will be performed on the CPU's. As agentic AI adoption increases we should see an increase in CPU consumption - a possible tailwind for the hyperscalers such as Microsoft and Oracle within our coverage, driving both additional revenue and incrementally higher gross margins for AI workloads over training or pure inferencing.

Building effective agents also require significant groundwork in data infrastructure. Hence, we see many names in our [...] will also use more data driving an incremental tailwind for Cloud database vendors such as Microsoft, Oracle, MongoDB and Snowflake.

The complexity of building Agents is also driving the need for more high level consultants and Forward Deployed Engineers as discussed in Forward Deployed Engineering (FDE): Where AI software meets the real world.

Table Of Contents

High-level Summary and lingering food for thought
How did we go from the 2022 ChatGPT to agents today?
A very stripped-down way of looking at agent
Memory: Keeping track of history
Memory: Retrieve additional knowledge
Tools and MCP: Enable actions
Agent Tool schema
What is an MCP?
How LLMs know when and which tool to call
Workflow/skills/system prompts: Improve accuracy and efficiency

DETAILS

After ChatGPT's meteoric rise to popularity in late 2022/early 2023, many have wondered what the next killer product /capabi
