行业研究公司研究宏观策略财报招股书会议纪要 Token 低空经济十五五 AIGC 大模型

China's Internet: Strategic Insights from AI Model Architecture

信息技术 2026-03-30 - 伯恩斯坦 Zt

核心观点与关键数据

人工智能模型架构选择与战略定位：中国顶级AI实验室在模型架构选择上存在差异，反映了其战略定位和市场竞争策略。Minimax采用小型模型和MoE架构，专注于低成本代理AI推理；Z.ai的GLM5模型更大，更注重通用推理能力和幻觉控制；阿里巴巴的Qwen系列模型则提供广泛的多模型，旨在满足广泛的AI计算需求。
KV缓存使用与定价策略：Minimax通过降低缓存读写代币价格鼓励缓存使用，降低推理成本并提高首次输出代币时间；Z.ai的GLM5模型对缓存读取代币的收费与标准输入代币价格相同。Minimax的缓存命中率远高于Z.ai，反映了其模型设计对缓存利用的优化。
AI发展与成本：AI发展与成本持续占据投资者讨论焦点。阿里巴巴、腾讯和百度都宣布了AI云业务的提价，独立计算供应商的AI服务器租赁报价也指向相同方向。预计更高的计算成本将影响推理利润和训练成本的增长。
估值与市场表现：预计Minimax和Z.ai的股价主要由产品发布和年度经常性收入增长驱使，但研发费用增长预期可能被上调。预计2026年现存的计算成本将逐渐贬值，但更像是平稳或更好的情况更像是基本案例。

研究结论

长期竞争格局：在长期时间范围内，以通用推理能力和可靠性构建的市场地位将更持久。除非“足够好”的人工智能计算市场变成一个存在多个竞争对手愿意为市场份额而牺牲利润的行业，否则现有企业将能够访问更广泛的企业发展路径，从而做得更好。
代理AI市场：代理AI的采用将继续由病毒式消费者成功、高级用户和具有竞争力的企业驱动。预计“低成本代理商后端”的定位将变得越来越拥挤，既来自竞争的中国模型制造商，也来自全球领先者发布的以闪存风格的前沿模型版本。
计算成本上升：日益增长的计算成本引发了关于人工智能实验室研发支出预期的质疑。预计计算成本的上升将支持超大规模计算的增长，但也可能成为Minimax和Z.ai等AI实验室培训和推理成本压力的来源。
中国AI模型发展：中国顶级AI实验室虽然得益于像蒸馏等开发技术，但仍然很好地与世界最先进技术水平保持同步。小米和Stepfun能够接近中国智能前沿，这对当前被认为是“落后者”的其他人具有有趣的含义。
OpenClaw市场：OpenClaw通过模型在OpenRouter上共享代币股份，Minimax M2.5和其他中国型号能成为OpenClaw最受欢迎的模型之一，反映了其低成本代理工具的优化。
阿里巴巴Qwen策略：阿里巴巴的Qwen策略更加注重提供不同模型尺寸和模态的一整套模型，包括小型设备模型、中型企业部署模型、大型通用推理模型，以及针对不同专业化的智能代理AI服务，反映了公司捕捉尽可能广泛的AI计算需求的策略。
AI应用发展：开发AI工具以允许用户检索信息和编排更复杂的任务已成为焦点，尽管这种行为的增加性相对于现有功能的替代程度尚不明确。安全和权限始终是关键关注的主题，这有助于推动OpenClaw类型技术的采用。

China Internet: The strategic implications of AI model Strategic choices drive AI model architecture.AI development and costs continue todominate investor discussions across our China Internet coverage. This note is intended asa low-jargon discussion on the model design choices of China’s top AI labs, and what they Robin Zhu+852 2123 2659robin.zhu@bernsteinsg.com Charles Gou+852 2123 2618charles.gou@bernsteinsg.com A brief primer on AI model architectures, KV cache use, RL.Global AI modeldevelopers have increasingly adopted MoE architectures in recent years, where onlysmall sections of parameters in a large model are activated per token - depending onspecialisation on different vertical domains, languages, or skill sets. The key values (KV) Min-Joo Kang+852 2123 2644minjoo.kang@bernsteinsg.com Choices along the cost vs. performance spectrum.Across the top Chinese AI labs,Minimax stands out for offering a smaller model optimised for low active parameter scaleper token, reinforcement learning frameworks that prioritise agentic tool use, while thecompany’s pricing strategy incentivises high KV cache usage. Zhipu’s GLM5 model islarger and benchmarks better on general reasoning, coding capabilities, and hallucination Thoughts on the adoption curve.Year to date, the M2.5 model's optimisation for lowcost agentic use has made it one of the most popular models used to support OpenClaw.Z.ai’s focus on leading edge reasoning capabilities and reliability aligns well with thecompany’s more academic background, and focus on enterprise use cases where reliabilityis key. Thinking from the perspective of early adopter cohorts and growth S-curves, heavy Competition, and model commoditisation.Over long time horizons, our bias is thatmarket positions built around general reasoning strength and reliability, and specialisttask completion will prove more durable, while the “low-cost agentic back-end” corner ofthe market becomes more crowded with competition from both Chinese devs (includingthe independent AI labs but also the Internet platforms seeking to develop consumer use Is 20-30% training cost growth going to be enough?Alibaba, Tencent, and Baidu allannounced price hikes in their respective AI cloud units, while AI server rental quotes fromindependent compute suppliers have pointed in the same direction. Alibaba managementhinted that ongoing market tightness could support further price hikes this year. In contrast, BERNSTEIN TICKER TABLE INVESTMENT IMPLICATIONS AI development and costs continue to dominate investor discussions across our China Internet coverage. In this note we’veoutlined some observations about differences in AI model architecture choices across the leading Chinese AI labs, andwhat these choices say about developer market positioning and competitive strategy. While Minimax’s recent (M2+) modelreleases have been optimised for low-cost agentic tool use, Z.ai’s GLM5 model was much more focused on general reasoningcapabilities, and hallucination control. Alibaba’s strategy for its Qwen family of models meanwhile has been to offer a broad Over long time horizons, we expect leading edge general reasoning and specialised task completion capabilities to representmore defensive competitive positions than a low-cost, “good enough” agentic AI backbone… unless the latter is embeddedwithin a large consumer-facing ecosystem. For the latter, our bias remains that most consumers care more about “getting stuff To date, the top Chinese AI labs have done a good job keeping pace with the global SOTA, albeit with help from developmenttechniques like distillation. As agentic workflows become more complex, and task completion horizons lengthen, the possibilitythat the latter becomes less effective will be important to monitor. Nearer-term, we’d expect the rising cost of compute (e.g. seeAlicloud and Tencent price hikes) to support hyperscaler growth - but serve as a source of training and inference cost pressure VALUATION COMPS TABLE DETAILS A PRIMER ON MODEL AI ARCHITECTURE… AND STRATEGIC IMPLICATIONS Research on AI development has dominated our research bandwidth year to date. Our discussions with investors havecontinued to focus heavily on the strategic implications of OpenClaw adoption on our large cap coverage (e.g. Tencent, Alibaba),and the growth of AI model companies like Minimax and Z.ai. A common thread in these conversations though has been atendency for investors to treat “AI models” as monolithic products… and treat OpenRouter data almost like Sensor Tower, asthe arbiter of top-line traction. This note is intended as a basic, low-jargon primer of AI models, focused on key aspects which Bigger is usually better… but there are trade-offs At a high level, frontier AI models are next-token predictors that are trained on large training datasets, which try to predict themost optimal responses to user prompts. Frontier AI models have mainly scaled over time by adding (1) parameter count; (2)adding exper

点击免费查看完整报告

China's Internet: Strategic Insights from AI Model Architecture

核心观点与关键数据

研究结论

你可能感兴趣

Chinese Internet: Strategic Significance of AI Model Architecture

50 CLIMATE SOLUTIONS FROM CITIES IN THE PEOPLE’S REPUBLIC OF CHINA

China's Retailers and the Coronavirus Outbreak: Lessons from the Past

China Inside Out: Capital account reforms: From people’s bank to people’s hands

From SNEC 2026: Key Points of China's Solar Energy Industry Development

Navigating Tariff Uncertainty: Strategic Insights for Electrical Industry Leaders

【T112017-技术驱动未来分会场】CNN Architecture Design - From Deeper to Wider

Buy:China’s AI leader in pole position for autonomous driving

英伟达 CEO 黄仁勋计划在推出中国特供版 AI 芯片前访问北京 — Nvidia’s Jensen Huang plans Beijing trip ahead of new China AI chip launch20250710

Strategic shift from scale to profitability