China Internet: The Strategic Implications of AI Model Architecture

Information Technology 2026-03-30 Bernstein 张博卿

Key Takeaways

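In China's internet sector, each company's choice of AI model architecture reflects its strategic positioning and competitive strategy.
Minimax's models put efficiency first: a small total parameter count and few active parameters, suited to low-cost agentic workloads of moderate complexity.
Z.ai's GLM5 model has a large total parameter count and more active parameters; it emphasises general reasoning capability and reliability, and is suited to large enterprise customers.
Alibaba's Qwen family covers a wide range of model sizes and modalities, with the aim of capturing as broad a range of compute demand as possible.
As agentic workloads become more complex and task completion horizons lengthen, small student models may gradually fall behind frontier models.
Over the long term, market positions built on general reasoning capability and reliability should prove more durable, while the "low-cost agentic back-end" segment will face more intense competition.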

Key Data

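Minimax's M2.5 model has 230 billion total parameters but only 10 billion active parameters.
Z.ai's GLM5 model has about 744 billion total parameters and about 40 billion active parameters.
Minimax's cache read price is about 0.03 yuan per million tokens, far below that of Z.ai's GLM5 model.
Minimax M2.5's cache hit rate is about 70%, higher than the 40% of Z.ai's GLM5 model.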
The three most-used open AI models on OpenRouter all come from Chinese developers.
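Alibaba Cloud's AI compute product prices recently rose by 5-34%.
AI server rental quotes from independent compute suppliers have also trended upward.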
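
The parameter and cache figures above can be put side by side with a minimal Python sketch. The parameter counts, the 0.03 yuan cache read price, and the 70% and 40% hit rates come from the list above. Everything else is an assumption for illustration: the names `MoEModel` and `blended_input_price` are hypothetical, and `UNCACHED_PRICE` is a made-up placeholder, since no uncached input price is given here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MoEModel:
    name: str
    total_params_b: float   # total parameters, in billions
    active_params_b: float  # parameters activated per token, in billions

    @property
    def active_fraction(self) -> float:
        """Share of the model's parameters used for each token."""
        return self.active_params_b / self.total_params_b


def blended_input_price(hit_rate: float, cached_price: float,
                        uncached_price: float) -> float:
    """Average price per million input tokens at a given cache hit rate."""
    if not 0.0 <= hit_rate <= 1.0:
        raise ValueError("hit_rate must be between 0 and 1")
    return hit_rate * cached_price + (1.0 - hit_rate) * uncached_price


m25 = MoEModel("Minimax M2.5", total_params_b=230, active_params_b=10)
glm5 = MoEModel("Z.ai GLM5", total_params_b=744, active_params_b=40)  # approximate

for model in (m25, glm5):
    print(f"{model.name}: {model.active_fraction:.1%} of parameters active per token")

# ASSUMPTION: hypothetical uncached price (yuan per million tokens), not from the source.
UNCACHED_PRICE = 0.30
CACHED_PRICE = 0.03  # Minimax cache read price quoted above

# Hold prices fixed to isolate the effect of the hit rate alone (70% vs. 40%).
for hit_rate in (0.70, 0.40):
    price = blended_input_price(hit_rate, CACHED_PRICE, UNCACHED_PRICE)
    print(f"hit rate {hit_rate:.0%}: blended input price {price:.3f} yuan per million tokens")
```

Both models activate only about 4-5% of their parameters per token, but GLM5 activates four times as many in absolute terms (40 billion vs. 10 billion). If per-token compute scales roughly with active parameters, which is a rule of thumb rather than a claim made in the report, that gap is one plausible driver of the difference in token pricing.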

Research Conclusions

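Rising AI compute costs will put pressure on AI labs' training and inference costs.
Investors' expected R&D cost growth for AI labs (25-30%) may be too conservative.
Alibaba Cloud has a cost advantage in the AI compute market thanks to its scale and infrastructure.
Over the long term, the AI compute market may descend into a price war, in which incumbents with consumer traction and enterprise sales networks will be better positioned.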
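
The sensitivity behind the "too conservative" view can be sketched with simple arithmetic. This is an illustration under stated assumptions, not the report's model. It assumes the 25-30% figure reflects planned compute volume growth at flat unit prices, and that a lab's unit compute price moves by the same 5-34% range quoted for Alibaba Cloud's products. The function name `cost_growth` is hypothetical.

```python
def cost_growth(volume_growth: float, price_change: float) -> float:
    """Total spend growth when compute volume and unit price both change.

    Spend = volume * price, so the two growth rates compound.
    """
    return (1.0 + volume_growth) * (1.0 + price_change) - 1.0


# ASSUMPTION: 25-30% is treated as volume growth; 5-34% as the unit price increase.
for volume_growth in (0.25, 0.30):
    for price_change in (0.05, 0.34):
        total = cost_growth(volume_growth, price_change)
        print(f"volume +{volume_growth:.0%}, price +{price_change:.0%} "
              f"-> spend +{total:.1%}")
```

Under these assumptions, total spend growth lands between roughly 31% and 74%, above the 25-30% range in every case.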

China Internet: The strategic implications of AI model architecture

Robin Zhu, +852 2123 2659, robin.zhu@bernsteinsg.com
Charles Gou, +852 2123 2618, charles.gou@bernsteinsg.com
Min-Joo Kang, +852 2123 2644, minjoo.kang@bernsteinsg.com

Strategic choices drive AI model architecture. AI development and costs continue to dominate investor discussions across our China Internet coverage. This note is intended as a low-jargon discussion on the model design choices of China's top AI labs, and what they reveal about strategic positioning and go-to-market.

A brief primer on AI model architectures, KV cache use, RL. Global AI model developers have increasingly adopted MoE architectures in recent years, where only [...] specialisation on different vertical domains, languages, or skill sets. The key-value (KV) cache meanwhile represents a key feature of AI models which supports reduced memory usage and faster inference. Reinforcement learning serves as a key input into inference performance, and the types of responses considered preferable.

Choices along the cost vs. performance spectrum. Across the top Chinese AI labs [...] per token, reinforcement learning frameworks that prioritise agentic tool use, while the [...] larger and benchmarks better on general reasoning, coding capabilities, and hallucination control... but comes with higher token costs. Qwen's strategy has been to offer a broad range of models aimed at maximising capture of AI compute demand.

Thoughts on the adoption curve. Year to date, the M2.5 model's optimisation for low cost agentic use has made it one of the most popular models used to support OpenClaw [...] company's more academic background, and focus on enterprise use cases where reliability is key. Thinking from the perspective of early adopter cohorts and growth S-curves, heavy use among power users and enterprises strikes us as less vulnerable to a "trough of disillusionment" moment than viral consumer OpenClaw adoption.

Competition, and model commoditisation.
Over long time horizons, our bias is that market positions built around general reasoning strength and reliability, and specialist task completion will prove more durable, while the "low-cost agentic back-end" corner of the market becomes more crowded with competition from both Chinese devs (including the independent AI labs but also the Internet platforms seeking to develop consumer use cases), and flash-style models from the global leaders. Minimax's pivot to leading edge reasoning capabilities for M3 struck us as notable... and necessary.

Is 20-30% training cost growth going to be enough? Alibaba, Tencent, and Baidu all announced price hikes in their respective AI cloud units, while AI server rental quotes from independent compute suppliers have pointed in the same direction. Alibaba management hinted that ongoing market tightness could support further price hikes this year. In contrast, [...] costs to influence inference margins and training cost growth.

INVESTMENT IMPLICATIONS

AI development and costs continue to dominate investor discussions across our China Internet coverage. In this note we've outlined some observations about differences in AI model architecture choices across the leading Chinese AI labs, and what these choices say about developer market positioning and competitive strategy. While Minimax's recent (M2+) model releases have been optimised for low-cost agentic tool use, Z.ai's GLM5 model was much more focused on general reasoning capabilities, and hallucination control. Alibaba's strategy for its Qwen family of models meanwhile has been to offer a broad range of models across model sizes and modalities... to capture as broad of a range of compute use cases as possible, for the purposes of driving demand for MaaS and broader compute demand.
Over long time horizons, we expect leading edge general reasoning and specialised task completion capabilities to represent more defensive competitive positions than a low-cost, "good enough" agentic AI backbone... unless the latter is embedded within a large consumer-facing ecosystem. For the latter, our bias remains that most consumers care more about "getting stuff done" efficiently and cheaply than necessarily differentiating between underlying reasoning capabilities.

To date, the top Chinese AI labs have done a good job keeping pace with the global SOTA, albeit with help from development techniques like distillation. As agentic workflows become more complex, and task completion horizons lengthen, the possibility that the latter becomes less effective will be important to monitor. Nearer-term, we'd expect the rising cost of compute (e.g. see Alicloud and Tencent price hikes) to support hyperscaler growth - but serve as a source of training and inference cost pressure for AI labs like Minimax and Z.ai. The 25-30% training cost growth that investors seem to contemplate for these stocks strikes us as being too low in an environment of rising compute costs.

DETAILS

A PRIMER ON MODEL AI ARCHITECTURE... AND STRATEGIC IMPLICATIONS

Research on AI development has dominated our research bandwidth
