AI inference in practice: new intelligence from the hospital floor

INSIGHT SPOTLIGHT

Inferencing is the real-time decision-making of an AI model. As AI adoption grows, inferencing will accelerate, raising questions about workload processing and business benefits. As outlined in Distributed inference: how AI can…, where inference workloads run has implications for application performance and data sovereignty, among other factors. This research forms part of a series illustrating the impact of AI inference, with each report focusing on a distinct edge location and featuring an example company.

Analysis

Like many organisations, hospitals are under growing pressure to improve efficiency while maintaining service quality. New technology can offer a way to achieve this. For example, Chooch's AI-powered solution provides real-time supply chain monitoring and safety enforcement. It can be used to detect missing personal protective equipment (PPE) and empty or understocked supplies.

Real-time performance and resilience

Running AI inference at the edge avoids sending data to remote data centres to be processed. This is key to addressing the compute latency challenge posed by large-scale AI workloads, delivering improvements in AI model response times (compute latency, not traffic latency, is the issue). For example, Chooch's solution processes data locally at the edge using the NVIDIA Jetson platform.

Comparison of AI inferencing options for inventory management
Scores based on typical performance characteristics for inventory management deployments at a hospital. 1 = least favourable; 5 = most favourable.
Source: GSMA Intelligence

Edge inferencing can also improve an organisation's resilience, since AI workloads do not depend on a single point of failure such as a data centre outage or connectivity loss. This is critical in settings such as hospitals, where AI solutions must be able to run without downtime. Chooch's solution runs all critical inferencing on-premises, supported by failover and load balancing.

Control, compliance and costs

Running AI workloads at the edge helps organisations meet data sovereignty requirements, which place restrictions on how data is processed and stored. The solutions offered by Chooch (among other providers) run AI models directly at the local edge using GPU-accelerated computing, and process data within the hospital's firewall. This ensures patient information stays on-site, simplifying data governance and regulatory compliance.

Edge inferencing can also deliver cost savings for organisations. This primarily stems from the use of edge-centric, GPU-accelerated computing instead of cloud-based computing, as well as the lower bandwidth and storage costs from processing data locally. Data provided to GSMA Intelligence by Chooch and other providers indicates that some hospitals have reduced spend on network infrastructure as a result.

Implications

Mobile operators

• Right network, right place – In several countries, a first-mover advantage remains for operators seeking to monetise the enterprise segment using AI inference. The on-premises environment is a localised setting often requiring network capabilities for high-grade enterprise applications. Edge inference plays to this objective by reducing compute latency and pound-for-pound costs.
• Know your vertical – It pays to understand individual customer segments. The dialogue on selling into enterprises is unfortunately often generalised, with verticals viewed as part of a single group, despite often having very different operating environments and digital maturities. Dell and NVIDIA have enterprise expertise, showing how best to apply what are a complex set of capabilities.
• Sales strategy – The complex value chain for selling inference capabilities to enterprise buyers means operators would be well served to review and (in some cases) redesign sales incentive programmes. These should encompass the full spectrum of sales, including pre-sales, customer value, customer success and channel partners. Finally, a more flexible pricing approach, playing to how enterprises prefer to buy, would also help.

Network equipment vendors

• AI-RAN is coming – Edge inferencing is part of a wider story of AI becoming part of the telco network fabric. In practice, this means a unified RAN and edge AI capability known as AI-RAN – see NVIDIA's What is AI-RAN? This is evident from the product upgrades from all the global network equipment makers (e.g. Ericsson, Nokia and Huawei).
• Competitive positioning – The infrastructure business is changing. 4G and early 5G sales five years ago were largely based on the notion that evolutionary upgrades would be sufficient to underpin a new revenue growth story for telco customers. This has played out only marginally, and is still primarily driven by 5G consumer upgrades feeding through the base over time, rather than a product reset in enterprise. GenAI and inference therefore come at an important moment.

This Spotlight forms part of a GSMA Intelligence research series on AI inference in the telecoms industry, supported by Dell Technologies (see Dell AI for Telecom).

Related reading

AI inference in practice: time is money
AI inference in practice: choosing the

Authors

Tim Hatt, Head of Research and Consulting
James Joiner, Lead Analyst