行业研究公司研究宏观策略财报招股书会议纪要 Token 低空经济十五五 AIGC 大模型

LLM援助是否改善了医疗服务？现场医生和实验室测试进行评估（英文）

医药生物 2026-01-01 世界银行小烨

本研究评估了大型语言模型（LLM）决策支持对尼日利亚两家门诊诊所患者护理的影响。研究发现，LLM 辅助护理在改善基层卫生工作者决策方面存在一些局限性。

研究设计
研究在尼日利亚卡诺州的两家诊所进行，参与者为社区卫生推广工作者（CHEWs）和现场医生（MOs）。CHEWs 首先制定初步护理计划，然后根据 LLM 反馈进行修订，形成配对的未辅助和辅助护理记录。所有患者随后由 MOs 进行独立评估和治疗。

主要发现

卫生工作者对 LLM 反馈的反应：LLM 辅助护理导致卫生工作者显著修改诊断（41% 的记录）、测试订单（33% 的记录）和处方决策（54% 的记录）。卫生工作者普遍认为 LLM 反馈有助于改善患者护理。
医生评估：MOs 对辅助护理计划的评估结果更为复杂。虽然 LLM 反馈有助于减少部分护理计划中的错误和潜在危害，但并未显著提高与 MOs 护理计划的符合度或客观测试结果。
测试和治疗方案分配：LLM 对疟疾的检测率有积极影响，但对贫血和尿路感染检测率的增加大部分是无效的。LLM 辅助护理并未显著改善治疗分配。
LLM 反馈分析：LLM 平均每个患者提出 3.75 条建议，其中约 53% 的建议使护理计划更符合医生的要求。然而，卫生工作者仅采纳了约三分之一的建议，且对改善与医生符合度的建议的采纳率仅略高于随机水平。

结论
尽管 LLM 辅助护理受到卫生工作者欢迎，并在部分情况下改善了护理质量，但研究结果表明，LLM 辅助护理在提高与高级别医疗提供者护理计划的符合度或客观测试结果方面并未产生显著效益。因此，LLM 辅助护理在低收入和中等收入国家尚未成为公共卫生优先事项。

Policy Research Working Paper Does LLM Assistance Improve HealthcareDelivery? An Evaluation Using On-Site Physiciansand Laboratory Tests Jason AbaluckRobert PlessNirmal RaviAnja SautmannAaron Schwartz Policy Research Working Paper11298 Abstract In a selected sample, retrospective review by academicphysicians also suggested improvements in care related tolong-term risk management. However, the three metricsshow mixed effects of LLM-assistance, with on averageno significant improvement in diagnostic alignment withphysicians, detection rates for the tested conditions, or phy-sician subjective assessments. Health workers follow LLM This study tests the effects of large language model (LLM)decision support on patient care at two outpatient clinicsin Nigeria. Health workers were given the option to makerevisions to their initial care plan based on LLM feedback.The unassisted and assisted plans are evaluated using (1)comparisons with independent care plans created by on-sitephysicians, (2) laboratory tests for malaria, anemia, and This paper is a product of the Development Research Group, Development Economics. It is part of a larger effort by theWorld Bank to provide open access to its research and make a contribution to development policy discussions aroundthe world. Policy Research Working Papers are also posted on the Web at http://www.worldbank.org/prwp. The authors The Policy Research Working Paper Series disseminates the findings of work in progress to encourage the exchange of ideas about developmentissues. An objective of the series is to get the findings out quickly, even if the presentations are less than fully polished. The papers carry thenames of the authors and should be cited accordingly. The findings, interpretations, and conclusions expressed in this paper are entirely those DoesLLMAssistanceImproveHealthcareDelivery? AnEvaluationUsingOn-SitePhysiciansandLaboratoryTests JasonAbaluck,†RobertPless,‡NirmalRavi,§AnjaSautmann,¶AaronSchwartz‖ JEL: I10, O12, O15; keywords: LLM, primary care, health care quality 1Introduction Large language models (LLMs) have the potential to improve health provider decision-making withoutrequiring substantial infrastructure to train and deploy (Pressman et al., 2024). LLMs have shown diagnosticperformance comparable to general physicians on written tests and in simulated patient encounters such asvignettes or model patients (Takita et al., 2025; Huang et al., 2024; Tu et al., 2025). A smaller number ofstudies use real patient information, and very few test LLM decision support in a realistic clinical environment et al., 2024). In both, the evaluation of quality of care with and without LLM is done via retrospective chart We build on the existing literature by evaluating the (subjective and objective) alignment of LLM-assisted care with care provided by higher-level providers, as well as with medical test results. We evaluatea prototype intervention in two outpatient clinics in Kano, Nigeria, in which an LLM gives health workersan instant “second opinion” on their care plans.We purposely did not modify the LLM beyond prompt Our design allows us to compare unassisted and LLM-assisted care plans through three complementarylenses: (1) concordance with independent care plans created by on-site physicians, (2) agreement of testingand treatment decisions with laboratory test results for the three most commonly tested conditions (malaria, In any of our metrics of care plan quality, health workers’ care plansprior to LLM assistance showsubstantial deficits.LLM assistance led health workers to meaningfully revise diagnoses (41% of notes), test ordering (33% of notes), and prescribing decisions (54% of notes).The health workers themselvesoverwhelmingly reported that they found the LLM feedback helpful. However, the on-site physicians did notevaluate the assisted care plans more positively, and the assisted plans did not objectively resemble physician Three academic physicians (MDs) who teach in health worker degree programs retrospectively reviewedthe case records of a selected subsample with high physician-assessed patient harms in the unassisted careplan. For these cases, the academic reviewers rated the LLM-assisted notes relatively more favorably than When analyzing the LLM feedback itself, we find that the LLM makes 3.75 recommendations per patient,and these recommendations were about equally likely to increase or decrease alignment with physician testingdecisions and prescriptions, although they better aligned with physician behavioral advice. Health workers In summary, our results show that LLM assistance is welcomed and accepted by frontline health workers,who make significant changes to their care plans in response. We also find some indication of improvementsin care from retrospective chart reviews (supplemented by access to the physician’s patient record and careplan) by academic physicians, possibly driven by better care for chronic degenerative

点击免费查看完整报告

LLM援助是否改善了医疗服务？现场医生和实验室测试进行评估（英文）

你可能感兴趣

改善卫生服务提供：使用患者途径分析进行以人为本的卫生系统评估——第2卷：患者途径分析介绍以及来自现场和已发表文献的实例2025

塞舌尔：技术援助报告-宏观审慎压力测试和气候风险评估

一年：新冠肺炎是否阻止了GBS和共享服务的发展？

使用salesforce改善现场服务运营和客户服务

与康美进行供应链、渠道和医疗服务合作，释放公司巨大潜力

移民社区需求评估：探讨移民社区在寻求癌症和医疗服务方面的经验

埃塞俄比亚奥罗米亚州和南方各族州地区通过综合基本社会服务和社会现金转移试点计划（IN-SCT）改善营养的影响评估：底线影响评估报告

2020E NP同比增长> 50％;评估改善第三方扩展和增值服务