行业研究公司研究宏观策略财报招股书会议纪要 Token 低空经济十五五 AIGC 大模型

弥合差距：援助实效、项目评级与情境化

2025-01-17 - 世界银行&麻省理工苏吃吃

本文探讨了援助有效性问题，研究了捐赠项目与可衡量的部门级发展成果之间的关系。研究发现，项目评级无法提供项目对部门级成果影响的可靠信息，而项目对国家环境的适应程度是项目对发展成果贡献的最强预测因素。此外，在制度薄弱的环境中，大型项目评级与实际影响之间的差异最大。研究还发现，项目后评估的内容可能预测部门成果，即使评级不能。研究使用机器学习方法构建了一个新的数据集，利用大型语言模型对发展项目文件进行编码，以量化项目设计对国家环境的适应程度。研究结果表明，项目设计中对国家环境的适应程度越高，项目对部门成果的贡献就越大。

Diana Goldemberg1, Luke Jordan2,∗, Thomas Kenyon1 Abstract This paper applies novel techniques to long-standing questions of aid effectiveness.Itconstructs a new dataset using machine learning methods to encode aspects of develop-ment project documents that would be infeasible with manual methods. It then uses thatdataset to show that the strongest predictor of these projects’ contribution to develop-ment outcomes is not the self-evaluation ratings assigned by donors, but their degree ofadaptation to country context and that the largest differences between ratings and actualimpact occur in large projects in institutionally weak settings.It also finds suggestiveevidence that the content of ex post reviews of project effectiveness may predict sectoroutcomes, even if ratings do not. Keywords:aid effectiveness, machine learning, World Bank projects JEL Codes: O12, O15, O19 1. Introduction Many empirical studies have explored whether foreign aid effectively improves develop-ment outcomes in recipient countries. This research typically takes one of two approaches.The first looks at the macro-level effect of aid at the country level, assessing its impacton economic growth or sectoral outcomes. The second focuses on the micro-level effectof individual development projects or interventions, often using donors’ self-evaluationratings or randomized control trials to assess their success. In this study, we bridge theseapproaches by examining the relationship between donor-funded projects and measurabledevelopment outcomes at the sectoral level. We begin by asking whether project ratingsprovide a link between project-level and sector-level outcomes and find that they onlyprovide limited insight. We then turn to machine-learning methods, constructing a newdataset utilizing large language models and the texts of over a thousand World Bankprojects.We find that this text analysis produces measurable project features that dopredict sector outcomes. We first replicate previous findings of positive effects of aid on sector outcomes. Wethen use ratings from projects undertaken in 183 developing countries by eight donorssince the 1990s, concentrating on a few service delivery sectors with readily availabledata on beneficiary-level outcomes, introducing aggregates of the ratings as independentvariables to the sector specifications. For World Bank projects, for which more granulardata is available, we create what are called “text embeddings” of project documents usingrecent advances in machine learning models, turning texts into numerical representationsof their similarities and differences. These replicate expert human assessments of projectcharacteristics but at greater accuracy and with far greater efficiency.The dataset weconstruct of the embeddings of project documents is likely to have additional uses and ispublicly available3. Its construction is explained in some detail in the Methods sectionbelow.With this dataset, we are able to quantify the degree to which a project’s coredescription differs from others in its sector and country, deriving a new measure of project “contextualisation”. We then use non-linear methods to predict projects’ sector outcomes, and probe whatfeatures of the projects the model paid most attention to.We find that projects withwhat appear to be high degrees of tailoring to country context and concentration offunds in fewer sectors are associated with stronger outcomes. To our knowledge, this isthe first attempt to quantify the importance of project contextualization to developmenteffectiveness. Our findings have actionable implications for the system through which theWorld Bank and other development institutions evaluate project performance, as well asimplications for the design and staffing of these projects. 2. Literature and Theory 2.1. Development Effectiveness Aid effectiveness has been evaluated at several levels (see Table 1).At the mostaggregate level, cross-country studies have focused on the volume of aid as input andeconomic growth as the outcome, with institutional quality and political environment asother explanatory variables. A second approach has examined the relationship betweenaid and outcomes in sectors such as education, health, water and sanitation.A thirdhas concentrated on the self-evaluated outcomes, or ratings, of donor-financed projects,typically those of multilateral development banks. At the lowest level, a large literaturehas used randomized control trials to evaluate the impact of interventions and projectcomponents. The focus of this paper is at the sector and project level and the relationshipbetween them. [insert Table 1 here] The project-level approach has burgeoned recently, examining the relationship be-tween project characteristics and country level features as independent variables, anddonors’ ratings of project outcomes, which are considered a noisy but valid measure ofproject performance (Denizer, Kaufmann, and Kraay 2013).Explanatory factors for

点击免费查看完整报告

弥合差距：援助实效、项目评级与情境化

你可能感兴趣

关注差距：援助效果、项目评级和情境化

评级和报价的未来：弥合速度和复杂性之间的差距

COVID - 19 与妇女权利组织：弥合应对差距并要求更多

弥合差距？金融科技与金融普惠

指南针指数2025年第二季度：弥合营销与电子商务绩效之间的差距

2026亚太并购中的人力因素：弥合亚太地区交易中雄心与执行之间的差距

英国：选定问题 - 弥合差距：理解英国与美国的生产力脱钩

埃森哲报告发现，C-Suite与CISO之间需要进行更紧密的合作以弥合网络准备不足的差距

弥合差距：泰国的不平等与就业-2023

2025年市场脉动报告：弥合营销工作与商业影响之间的差距

弥合差距：援助实效、项目评级与情境化

你可能感兴趣

关注差距：援助效果、项目评级和情境化

评级和报价的未来 ： 弥合速度和复杂性之间的差距

COVID - 19 与妇女权利组织 ： 弥合应对差距并要求更多

弥合差距？金融科技与金融普惠

指南针指数2025年第二季度：弥合营销与电子商务绩效之间的差距

2026亚太并购中的人力因素：弥合亚太地区交易中雄心与执行之间的差距

英国：选定问题 - 弥合差距：理解英国与美国的生产力脱钩

埃森哲报告发现，C-Suite与CISO之间需要进行更紧密的合作以弥合网络准备不足的差距

弥合差距：泰国的不平等与就业-2023

2025年市场脉动报告：弥合营销工作与商业影响之间的差距

评级和报价的未来：弥合速度和复杂性之间的差距

COVID - 19 与妇女权利组织：弥合应对差距并要求更多