Haoyang Li, Haibo Chen, Xin Wang
Tsinghua University
lihy218@gmail.com, chb24@mails.tsinghua.edu.cn, {xin_wang, wwzhu}@tsinghua.edu.cn

arXiv:2601.21067v1 [cs.LG] 28 Jan 2026

Abstract

Graphs are a fundamental data structure for representing relational information in domains such as social networks, molecular systems, and knowledge graphs. However, graph learning models often suffer from limited generalization when applied beyond their training distributions. In practice, distribution shifts may arise from changes in graph structure, domain semantics, available modalities, or task formulations. To address these challenges, graph foundation models (GFMs) have recently emerged, aiming to learn general-purpose representations through large-scale pretraining across diverse graphs and tasks. In this survey, we review recent progress on GFMs from the perspective of out-of-distribution (OOD) generalization. We first discuss the main challenges posed by distribution shifts in graph learning and outline a unified problem setting. We then organize existing approaches based on whether they explicitly support generalization across different task specifications.

1 Introduction

Graphs are a fundamental data structure for representing relational information in many applications, including social and information networks, molecular and biological systems, recommendation platforms, and knowledge graphs [1,2,3]. By encoding entities as nodes and interactions as edges, graphs capture complex dependency structures that are difficult to model using independent feature representations. Graph learning methods, such as graph neural networks, have become a central tool for predictive and reasoning tasks, including node classification, link prediction, and graph-level prediction [4,5]. However, models trained on a specific dataset or graph often exhibit limited generalization when applied to new testing environments, where graph topology, feature distributions, and other data characteristics may differ from those seen during training.

Out-of-distribution (OOD) generalization provides a useful perspective for addressing these limitations [7,8].
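The OOD generalization problem sketched above can be phrased as worst-case risk minimization over a set of environments. The following is a minimal formulation, with notation assumed here rather than taken from the paper:

```latex
\min_{f \in \mathcal{F}} \; \max_{e \in \mathcal{E}} \;
\mathbb{E}_{(G,\, y) \sim P^{e}} \big[ \ell\big(f(G), y\big) \big]
```

where \(\mathcal{E}\) indexes environments that may differ in structure, domain, modality, or task; \(P^{e}\) is the data distribution of environment \(e\); \(f\) is the graph model; and \(\ell\) is a task-specific loss. Training typically observes only a subset of \(\mathcal{E}\), and OOD generalization asks \(f\) to perform well on unseen environments.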
In the field of graph learning, distribution shifts may arise from multiple sources. Structural properties such as connectivity patterns or motif statistics can change across graphs [9,10]. Domain-specific factors, including data collection and annotation practices, may introduce dataset biases [11]. Auxiliary modalities, such as text or molecular features, may be missing, noisy, or inconsistent across datasets.

Recently, graph foundation models (GFMs) have emerged and attracted growing attention from the research community. Inspired by foundation models in language and vision [14,15], GFMs aim to learn general-purpose graph representations through large-scale pretraining on diverse graph collections [16,17]. Instead of optimizing only for a specific dataset, these models seek to capture generalizable patterns that remain reusable and stable across graphs, domains, and downstream objectives. A growing body of work has explored different approaches to building GFMs [18,19,20], including multi-graph pretraining [21], alignment across domains and modalities [22], invariant representation learning [23], and prompt- or instruction-based interfaces for task generalization [24]. These foundation models address both practical and methodological limitations of traditional graph learning. In many applications, labeled data for new graphs is scarce, and retraining models for every new setting is costly. Such limitations have increased interest in GFMs [26,27,23], which provide a promising paradigm for handling distribution shifts for OOD generalization.

Several recent surveys [28,16,17] have reviewed GFMs from perspectives such as model architecture [29], pretraining objectives, scalability, and application domains [30,31]. In contrast, this survey organizes the literature explicitly from the perspective of OOD generalization.
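As a concrete illustration of the structural shifts discussed above, the sketch below compares the degree distributions of a "training" and a "test" graph via total variation distance. The random-graph construction and the choice of statistic are illustrative assumptions, not a method from any surveyed work:

```python
import random
from collections import Counter

def degree_distribution(edges, num_nodes):
    """Empirical degree distribution of an undirected graph given as an edge list."""
    deg = Counter()
    for u, v in edges:
        deg[u] += 1
        deg[v] += 1
    counts = Counter(deg[n] for n in range(num_nodes))  # includes degree-0 nodes
    return {d: c / num_nodes for d, c in counts.items()}

def total_variation(p, q):
    """Total variation distance between two discrete distributions."""
    support = set(p) | set(q)
    return 0.5 * sum(abs(p.get(d, 0.0) - q.get(d, 0.0)) for d in support)

def random_graph(num_nodes, edge_prob, rng):
    """Erdos-Renyi-style random graph, a stand-in for real graph data."""
    return [(u, v) for u in range(num_nodes) for v in range(u + 1, num_nodes)
            if rng.random() < edge_prob]

rng = random.Random(0)
n = 200
train_graph = random_graph(n, 0.02, rng)  # sparse graph seen at training time
test_graph = random_graph(n, 0.10, rng)   # denser graph at test time: structural shift

p = degree_distribution(train_graph, n)
q = degree_distribution(test_graph, n)
tvd = total_variation(p, q)
print(f"degree-distribution TV distance: {tvd:.2f}")
```

A large distance between the two degree distributions signals exactly the kind of structural shift under which a model trained on one graph may fail on the other; analogous statistics could be computed for motif counts or feature distributions.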
Rather than focusing on model design alone, we examine how different GFMs address distribution shifts arising from changes in graph structure, domain semantics, modality availability, and task formulation, providing a generalization-centric view of the field.

In this survey, we provide a comprehensive overview of graph foundation models from the perspective of OOD generalization. We first identify the key challenges posed by distribution shifts in graph learning and introduce a unified problem formulation that captures OOD shifts in structure/feature, domain, modality, and task. We then organize existing methods into two broad categories according to whether they explicitly support generalization across different task specifications. The first category includes approaches that focus on generalization under a fixed task setting, where OOD generalization is achieved by learning representations that remain effective across structural, domain, or modality shifts. The second category comprises methods that are designed to generalize across more varied and complex task specifications.

2 Challenges and Problem Formulation

GFMs can support learning and inference across diverse environments, where the data-generating process may differ substantially between training and deployment. In such settings, the poor gen-era