An Overview of Diffusion Models for Generative Artificial Intelligence

Information Technology 2024-12-03 - Unknown institution, 杨春

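1. Introduction

The goal of generative modelling is to learn the underlying distribution from a given dataset and to generate new data samples from it. This article introduces denoising diffusion probabilistic models (DDPMs), a generative method based on diffusion processes. A DDPM consists of two stochastic processes: a diffusion (forward) process and a denoising (backward) process. The diffusion process starts from an initial state and adds noise step by step until the state is essentially pure noise. The denoising process is parameterized, and its parameters are learned so that its distribution at each time step is close to that of the diffusion process at the corresponding time step. Running the denoising process then turns pure noise into meaningful data.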

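2. Denoising diffusion probabilistic models (DDPMs)

2.1 General framework for DDPMs

The DDPM framework consists of the following elements:

Dimension: d, the dimension of the generated data.
Number of time steps: T, the number of time steps of the DDPM.
Diffusion process: X^∅, a stochastic process that starts at the initial state X^∅_0 and adds noise step by step.
Initial state: X^∅_0, the initial state of the diffusion process, whose distribution approximates the underlying distribution we want to sample from.
Denoising process: X^θ, a parameterized stochastic process that starts from a pure-noise state and removes noise step by step.
Initial distribution of the denoising process: Π, the distribution of the initial state of the denoising process, for example a Gaussian distribution.

The goal of a DDPM is to learn the parameters θ of the denoising process so that its terminal state X^θ_0 is approximately distributed like the initial state X^∅_0 of the diffusion process.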

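2.2 Training objective in DDPMs

DDPMs use the expected negative log-likelihood (ENLL) as the training objective: minimizing the ENLL brings the distribution of the denoising process close to that of the diffusion process.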
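As a sketch of what this objective looks like, assume that the terminal state of the denoising process has a density p_θ and that the initial state of the diffusion process has a density q_0 with finite differential entropy H(q_0). The symbols p_θ, q_0 and H are notation introduced here for illustration and are not taken from the summary above. Under these assumptions the ENLL splits into a KL divergence plus a term that does not depend on θ, which is why minimizing it pulls the two distributions together:

```latex
\mathrm{ENLL}(\theta)
  = \mathbb{E}\bigl[-\log p_\theta\bigl(X^{\emptyset}_0\bigr)\bigr]
  = \int q_0(x)\,\bigl(-\log p_\theta(x)\bigr)\,\mathrm{d}x
  = \mathrm{KL}\bigl(q_0 \,\|\, p_\theta\bigr) + H(q_0)
```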

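2.3 A first simplified DDPM generative method

The article presents a first simplified DDPM generative method based on an upper bound for the ENLL. A neural network is trained to predict the noise; generation then starts from a pure-noise state and removes noise step by step until a new data sample is obtained.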
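The kind of upper bound meant here can be sketched with the usual variational argument. Assume, for illustration only, that the denoising process has a joint density p_θ(x_0, ..., x_T) and that the diffusion process has a conditional density q(x_1, ..., x_T | x_0); both symbols are introduced here and may differ from the article's notation. Jensen's inequality then gives, for every x_0:

```latex
-\log p_\theta(x_0)
  = -\log \int q(x_{1:T}\mid x_0)\,
      \frac{p_\theta(x_{0:T})}{q(x_{1:T}\mid x_0)}\,\mathrm{d}x_{1:T}
  \;\le\;
  \mathbb{E}_{q(x_{1:T}\mid x_0)}\!\left[
      -\log \frac{p_\theta(x_{0:T})}{q(x_{1:T}\mid x_0)}
  \right]
```

Taking the expectation over x_0 bounds the ENLL from above by a quantity that only involves transition densities of the two processes, which is what makes it usable for training.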

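3. DDPMs with Gaussian noise

3.1 Properties of Gaussian distributions

This section collects the properties of Gaussian distributions that are used later: properties of Gaussian transition kernels, Bayes' rule for Gaussian distributions, and the KL divergence between Gaussian distributions.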
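The KL divergence between two Gaussians has a closed form, which is what later makes the training bound tractable. The following is a minimal sketch for the special case of diagonal covariance matrices; the function name `kl_diag_gaussians` and its argument names are hypothetical and chosen only for this illustration.

```python
import numpy as np

def kl_diag_gaussians(mu1, var1, mu2, var2):
    """KL( N(mu1, diag(var1)) || N(mu2, diag(var2)) ) for diagonal covariances.

    Per coordinate: 0.5 * (log(var2/var1) + (var1 + (mu1 - mu2)^2) / var2 - 1).
    """
    mu1, var1, mu2, var2 = (np.asarray(a, dtype=float) for a in (mu1, var1, mu2, var2))
    per_coord = np.log(var2 / var1) + (var1 + (mu1 - mu2) ** 2) / var2 - 1.0
    return 0.5 * np.sum(per_coord)

# Equal variances: the divergence reduces to |mu1 - mu2|^2 / (2 * var).
kl_shifted = kl_diag_gaussians([0.0], [1.0], [1.0], [1.0])
```

When both variances are equal, the divergence depends only on the squared distance between the means. This is the reason a mean-matching (and hence noise-matching) loss appears in the reformulated objective.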

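3.2 Framework for DDPMs with Gaussian noise

In the DDPM framework considered in the article, both the diffusion process and the denoising process use Gaussian transition kernels. The diffusion process turns the initial state into pure noise by adding Gaussian noise step by step; the denoising process learns to remove this noise with a neural network.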
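A Gaussian forward transition can be simulated in a few lines. The sketch below assumes the widely used variance-preserving parameterization x_t = sqrt(1 - β_t) x_{t-1} + sqrt(β_t) ε with a linear schedule for β_t. The schedule values, the name `forward_step`, and the uniform stand-in data are assumptions made for this example, not details from the summary.

```python
import numpy as np

def forward_step(x_prev, beta_t, rng):
    """One forward transition: x_t ~ N(sqrt(1 - beta_t) * x_prev, beta_t * I)."""
    noise = rng.standard_normal(x_prev.shape)
    return np.sqrt(1.0 - beta_t) * x_prev + np.sqrt(beta_t) * noise

rng = np.random.default_rng(0)
betas = np.linspace(1e-4, 0.02, 1000)            # hypothetical linear noise schedule
x = rng.uniform(-1.0, 1.0, size=(10_000, 2))     # stand-in "data" samples (not Gaussian)

# Apply all T transitions in order; the samples drift towards N(0, I).
for beta_t in betas:
    x = forward_step(x, beta_t, rng)
```

After the loop, the empirical mean of `x` is close to 0 and its variance is close to 1, even though the starting samples were uniform rather than Gaussian.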

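3.3 Distributions of the forward process in DDPMs with Gaussian noise

In this Gaussian-noise framework, the conditional distributions of the forward process given the initial state are Gaussian, including that of the terminal state. The article also shows that when the diffusion process adds enough noise, its terminal distribution approaches the standard normal distribution.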
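Because compositions of Gaussian transitions are again Gaussian, the state at time t can be sampled directly from the initial state without simulating the intermediate steps. The sketch below assumes the standard parameterization with α_t = 1 - β_t and cumulative product ᾱ_t, so that x_t given x_0 is N(sqrt(ᾱ_t) x_0, (1 - ᾱ_t) I). The schedule and the name `sample_x_t` are assumptions for illustration.

```python
import numpy as np

betas = np.linspace(1e-4, 0.02, 1000)     # hypothetical linear noise schedule
alpha_bar = np.cumprod(1.0 - betas)       # alpha_bar[t-1] = prod_{s<=t} (1 - beta_s)

def sample_x_t(x0, t, rng):
    """Sample x_t given x_0 in one shot; t is 1-indexed in {1, ..., T}.

    Returns the noisy state together with the noise that produced it.
    """
    eps = rng.standard_normal(x0.shape)
    a = alpha_bar[t - 1]
    return np.sqrt(a) * x0 + np.sqrt(1.0 - a) * eps, eps

rng = np.random.default_rng(0)
x0 = np.ones((4, 2))
x_T, eps = sample_x_t(x0, len(betas), rng)
```

With this schedule `alpha_bar[-1]` is below 1e-4, so at the terminal time the contribution of `x0` is negligible and `x_T` is essentially standard normal noise.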

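3.4 Reformulated training objective in DDPMs with Gaussian noise

With specific choices of the mean and variance functions of the denoising process, the upper bound for the training objective simplifies to an expression that is convenient for training. This expression involves the cumulative noise added in the forward process and trains the neural network to predict, and thereby remove, that noise.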
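A commonly used unweighted form of such a noise-prediction objective is the mean squared error between the noise ε that was added and the network output ε_θ(x_t, t). The sketch below estimates that loss on one batch. The exact weighting in the article may differ, and `simplified_loss`, `eps_model`, and the schedule are hypothetical names and values used only here.

```python
import numpy as np

def simplified_loss(eps_model, x0_batch, alpha_bar, rng):
    """Monte Carlo estimate of E || eps - eps_model(x_t, t) ||^2 on one batch.

    x0_batch has shape (n, d); eps_model maps (x_t, t) to an array of shape (n, d).
    """
    n = x0_batch.shape[0]
    T = alpha_bar.shape[0]
    t = rng.integers(1, T + 1, size=n)            # uniform time steps in {1, ..., T}
    eps = rng.standard_normal(x0_batch.shape)     # the noise the model should predict
    a = alpha_bar[t - 1][:, None]                 # per-sample cumulative factor
    x_t = np.sqrt(a) * x0_batch + np.sqrt(1.0 - a) * eps
    pred = eps_model(x_t, t)
    return np.mean(np.sum((eps - pred) ** 2, axis=1))

rng = np.random.default_rng(0)
alpha_bar = np.cumprod(1.0 - np.linspace(1e-4, 0.02, 1000))   # hypothetical schedule
x0 = rng.standard_normal((5000, 3))

# Baseline: a model that always predicts zero noise has loss close to d = 3.
baseline = simplified_loss(lambda x_t, t: np.zeros_like(x_t), x0, alpha_bar, rng)
```

In practice `eps_model` would be a neural network and the loss would be minimized with stochastic gradient descent; the zero predictor here only provides a reference value that a trained model should beat.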

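3.5 DDPM generative method with Gaussian noise

The article presents a generative method based on the Gaussian-noise DDPM framework that produces new data samples by removing noise step by step. The method uses a UNet neural network as the denoising model and uses standard normal random variables to simulate both the diffusion process and the denoising process.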
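The generation loop can be sketched as follows, assuming the standard noise-prediction parameterization with backward variance β_t. This is an illustration under those assumptions, not the article's exact algorithm. In place of a trained UNet, the example uses a toy predictor that is exact when the data distribution is itself standard normal (then E[ε | x_t] = sqrt(1 - ᾱ_t) x_t); `ddpm_sample` and `toy_eps_model` are hypothetical names.

```python
import numpy as np

def ddpm_sample(eps_model, betas, shape, rng):
    """Run the denoising process from x_T ~ N(0, I) down to x_0."""
    alphas = 1.0 - betas
    alpha_bar = np.cumprod(alphas)
    x = rng.standard_normal(shape)                      # pure-noise start
    for t in range(betas.shape[0], 0, -1):              # t = T, ..., 1
        i = t - 1                                       # 0-based array index
        eps_hat = eps_model(x, t)
        mean = (x - betas[i] / np.sqrt(1.0 - alpha_bar[i]) * eps_hat) / np.sqrt(alphas[i])
        # Fresh Gaussian noise at every step except the last one.
        noise = rng.standard_normal(shape) if t > 1 else 0.0
        x = mean + np.sqrt(betas[i]) * noise
    return x

betas = np.linspace(1e-4, 0.02, 1000)                   # hypothetical schedule
alpha_bar = np.cumprod(1.0 - betas)

def toy_eps_model(x, t):
    """Exact noise predictor for N(0, I) data; stands in for a trained UNet."""
    return np.sqrt(1.0 - alpha_bar[t - 1]) * x

samples = ddpm_sample(toy_eps_model, betas, (5000, 2), np.random.default_rng(0))
```

With the toy predictor, each backward step reduces to x_{t-1} = sqrt(1 - β_t) x_t + sqrt(β_t) z, so the generated samples stay approximately standard normal, matching the assumed data distribution.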

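3.6 Network architectures for the backward process

The article discusses the network architecture commonly used for the denoising process, the UNet. A UNet has an encoder-decoder structure with skip connections, which improve generation quality.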
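To make the encoder-decoder-with-skip idea concrete, here is a deliberately tiny one-dimensional sketch in NumPy. It is not a real UNet: there are no convolutions over neighbouring positions, no time embedding, and no training. All names (`down`, `up`, `mix`, `tiny_unet`) and layer sizes are hypothetical. It only shows how features from the encoder are concatenated back into the decoder at the same resolution.

```python
import numpy as np

def down(x):
    """Halve the resolution by average pooling: (C, L) -> (C, L // 2)."""
    return 0.5 * (x[:, 0::2] + x[:, 1::2])

def up(x):
    """Double the resolution by nearest-neighbour upsampling: (C, L) -> (C, 2L)."""
    return np.repeat(x, 2, axis=1)

def mix(x, w):
    """Channel-mixing layer with a tanh nonlinearity; w has shape (C_out, C_in)."""
    return np.tanh(w @ x)

def tiny_unet(x, params):
    h1 = mix(x, params["enc1"])               # encoder, full resolution
    h2 = mix(down(h1), params["enc2"])        # encoder, half resolution (bottleneck)
    u = up(h2)                                # decoder, back to full resolution
    cat = np.concatenate([u, h1], axis=0)     # skip connection: reuse encoder features
    return params["dec"] @ cat                # project back to the input channels

rng = np.random.default_rng(0)
params = {
    "enc1": rng.standard_normal((4, 3)),
    "enc2": rng.standard_normal((6, 4)),
    "dec": rng.standard_normal((3, 6 + 4)),
}
x = rng.standard_normal((3, 8))               # 3 channels, length 8 (must be even)
y = tiny_unet(x, params)
```

The output has the same shape as the input, which is what a noise predictor needs, and the skip connection lets fine-grained information bypass the low-resolution bottleneck.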

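4. Evaluation of generative models

Evaluating a generative model requires suitable metrics for the quality of the generated data. The article discusses two types of metrics:

Content variant metrics: for example the Inception Score (IS) and the Fréchet Inception Distance (FID), which measure the diversity and quality of the generated data.
Content invariant metrics: for example the structural similarity index measure (SSIM), the peak signal-to-noise ratio (PSNR), and the learned perceptual image patch similarity (LPIPS), which measure how similar generated data are to reference data in structure, detail, and overall quality.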
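Of the metrics listed above, PSNR is the simplest to compute directly: it is 10 · log10(MAX² / MSE), where MAX is the largest possible pixel value and MSE is the mean squared error against the reference. The following is a minimal sketch; the function name `psnr` and the default `max_value=1.0` (images scaled to [0, 1]) are assumptions for this example.

```python
import numpy as np

def psnr(reference, test, max_value=1.0):
    """Peak signal-to-noise ratio in dB between a reference and a test image."""
    reference = np.asarray(reference, dtype=float)
    test = np.asarray(test, dtype=float)
    mse = np.mean((reference - test) ** 2)
    if mse == 0.0:
        return float("inf")          # identical images
    return 10.0 * np.log10(max_value ** 2 / mse)

reference = np.zeros((8, 8))
noisy = reference + 0.1              # constant error of 0.1, so MSE = 0.01
value = psnr(reference, noisy)       # about 20 dB
```

Higher values mean the test image is closer to the reference. Unlike IS and FID, PSNR needs a paired reference image, which is why it falls under the content invariant metrics.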

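5. Advanced variants and extensions of DDPMs

This section discusses several advanced variants and extensions of DDPMs, including:

Improved DDPM: improves DDPM performance by learning the variances of the denoising process, using a cosine scheduler, and increasing the number of training time steps.
Denoising diffusion implicit model (DDIM): defines the diffusion process as a non-Markovian process while keeping the same training objective as DDPM, which makes generation more efficient.
Classifier-free diffusion guidance: uses adaptive group normalization (AdaGN) to integrate class information directly into the UNet architecture, which makes it possible to control the class of the generated data.
Stable Diffusion: combines a diffusion model with an autoencoder and uses a text encoder to generate images that match a text description.
Further state-of-the-art diffusion techniques: for example GLIDE, DALL-E 2/3, and Imagen, which have achieved significant progress in image generation.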
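Three of the ideas in this list reduce to short formulas. The sketch below follows the forms commonly given in the literature: the cosine schedule for ᾱ_t, the classifier-free guidance combination of conditional and unconditional noise predictions, and the deterministic DDIM update. The article's own notation may differ, and the function names and the offset `s=0.008` are assumptions for illustration.

```python
import numpy as np

def cosine_alpha_bar(T, s=0.008):
    """Cosine schedule: alpha_bar[t] for t = 0, ..., T, with alpha_bar[0] = 1."""
    t = np.arange(T + 1) / T
    f = np.cos((t + s) / (1.0 + s) * np.pi / 2.0) ** 2
    return f / f[0]

def guided_eps(eps_cond, eps_uncond, w):
    """Classifier-free guidance: extrapolate away from the unconditional prediction."""
    return (1.0 + w) * eps_cond - w * eps_uncond

def ddim_step(x_t, eps_hat, abar_t, abar_prev):
    """Deterministic DDIM update from noise level abar_t to abar_prev."""
    # Estimate the clean sample implied by the predicted noise ...
    x0_hat = (x_t - np.sqrt(1.0 - abar_t) * eps_hat) / np.sqrt(abar_t)
    # ... and re-noise it to the earlier (less noisy) level with the same noise.
    return np.sqrt(abar_prev) * x0_hat + np.sqrt(1.0 - abar_prev) * eps_hat

abar = cosine_alpha_bar(1000)

# Consistency check: with the true noise, a DDIM step lands exactly on the
# forward-process sample at the earlier level, even when skipping 100 steps.
rng = np.random.default_rng(0)
x0, eps = rng.standard_normal((4, 2)), rng.standard_normal((4, 2))
x_500 = np.sqrt(abar[500]) * x0 + np.sqrt(1.0 - abar[500]) * eps
x_400 = ddim_step(x_500, eps, abar[500], abar[400])
```

Since `ddim_step` can jump between arbitrary noise levels, sampling can use far fewer steps than training, which is the efficiency gain attributed to DDIM above. Note that `abar[T]` is numerically zero for the cosine schedule, so in practice the last levels are clipped before dividing by `sqrt(abar_t)`.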

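The DDPM framework and variants presented in the article provide a theoretical foundation and technical support for research on generative artificial intelligence, and they indicate directions for further research on and development of diffusion models.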

Davide Gallon (1), Arnulf Jentzen (2,3), and Philippe von Wurstemberger (4,5)

(1) Applied Mathematics: Institute for Analysis and Numerics, University of Münster, Germany, e-mail: davide.gallon@uni-muenster.de
(2) School of Data Science and Shenzhen Research Institute of Big Data, The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), China, e-mail: ajentzen@cuhk.edu.cn
(3) Applied Mathematics: Institute for Analysis and Numerics, University of Münster, Germany, e-mail: ajentzen@uni-muenster.de
(4) Risklab, Department of Mathematics, ETH Zurich, Switzerland, e-mail: philippe.vonwurstemberger@math.ethz.ch
(5) School of Data Science, The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), China, e-mail: philippevw@cuhk.edu.cn

December 3, 2024

arXiv:2412.01371v1 [cs.LG] 2 Dec 2024

Abstract

This article provides a mathematically rigorous introduction to denoising diffusion probabilistic models (DDPMs), sometimes also referred to as diffusion probabilistic models or diffusion models, for generative artificial intelligence. We provide a detailed basic mathematical framework for DDPMs and explain the main ideas behind training and generation procedures. In this overview article we also review selected extensions and improvements of the basic framework from the literature such as improved DDPMs, denoising diffusion implicit models, classifier-free diffusion guidance models, and latent diffusion models.

Contents

1 Introduction
2 Denoising diffusion probabilistic models (DDPMs)
2.1 General framework for DDPMs
2.2 Training objective in DDPMs
2.3 A first simplified DDPM generative method
3.1 Properties of Gaussian distributions
3.1.1 On Gaussian transition kernels
3.1.2 Explicit constructions for Gaussian transition kernels
3.1.3 Bayes rule for Gaussian distributions
3.1.4 KL divergence between Gaussian distributions
3.2 Framework for DDPMs with Gaussian noise
3.3 Distributions of the forward process in DDPMs with Gaussian noise
3.3.1 Conditional distributions going forward
3.3.2 Terminal distributions
3.3.3 Conditional distributions going backwards
3.4 Reformulated training objective in DDPMs with Gaussian noise
3.5 DDPM generative method with Gaussian noise
3.6 Network architectures for the backward process
3.6.1 UNets
3.6.2 Time embedding
4 Evaluation of generative models
4.1 Content variant metrics
4.1.1 Inception score
4.1.2 Fréchet inception distance
4.2 Content invariant metrics
5 Advanced variants and extensions of DDPMs
5.1 Improved DDPM
5.2 Denoising Diffusion Implicit Model (DDIM)
5.2.1 Framework for DDIM
5.2.2 Distribution for the forward process in DDIM
5.2.3 Explicit objective function in DDIM
5.2.4 Generative method
5.3 Classifier-free diffusion guidance
5.3.1 Controlling with adaptive group normalization
5.3.2 Generative method
5.4 Stable Diffusion
5.4.1 Controlling with cross attention layer
5.4.2 Generative method
5.5 Further state of the art diffusion techniques
5.5.1 GLIDE
5.5.2 DALL-E 2 and DALL-E 3
5.5.3 Imagen

1 Introduction

The goal of generative modelling is to generate new data samples from an unknown underlying distribution based on a dataset of samples from tha
