Paper The following article is Open access

Review: Recent advances for the diffusion model

Published under licence by IOP Publishing Ltd
, , Citation Yufeng Wei 2024 J. Phys.: Conf. Ser. 2711 012005 DOI 10.1088/1742-6596/2711/1/012005

1742-6596/2711/1/012005

Abstract

As the generative model technology becomes more and more popular, more and more people have invested in the research of the current State-of-the-art (SOTA) generative model-diffusion model. This paper reviews all SOTA generation models using the diffusion model for text-to-image generation since the emergence of the diffusion model, including the denoising diffusion probabilistic model (DDPM), DALL·E model, imagen model, stable diffusion model, and diffusion transformer architecture (DiT) model. In the theoretical section, the basic principles behind the diffusion model are reviewed in detail in the way of mathematical calculation, including the training process of the model and the mathematical principles behind the sampling process. Moreover, this paper focuses on the technical characteristics of these models and various improvements made after model iteration, such as model structure optimization, more efficient and accurate training methods, and the application of other optimization techniques widely used in the field of deep learning to diffusion models. In the end, the technical route of the development of the diffusion model is summarized, and some predictions are made.

Export citation and abstract BibTeX RIS

Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

Please wait… references are loading.
10.1088/1742-6596/2711/1/012005