Adaptive Spectral Normalization for Generative Models

Egorov, E. A.; Rogachev, A. I.

doi:10.1134/S1064562423701089

Adaptive Spectral Normalization for Generative Models

Published: 09 February 2024

Volume 108, pages S205–S214, (2023)
Cite this article

Doklady Mathematics Aims and scope Submit manuscript

E. A. Egorov¹ &
A. I. Rogachev¹

24 Accesses
Explore all metrics

Abstract

When using Wasserstein GAN loss function for training generative adversarial networks (GAN), it is theoretically necessary to limit the discriminators’ expressive power (so-called discriminator normalization). Such limitation increases the stability of GAN training at the expense of a less expressive final model. Spectral normalization is one of the normalization algorithms that involves applying a fixed operation independently to each discriminator layer. However, the optimal strength of the discriminator limitation varies for different tasks, which requires a parameterized normalization method. This paper proposes modifications to the spectral normalization algorithm that allow changing the strength of the discriminator limitation. In addition to parameterization, the proposed methods can change the degree of limitation during training, unlike the original algorithm. The quality of the obtained models is explored for each of the proposed methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Improved Performance of GANs via Integrating Gradient Penalty with Spectral Normalization

A Review on Mode Collapse Reducing GANs with GAN’s Algorithm and Theory

Effect of Input Noise Dimension in GANs

Notes

For a normed space V, henceforward, ||⋅||_V will denote the norm in this space.
GitHub repository with the used implementation of the training procedure and measuring metrics: https://github.com/TrickmanOff/GAN_project
An epoch in this context is one pass through the entire training set when performing generator training steps.
Further in the text, for brevity, this metric will be called “conditional average PRD-AUC,” when it is clear what partition of the set of conditions we are talking about.

REFERENCES

S. Agostinelli et al., “Geant4—a simulation toolkit,” Nucl. Instrum. Methods Phys. Res. Sect. A: Accel. Spectrom. Detect. Assoc. Equip. 506 (3), 250–303 (2003). https://doi.org/10.1016/S0168-9002(03)01368-8
Article Google Scholar
V. Chekalina et al., “Generative models for fast calorimeter simulation: The LHCb case,” EPJ Web Conf. 214, 02034 (2019). https://doi.org/10.1051/epjconf/201921402034
A. Rogachev and F. Ratnikov, “GAN with an auxiliary regressor for the fast simulation of the electromagnetic calorimeter response,” J. Phys.: Conf. Ser. 2438, 012086 (2023). https://doi.org/10.1088/1742-6596/2438/1/012086
M. Arjovsky, S. Chintala, and L. Bottou, “Wasserstein generative adversarial networks,” in Proceedings of the 34th International Conference on Machine Learning, Proc. Mach. Learn. Res. 70, 214–223 (2017). https://proceedings.mlr.press/v70/arjovsky17a/arjovsky17a.pdf
T. Miyato et al., “Spectral normalization for generative adversarial networks,” in International Conference on Learning Representations (2018). https://arxiv.org/abs/1802.05957
I. J. Goodfellow et al., “Generative adversarial nets,” in Advances in Neural Information Processing Systems (2014), Vol. 2, pp. 2672–2680. https://arxiv.org/pdf/1406.2661.pdf
Google Scholar
C. Villani, Optimal Transport: Old and New (Springer, Berlin, 2016).
Google Scholar
S. Hirose et al., “ABCAS: Adaptive bound control of spectral norm as automatic stabilizer,” in 2023 IEEE International Conference on Consumer Electronics (2023), pp. 1–5. https://doi.org/10.1109/ICCE56470.2023.10043368
M. S. M. Sajjadi et al. “Assessing generative models via precision and recall,” in Neural Information Processing (2018), pp. 5234–5243. https://arxiv.org/abs/1806.00035
P. S. Kostenetskiy, R. A. Chulkevich, and V. I. Ko-zyrev, “HPC resources of the Higher School of Economics,” J. Phys.: Conf. Ser. 1740 (2021). https://doi.org/10.1088/1742-6596/1740/1/012050

Download references

ACKNOWLEDGEMENTS

This study was carried out using the supercomputer complex of the National Research University Higher School of Economics [10]; the authors are grateful for providing access to it.

Funding

This work was supported by ongoing institutional funding. No additional grants to carry out or direct this particular research were obtained.

Author information

Authors and Affiliations

HSE University, 101000, Moscow, Russia
E. A. Egorov & A. I. Rogachev

Authors

E. A. Egorov
View author publications
You can also search for this author in PubMed Google Scholar
A. I. Rogachev
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to E. A. Egorov or A. I. Rogachev.

Ethics declarations

The authors of this work declare that they have no conflicts of interest.

Additional information

Publisher’s Note.

Pleiades Publishing remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

APPENDIX

1.1 ARCHITECTURE OF MODELS USED

Rights and permissions

Reprints and permissions

About this article

Cite this article

Egorov, E.A., Rogachev, A.I. Adaptive Spectral Normalization for Generative Models. Dokl. Math. 108 (Suppl 2), S205–S214 (2023). https://doi.org/10.1134/S1064562423701089

Download citation

Received: 04 September 2023
Revised: 08 September 2023
Accepted: 18 October 2023
Published: 09 February 2024
Issue Date: December 2023
DOI: https://doi.org/10.1134/S1064562423701089

Keywords:

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions