High-Fidelity Image Compression with Score-based Generative Models

Hoogeboom, Emiel; Agustsson, Eirikur; Mentzer, Fabian; Versari, Luca; Toderici, George; Theis, Lucas

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2305.18231 (eess)

[Submitted on 26 May 2023 (v1), last revised 7 Mar 2024 (this version, v3)]

Abstract: Despite the tremendous success of diffusion generative models in text-to-image generation, replicating this success in the domain of image compression has proven difficult. In this paper, we demonstrate that diffusion can significantly improve perceptual quality at a given bit-rate, outperforming state-of-the-art approaches PO-ELIC and HiFiC as measured by FID score. This is achieved using a simple but theoretically motivated two-stage approach combining an autoencoder targeting MSE followed by a further score-based decoder. However, as we will show, implementation details matter and the optimal design decisions can differ greatly from typical text-to-image models.

Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2305.18231 [eess.IV]
	(or arXiv:2305.18231v3 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2305.18231

Submission history

From: Lucas Theis
[v1] Fri, 26 May 2023 17:16:16 UTC (41,703 KB)
[v2] Wed, 6 Mar 2024 18:27:34 UTC (41,707 KB)
[v3] Thu, 7 Mar 2024 20:28:54 UTC (41,703 KB)
