C3: High-performance and low-complexity neural compression from a single image or video

December 05, 2023 · Declared Dead · 🏛 Computer Vision and Pattern Recognition

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: Hyunjik Kim, Matthias Bauer, Lucas Theis, Jonathan Richard Schwarz, Emilien Dupont
arXiv ID: 2312.02753
Category: eess.IV (Image & Video Processing)
Cross-listed: cs.CV, cs.LG, stat.ML
Citations: 64
Venue: Computer Vision and Pattern Recognition
Last Checked: 2 months ago

Abstract

Most neural compression models are trained on large datasets of images or videos in order to generalize to unseen data. Such generalization typically requires large and expressive architectures with a high decoding complexity. Here we introduce C3, a neural compression method with strong rate-distortion (RD) performance that instead overfits a small model to each image or video separately. The resulting decoding complexity of C3 can be an order of magnitude lower than neural baselines with similar RD performance. C3 builds on COOL-CHIC (Ladune et al.) and makes several simple and effective improvements for images. We further develop new methodology to apply C3 to videos. On the CLIC2020 image benchmark, we match the RD performance of VTM, the reference implementation of the H.266 codec, with less than 3k MACs/pixel for decoding. On the UVG video benchmark, we match the RD performance of the Video Compression Transformer (Mentzer et al.), a well-established neural video codec, with less than 5k MACs/pixel for decoding.
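The abstract measures decoding complexity in MACs (multiply-accumulate operations) per pixel. As a rough illustration of how such a figure is counted, the sketch below tallies the MACs for one per-pixel evaluation of a small fully connected network. The function name and the layer widths are hypothetical and are not taken from the paper; the actual C3 decoder contains other components that this sketch does not model.

```python
from typing import Sequence


def mlp_macs_per_pixel(widths: Sequence[int]) -> int:
    """Count multiply-accumulates for one forward pass of a dense network.

    `widths` lists layer sizes from input to output. Each dense layer with
    `a` inputs and `b` outputs costs a * b MACs; biases and activations
    are not counted.
    """
    return sum(a * b for a, b in zip(widths, widths[1:]))


# Hypothetical layer widths chosen for illustration only (not from the paper).
example_widths = [7, 40, 40, 3]
macs = mlp_macs_per_pixel(example_widths)
print(f"{macs} MACs/pixel")  # 7*40 + 40*40 + 40*3 = 2000
```

Under these assumed widths the network costs 2000 MACs per pixel, which shows how a small per-image model can stay within the "less than 3k MACs/pixel" budget quoted in the abstract.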

📜 Similar Papers

In the same crypt — Image & Video Processing

R.I.P. 👻 Ghosted

Variational image compression with a scale hyperprior

Johannes Ballé, David Minnen, ... (+3 more)

eess.IV 🏛 ICLR 📚 2.2K cites 8 years ago

R.I.P. 👻 Ghosted

Kvasir-SEG: A Segmented Polyp Dataset

Debesh Jha, Pia H. Smedsrud, ... (+5 more)

eess.IV 🏛 MMM 📚 1.7K cites 6 years ago

R.I.P. 👻 Ghosted

Deep Learning for Hyperspectral Image Classification: An Overview

Shutao Li, Weiwei Song, ... (+4 more)

eess.IV 🏛 IEEE TGRS 📚 1.5K cites 6 years ago

R.I.P. 👻 Ghosted

U-Net and its variants for medical image segmentation: theory and applications

Nahian Siddique, Paheding Sidike, ... (+2 more)

eess.IV 🏛 IEEE Access 📚 1.4K cites 5 years ago

R.I.P. 👻 Ghosted

Algorithm Unrolling: Interpretable, Efficient Deep Learning for Signal and Image Processing

Vishal Monga, Yuelong Li, Yonina C. Eldar

eess.IV 🏛 IEEE Signal Processing Magazine 📚 1.3K cites 6 years ago

R.I.P. 👻 Ghosted

ResUNet++: An Advanced Architecture for Medical Image Segmentation

Debesh Jha, Pia H. Smedsrud, ... (+5 more)

eess.IV 🏛 IEEE ISM 📚 1.2K cites 6 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Language Models are Few-Shot Learners

Tom B. Brown, Benjamin Mann, ... (+29 more)

cs.CL 🏛 NeurIPS 📚 54.2K cites 5 years ago

R.I.P. 👻 Ghosted

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, Sam Gross, ... (+19 more)

cs.LG 🏛 NeurIPS 📚 49.7K cites 6 years ago

R.I.P. 👻 Ghosted

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, Carlos Guestrin

cs.LG 🏛 KDD 📚 49.2K cites 10 years ago

R.I.P. 👻 Ghosted

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, Christian Szegedy

cs.LG 🏛 ICML 📚 46.0K cites 11 years ago