TransVDM: Motion-Constrained Video Diffusion Model for Transparent Video Synthesis

February 26, 2025 · Declared Dead · 🏛 IEEE International Conference on Acoustics, Speech, and Signal Processing

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Menghao Li, Zhenghao Zhang, Junchao Liao, Long Qin, Weizhi Wang arXiv ID 2502.19454 Category cs.GR: Graphics Citations 1 Venue IEEE International Conference on Acoustics, Speech, and Signal Processing Last Checked 4 months ago

Abstract

Recent developments in Video Diffusion Models (VDMs) have demonstrated remarkable capability to generate high-quality video content. Nonetheless, the potential of VDMs for creating transparent videos remains largely uncharted. In this paper, we introduce TransVDM, the first diffusion-based model specifically designed for transparent video generation. TransVDM integrates a Transparent Variational Autoencoder (TVAE) and a pretrained UNet-based VDM, along with a novel Alpha Motion Constraint Module (AMCM). The TVAE captures the alpha channel transparency of video frames and encodes it into the latent space of the VDMs, facilitating a seamless transition to transparent video diffusion models. To improve the detection of transparent areas, the AMCM integrates motion constraints from the foreground within the VDM, helping to reduce undesirable artifacts. Moreover, we curate a dataset containing 250K transparent frames for training. Experimental results demonstrate the effectiveness of our approach across various benchmarks.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — Graphics

R.I.P. 👻 Ghosted

Everybody Dance Now

Caroline Chan, Shiry Ginosar, ... (+2 more)

cs.GR 🏛 ICCV 📚 820 cites 7 years ago

R.I.P. 👻 Ghosted

Deep Bilateral Learning for Real-Time Image Enhancement

Michaël Gharbi, Jiawen Chen, ... (+3 more)

cs.GR 🏛 ACM TOG 📚 800 cites 8 years ago

R.I.P. 👻 Ghosted

Animating Human Athletics

Jessica K. Hodgins, Wayne L. Wooten, ... (+2 more)

cs.GR 🏛 SIGGRAPH 📚 765 cites 3 years ago

R.I.P. 👻 Ghosted

BundleFusion: Real-time Globally Consistent 3D Reconstruction using On-the-fly Surface Re-integration

Angela Dai, Matthias Nießner, ... (+3 more)

cs.GR 🏛 TOGS 📚 595 cites 10 years ago

R.I.P. 👻 Ghosted

Shape Transformation Using Variational Implicit Functions

Greg Turk, James F. O'Brien

cs.GR 🏛 SIGGRAPH Courses 📚 581 cites 3 years ago

R.I.P. 👻 Ghosted

ABC: A Big CAD Model Dataset For Geometric Deep Learning

Sebastian Koch, Albert Matveev, ... (+7 more)

cs.GR 🏛 CVPR 📚 580 cites 7 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 9 years ago