Art2Mus: Bridging Visual Arts and Music through Cross-Modal Generation

October 07, 2024 · Declared Dead · 🏛 ECCV Workshops

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: Ivan Rinaldi, Nicola Fanelli, Giovanna Castellano, Gennaro Vessio
arXiv ID: 2410.04906
Category: cs.MM (Multimedia)
Cross-listed: cs.CV, cs.SD, eess.AS
Citations: 6
Venue: ECCV Workshops
Last Checked: 3 months ago

Abstract

Artificial Intelligence and generative models have revolutionized music creation, with many models leveraging textual or visual prompts for guidance. However, existing image-to-music models are limited to simple images, lacking the capability to generate music from complex digitized artworks. To address this gap, we introduce Art2Mus, a novel model designed to create music from digitized artworks or text inputs. Art2Mus extends the AudioLDM 2 architecture, a text-to-audio model, and employs our newly curated datasets, created via ImageBind, which pair digitized artworks with music. Experimental results demonstrate that Art2Mus can generate music that resonates with the input stimuli. These findings suggest promising applications in multimedia art, interactive installations, and AI-driven creative tools.

📜 Similar Papers

In the same crypt — Multimedia

🌅 Old Age

Quality Assessment of In-the-Wild Videos

Dingquan Li, Tingting Jiang, Ming Jiang

cs.MM 🏛 ACM MM 📚 375 cites 6 years ago

R.I.P. 👻 Ghosted

Viewport-Adaptive Navigable 360-Degree Video Delivery

Xavier Corbillon, Gwendal Simon, ... (+2 more)

cs.MM 🏛 ICC 📚 328 cites 9 years ago

📚 The Cartographer

A Comprehensive Survey on Cross-modal Retrieval

Kaiye Wang, Qiyue Yin, ... (+3 more)

cs.MM 🏛 arXiv 📚 322 cites 9 years ago

📚 The Cartographer

An Overview of Cross-media Retrieval: Concepts, Methodologies, Benchmarks and Challenges

Yuxin Peng, Xin Huang, Yunzhen Zhao

cs.MM 🏛 IEEE TCSVT 📚 309 cites 9 years ago

R.I.P. 👻 Ghosted

A Convolutional Neural Network Approach for Post-Processing in HEVC Intra Coding

Yuanying Dai, Dong Liu, Feng Wu

cs.MM 🏛 ICMM 📚 305 cites 9 years ago

R.I.P. 👻 Ghosted

Video Generation From Text

Yitong Li, Martin Renqiang Min, ... (+3 more)

cs.MM 🏛 AAAI 📚 300 cites 8 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 8 years ago