🏛️ The Multimedia Crypt
cs.MM: Where Multimedia papers rest without their code.
2009
Total Papers
1676
No Code
61
Twilight
272
Has Code
13.5%
Survival Rate
🌅
💤
Eternal Rest
R.I.P.
👻
Ghosted
Towards Real-World Stickers Use: A New Dataset for Multi-Tag Sticker Recognition
R.I.P.
👻
Ghosted
Deep Reinforcement Learning with Importance Weighted A3C for QoE enhancement in Video Delivery Services
R.I.P.
👻
Ghosted
Confidence-based Event-centric Online Video Question Answering on a Newly Constructed ATBS Dataset
R.I.P.
👻
Ghosted
Dance2MIDI: Dance-driven multi-instruments music generation
R.I.P.
👻
Ghosted
FastCache: Optimizing Multimodal LLM Serving through Lightweight KV-Cache Compression Framework
R.I.P.
👻
Ghosted
diveXplore at the Video Browser Showdown 2024
R.I.P.
👻
Ghosted
Ges-QA: A Multidimensional Quality Assessment Dataset for Audio-to-3D Gesture Generation
R.I.P.
👻
Ghosted
VGGSounder: Audio-Visual Evaluations for Foundation Models
R.I.P.
👻
Ghosted
DualDub: Video-to-Soundtrack Generation via Joint Speech and Background Audio Synthesis
R.I.P.
👻
Ghosted
LayLens: Improving Deepfake Understanding through Simplified Explanations
📚
📚
The Cartographer
A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects
R.I.P.
👻
Ghosted
FlowDubber: Movie Dubbing with LLM-based Semantic-aware Learning and Flow Matching based Voice Enhancing
📚
📚
The Cartographer
A Survey on Multimodal Music Emotion Recognition
R.I.P.
👻
Ghosted
Fact-Checking with Contextual Narratives: Leveraging Retrieval-Augmented LLMs for Social Media Analysis
R.I.P.
👻
Ghosted
Align Your Rhythm: Generating Highly Aligned Dance Poses with Gating-Enhanced Rhythm-Aware Feature Representation
R.I.P.
👻
Ghosted
Multimodal Graph-Based Variational Mixture of Experts Network for Zero-Shot Multimodal Information Extraction
R.I.P.
👻
Ghosted
REAL: Realism Evaluation of Text-to-Image Generation Models for Effective Data Augmentation
R.I.P.
👻
Ghosted
FlexCache: Flexible Approximate Cache System for Video Diffusion
📚
📚
The Cartographer
A review on Machine Learning based User-Centric Multimedia Streaming Techniques
R.I.P.
👻
Ghosted
Counterfactual Reasoning Using Predicted Latent Personality Dimensions for Optimizing Persuasion Outcome
R.I.P.
👻
Ghosted
MetaCast: A Self-Driven Metaverse Announcer Architecture Based on Quality of Experience Evaluation Model
R.I.P.
👻
Ghosted
OAcode: Overall Aesthetic 2D Barcode on Screen
R.I.P.
👻
Ghosted