🏛️ The Multimedia Crypt
cs.MM: Where Multimedia papers rest without their code.
2009
Total Papers
1676
No Code
61
Twilight
272
Has Code
13.5%
Survival Rate
R.I.P.
👻
Ghosted
R.I.P.
👻
Ghosted
Evaluating Strong Idempotence of Image Codec
R.I.P.
👻
Ghosted
Practical Analyses of How Common Social Media Platforms and Photo Storage Services Handle Uploaded Images
R.I.P.
👻
Ghosted
Training Data Improvement for Image Forgery Detection using Comprint
R.I.P.
👻
Ghosted
OpenLifelogQA: An Open-Ended Multi-Modal Lifelog Question-Answering Dataset
R.I.P.
👻
Ghosted
MHier-RAG: Multi-Modal RAG for Visual-Rich Document Question-Answering via Hierarchical and Multi-Granularity Reasoning
R.I.P.
👻
Ghosted
Mitigating Modality Bias in Multi-modal Entity Alignment from a Causal Perspective
R.I.P.
👻
Ghosted
An Efficient Recommendation System in E-commerce using Passer learning optimization based on Bi-LSTM
R.I.P.
👻
Ghosted
Toward Accessible and Safe Live Streaming Using Distributed Content Filtering with MoQ
R.I.P.
💀
404 Not Found
Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning
R.I.P.
👻
Ghosted
VCEMO: Multi-Modal Emotion Recognition for Chinese Voiceprints
R.I.P.
👻
Ghosted
PPVF: An Efficient Privacy-Preserving Online Video Fetching Framework with Correlated Differential Privacy
R.I.P.
👻
Ghosted
Palantir: Towards Efficient Super Resolution for Ultra-high-definition Live Streaming
R.I.P.
👻
Ghosted
Personality Analysis from Online Short Video Platforms with Multi-domain Adaptation
R.I.P.
👻
Ghosted
AccKV: Towards Efficient Audio-Video LLMs Inference via Adaptive-Focusing and Cross-Calibration KV Cache Optimization
R.I.P.
👻
Ghosted
E-FreeM2: Efficient Training-Free Multi-Scale and Cross-Modal News Verification via MLLMs
R.I.P.
👻
Ghosted
AV-EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Omni-modal LLMS with Audio-visual Cues
R.I.P.
👻
Ghosted
XGC-AVis: Towards Audio-Visual Content Understanding with a Multi-Agent Collaborative System
R.I.P.
👻
Ghosted
Fact-Checking at Scale: Multimodal AI for Authenticity and Context Verification in Online Media
R.I.P.
👻
Ghosted
CatchPhrase: EXPrompt-Guided Encoder Adaptation for Audio-to-Image Generation
R.I.P.
👻
Ghosted
Multimodal Framework for Explainable Autonomous Driving: Integrating Video, Sensor, and Textual Data for Enhanced Decision-Making and Transparency
R.I.P.
👻
Ghosted
Music2Palette: Emotion-aligned Color Palette Generation via Cross-Modal Representation Learning
R.I.P.
👻
Ghosted
Can Generated Images Serve as a Viable Modality for Text-Centric Multimodal Learning?
R.I.P.
👻
Ghosted