⚰️ Multimedia

R.I.P. 👻 Ghosted

JPEG Steganalysis Based on Steganographic Feature Enhancement and Graph Attention Learning

Qiyun Liu, Zhiguang Yang, Hanzhou Wu

cs.MM 🏛 J. Electronic Imaging 📚 3 cites 3 years ago

R.I.P. 👻 Ghosted

Evaluating Strong Idempotence of Image Codec

Qian Zhang, Tongda Xu, ... (+2 more)

cs.MM 🏛 arXiv 📚 3 cites 3 years ago

R.I.P. 👻 Ghosted

Practical Analyses of How Common Social Media Platforms and Photo Storage Services Handle Uploaded Images

Duc-Tien Dang-Nguyen, Vegard Velle Sjøen, ... (+4 more)

cs.MM 🏛 ICMM 📚 3 cites 3 years ago

R.I.P. 👻 Ghosted

Training Data Improvement for Image Forgery Detection using Comprint

Hannes Mareen, Dante Vanden Bussche, ... (+3 more)

cs.MM 🏛 ICCE 📚 3 cites 3 years ago

R.I.P. 👻 Ghosted

OpenLifelogQA: An Open-Ended Multi-Modal Lifelog Question-Answering Dataset

Quang-Linh Tran, Binh Nguyen, ... (+2 more)

cs.MM 🏛 arXiv 📚 2 cites 10 months ago

R.I.P. 👻 Ghosted

MHier-RAG: Multi-Modal RAG for Visual-Rich Document Question-Answering via Hierarchical and Multi-Granularity Reasoning

Ziyu Gong, Chengcheng Mai, Yihua Huang

cs.MM 🏛 arXiv 📚 2 cites 10 months ago

R.I.P. 👻 Ghosted

Mitigating Modality Bias in Multi-modal Entity Alignment from a Causal Perspective

Taoyu Su, Jiawei Sheng, ... (+6 more)

cs.MM 🏛 SIGIR 📚 2 cites 1 year ago

R.I.P. 👻 Ghosted

An Efficient Recommendation System in E-commerce using Passer learning optimization based on Bi-LSTM

Hemn Barzan Abdalla, Awder Ahmed, ... (+4 more)

cs.MM 🏛 J.CCE 📚 2 cites 2 years ago

R.I.P. 👻 Ghosted

Toward Accessible and Safe Live Streaming Using Distributed Content Filtering with MoQ

Andrew C. Freeman

cs.MM 🏛 ICME W 📚 2 cites 1 year ago

R.I.P. 💀 404 Not Found

Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning

Yi Bin, Junrong Liao, ... (+5 more)

cs.MM 🏛 ACM MM 📚 2 cites 1 year ago

R.I.P. 👻 Ghosted

VCEMO: Multi-Modal Emotion Recognition for Chinese Voiceprints

Jinghua Tang, Liyun Zhang, ... (+7 more)

cs.MM 🏛 arXiv 📚 2 cites 1 year ago

R.I.P. 👻 Ghosted

PPVF: An Efficient Privacy-Preserving Online Video Fetching Framework with Correlated Differential Privacy

Xianzhi Zhang, Yipeng Zhou, ... (+4 more)

cs.MM 🏛 IEEE/ACM ToN 📚 2 cites 1 year ago

R.I.P. 👻 Ghosted

Palantir: Towards Efficient Super Resolution for Ultra-high-definition Live Streaming

Xinqi Jin, Zhui Zhu, ... (+7 more)

cs.MM 🏛 ACM SIGMM Conference on Multimedia Systems 📚 2 cites 1 year ago

R.I.P. 👻 Ghosted

Personality Analysis from Online Short Video Platforms with Multi-domain Adaptation

Sixu An, Xiangguo Sun, ... (+3 more)

cs.MM 🏛 arXiv 📚 2 cites 1 year ago

R.I.P. 👻 Ghosted

AccKV: Towards Efficient Audio-Video LLMs Inference via Adaptive-Focusing and Cross-Calibration KV Cache Optimization

Zhonghua Jiang, Kui Chen, ... (+6 more)

cs.MM 🏛 arXiv 📚 2 cites 7 months ago

R.I.P. 👻 Ghosted

E-FreeM2: Efficient Training-Free Multi-Scale and Cross-Modal News Verification via MLLMs

Van-Hoang Phan, Long-Khanh Pham, ... (+3 more)

cs.MM 🏛 Proceedings of the 2nd Workshop on Security-Centric Strategies for Combating Information Disorder 📚 2 cites 11 months ago

R.I.P. 👻 Ghosted

AV-EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Omni-modal LLMS with Audio-visual Cues

Krish Patel, Dingkun Zhou, ... (+15 more)

cs.MM 🏛 arXiv 📚 2 cites 8 months ago

R.I.P. 👻 Ghosted

XGC-AVis: Towards Audio-Visual Content Understanding with a Multi-Agent Collaborative System

Yuqin Cao, Xiongkuo Min, ... (+5 more)

cs.MM 🏛 arXiv 📚 2 cites 8 months ago

R.I.P. 👻 Ghosted

Fact-Checking at Scale: Multimodal AI for Authenticity and Context Verification in Online Media

Van-Hoang Phan, Tung-Duong Le-Duc, ... (+8 more)

cs.MM 🏛 ACM MM 📚 2 cites 10 months ago

R.I.P. 👻 Ghosted

CatchPhrase: EXPrompt-Guided Encoder Adaptation for Audio-to-Image Generation

Hyunwoo Oh, SeungJu Cha, ... (+3 more)

cs.MM 🏛 ACM MM 📚 2 cites 11 months ago

R.I.P. 👻 Ghosted

Multimodal Framework for Explainable Autonomous Driving: Integrating Video, Sensor, and Textual Data for Enhanced Decision-Making and Transparency

Abolfazl Zarghani, Amirhossein Ebrahimi, Amir Malekesfandiari

cs.MM 🏛 arXiv 📚 2 cites 11 months ago

R.I.P. 👻 Ghosted

Music2Palette: Emotion-aligned Color Palette Generation via Cross-Modal Representation Learning

Jiayun Hu, Yueyi He, ... (+3 more)

cs.MM 🏛 ACM MM 📚 2 cites 11 months ago

R.I.P. 👻 Ghosted

Can Generated Images Serve as a Viable Modality for Text-Centric Multimodal Learning?

Yuesheng Huang, Peng Zhang, ... (+2 more)

cs.MM 🏛 arXiv 📚 2 cites 12 months ago

R.I.P. 👻 Ghosted

Learning Quality from Complexity and Structure: A Feature-Fused XGBoost Model for Video Quality Assessment

Amritha Premkumar, Prajit T Rajendran, Vignesh V Menon

cs.MM 🏛 arXiv 📚 2 cites 1 year ago

🏛️ The Multimedia Crypt