R.I.P.
๐ป
Ghosted
A Survey of Multimodal Retrieval-Augmented Generation
March 26, 2025 ยท The Cartographer ยท ๐ arXiv.org
"No code URL or promise found in abstract"
"Title-pattern auto-detect: A Survey of Multimodal Retrieval-Augmented Generation"
Evidence collected by the PWNC Scanner
Authors
Lang Mei, Siyu Mo, Zhihan Yang, Chong Chen
arXiv ID
2504.08748
Category
cs.IR: Information Retrieval
Cross-listed
cs.AI,
cs.CL,
cs.ET,
cs.LG
Citations
28
Venue
arXiv.org
Last Checked
2 days ago
Abstract
Multimodal Retrieval-Augmented Generation (MRAG) enhances large language models (LLMs) by integrating multimodal data (text, images, videos) into retrieval and generation processes, overcoming the limitations of text-only Retrieval-Augmented Generation (RAG). While RAG improves response accuracy by incorporating external textual knowledge, MRAG extends this framework to include multimodal retrieval and generation, leveraging contextual information from diverse data types. This approach reduces hallucinations and enhances question-answering systems by grounding responses in factual, multimodal knowledge. Recent studies show MRAG outperforms traditional RAG, especially in scenarios requiring both visual and textual understanding. This survey reviews MRAG's essential components, datasets, evaluation methods, and limitations, providing insights into its construction and improvement. It also identifies challenges and future research directions, highlighting MRAG's potential to revolutionize multimodal information retrieval and generation. By offering a comprehensive perspective, this work encourages further exploration into this promising paradigm.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Information Retrieval
๐
๐
Old Age
Neural Graph Collaborative Filtering
R.I.P.
๐ป
Ghosted
DeepFM: A Factorization-Machine based Neural Network for CTR Prediction
R.I.P.
๐ป
Ghosted
BERT4Rec: Sequential Recommendation with Bidirectional Encoder Representations from Transformer
R.I.P.
๐
404 Not Found
Graph Neural Networks for Social Recommendation
R.I.P.
๐ป
Ghosted