BERGEN: A Benchmarking Library for Retrieval-Augmented Generation

July 01, 2024 Β· Declared Dead Β· πŸ› Conference on Empirical Methods in Natural Language Processing

πŸ’€ CAUSE OF DEATH: 404 Not Found
Code link is broken/dead
Authors David Rau, HervΓ© DΓ©jean, Nadezhda Chirkova, Thibault Formal, Shuai Wang, Vassilina Nikoulina, StΓ©phane Clinchant arXiv ID 2407.01102 Category cs.CL: Computation & Language Cross-listed cs.IR Citations 25 Venue Conference on Empirical Methods in Natural Language Processing Repository https://github.com/naver/bergen} Last Checked 2 months ago
Abstract
Retrieval-Augmented Generation allows to enhance Large Language Models with external knowledge. In response to the recent popularity of generative LLMs, many RAG approaches have been proposed, which involve an intricate number of different configurations such as evaluation datasets, collections, metrics, retrievers, and LLMs. Inconsistent benchmarking poses a major challenge in comparing approaches and understanding the impact of each component in the pipeline. In this work, we study best practices that lay the groundwork for a systematic evaluation of RAG and present BERGEN, an end-to-end library for reproducible research standardizing RAG experiments. In an extensive study focusing on QA, we benchmark different state-of-the-art retrievers, rerankers, and LLMs. Additionally, we analyze existing RAG metrics and datasets. Our open-source library BERGEN is available under \url{https://github.com/naver/bergen}.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

πŸ“œ Similar Papers

In the same crypt β€” Computation & Language

πŸŒ… πŸŒ… Old Age

Attention Is All You Need

Ashish Vaswani, Noam Shazeer, ... (+6 more)

cs.CL πŸ› NeurIPS πŸ“š 166.0K cites 8 years ago

Died the same way β€” πŸ’€ 404 Not Found