Large-Scale Evaluation of Keyphrase Extraction Models

March 10, 2020 Β· Declared Dead Β· πŸ› ACM/IEEE Joint Conference on Digital Libraries

πŸ‘» CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Ygor Gallina, Florian Boudin, BΓ©atrice Daille arXiv ID 2003.04628 Category cs.IR: Information Retrieval Citations 16 Venue ACM/IEEE Joint Conference on Digital Libraries Last Checked 4 months ago
Abstract
Keyphrase extraction models are usually evaluated under different, not directly comparable, experimental setups. As a result, it remains unclear how well proposed models actually perform, and how they compare to each other. In this work, we address this issue by presenting a systematic large-scale analysis of state-of-the-art keyphrase extraction models involving multiple benchmark datasets from various sources and domains. Our main results reveal that state-of-the-art models are in fact still challenged by simple baselines on some datasets. We also present new insights about the impact of using author- or reader-assigned keyphrases as a proxy for gold standard, and give recommendations for strong baselines and reliable benchmark datasets.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

πŸ“œ Similar Papers

In the same crypt β€” Information Retrieval

Died the same way β€” πŸ‘» Ghosted