A Survey of Document-Level Information Extraction

September 23, 2023 · The Cartographer · 🏛 arXiv.org

"No code URL or promise found in abstract"
"Title-pattern auto-detect: A Survey of Document-Level Information Extraction"

Evidence collected by the PWNC Scanner

Authors Hanwen Zheng, Sijia Wang, Lifu Huang arXiv ID 2309.13249 Category cs.CL: Computation & Language Citations 3 Venue arXiv.org Last Checked 4 days ago

Abstract

Document-level information extraction (IE) is a crucial task in natural language processing (NLP). This paper conducts a systematic review of recent document-level IE literature. In addition, we conduct a thorough error analysis with current state-of-the-art algorithms and identify their limitations as well as the remaining challenges for the task of document-level IE. According to our findings, labeling noises, entity coreference resolution, and lack of reasoning, severely affect the performance of document-level IE. The objective of this survey paper is to provide more insights and help NLP researchers to further enhance document-level IE performance.