Neural Named Entity Recognition for Kazakh

July 17, 2020 Β· Declared Dead Β· πŸ› Conference on Intelligent Text Processing and Computational Linguistics

πŸ‘» CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Gulmira Tolegen, Alymzhan Toleu, Orken Mamyrbayev, Rustam Mussabayev arXiv ID 2007.13626 Category cs.IR: Information Retrieval Cross-listed cs.CL Citations 12 Venue Conference on Intelligent Text Processing and Computational Linguistics Last Checked 4 months ago
Abstract
We present several neural networks to address the task of named entity recognition for morphologically complex languages (MCL). Kazakh is a morphologically complex language in which each root/stem can produce hundreds or thousands of variant word forms. This nature of the language could lead to a serious data sparsity problem, which may prevent the deep learning models from being well trained for under-resourced MCLs. In order to model the MCLs' words effectively, we introduce root and entity tag embedding plus tensor layer to the neural networks. The effects of those are significant for improving NER model performance of MCLs. The proposed models outperform state-of-the-art including character-based approaches, and can be potentially applied to other morphologically complex languages.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

πŸ“œ Similar Papers

In the same crypt β€” Information Retrieval

Died the same way β€” πŸ‘» Ghosted