From Word to Sense Embeddings: A Survey on Vector Representations of Meaning
May 10, 2018 ยท The Cartographer ยท ๐ Journal of Artificial Intelligence Research
"No code URL or promise found in abstract"
"Title-pattern auto-detect: From Word to Sense Embeddings: A Survey on Vector Representations of Meaning"
Evidence collected by the PWNC Scanner
Authors
Jose Camacho-Collados, Mohammad Taher Pilehvar
arXiv ID
1805.04032
Category
cs.CL: Computation & Language
Cross-listed
cs.AI
Citations
363
Venue
Journal of Artificial Intelligence Research
Last Checked
1 day ago
Abstract
Over the past years, distributed semantic representations have proved to be effective and flexible keepers of prior knowledge to be integrated into downstream applications. This survey focuses on the representation of meaning. We start from the theoretical background behind word vector space models and highlight one of their major limitations: the meaning conflation deficiency, which arises from representing a word with all its possible meanings as a single vector. Then, we explain how this deficiency can be addressed through a transition from the word level to the more fine-grained level of word senses (in its broader acceptation) as a method for modelling unambiguous lexical meaning. We present a comprehensive overview of the wide range of techniques in the two main branches of sense representation, i.e., unsupervised and knowledge-based. Finally, this survey covers the main evaluation procedures and applications for this type of representation, and provides an analysis of four of its important aspects: interpretability, sense granularity, adaptability to different domains and compositionality.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Computation & Language
๐
๐
Old Age
๐
๐
Old Age
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
๐
๐
Old Age
XLNet: Generalized Autoregressive Pretraining for Language Understanding
๐๏ธ
๐๏ธ
Transcended
Effective Approaches to Attention-based Neural Machine Translation
๐
๐
Old Age
A large annotated corpus for learning natural language inference
๐
๐
Old Age