๐
๐
Old Age
Enriching language models with graph-based context information to better understand textual data
May 10, 2023 ยท Entered Twilight ยท ๐ Electronics
Repo contents: .gitignore, .gitmodules, LICENSE, README.md, gc_bert, pubmed, requirements.txt, setup.py
Authors
Albert Roethel, Maria Ganzha, Anna Wrรณblewska
arXiv ID
2305.11070
Category
cs.CL: Computation & Language
Cross-listed
cs.AI,
cs.LG,
cs.NE
Citations
3
Venue
Electronics
Repository
https://github.com/tryptofanik/gc-bert
Last Checked
3 months ago
Abstract
A considerable number of texts encountered daily are somehow connected with each other. For example, Wikipedia articles refer to other articles via hyperlinks, scientific papers relate to others via citations or (co)authors, while tweets relate via users that follow each other or reshare content. Hence, a graph-like structure can represent existing connections and be seen as capturing the "context" of the texts. The question thus arises if extracting and integrating such context information into a language model might help facilitate a better automated understanding of the text. In this study, we experimentally demonstrate that incorporating graph-based contextualization into BERT model enhances its performance on an example of a classification task. Specifically, on Pubmed dataset, we observed a reduction in error from 8.51% to 7.96%, while increasing the number of parameters just by 1.6%. Our source code: https://github.com/tryptofanik/gc-bert
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Computation & Language
๐
๐
Old Age
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
๐
๐
Old Age
XLNet: Generalized Autoregressive Pretraining for Language Understanding
๐ฎ
๐ฎ
The Ethereal
Effective Approaches to Attention-based Neural Machine Translation
๐
๐
Old Age
A large annotated corpus for learning natural language inference
๐
๐
Old Age