Testing pre-trained Transformer models for Lithuanian news clustering

April 03, 2020 · Declared Dead · 🏛 International Conference on Information Technology

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Lukas Stankevičius, Mantas Lukoševičius arXiv ID 2004.03461 Category cs.IR: Information Retrieval Cross-listed cs.CL, cs.LG Citations 8 Venue International Conference on Information Technology Last Checked 4 months ago

Abstract

A recent introduction of Transformer deep learning architecture made breakthroughs in various natural language processing tasks. However, non-English languages could not leverage such new opportunities with the English text pre-trained models. This changed with research focusing on multilingual models, where less-spoken languages are the main beneficiaries. We compare pre-trained multilingual BERT, XLM-R, and older learned text representation methods as encodings for the task of Lithuanian news clustering. Our results indicate that publicly available pre-trained multilingual Transformer models can be fine-tuned to surpass word vectors but still score much lower than specially trained doc2vec embeddings.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — Information Retrieval

R.I.P. 👻 Ghosted

Graph Convolutional Neural Networks for Web-Scale Recommender Systems

Rex Ying, Ruining He, ... (+4 more)

cs.IR 🏛 KDD 📚 4.0K cites 8 years ago

🌅 🌅 Old Age

Neural Graph Collaborative Filtering

Xiang Wang, Xiangnan He, ... (+3 more)

cs.IR 🏛 SIGIR 📚 3.6K cites 7 years ago

R.I.P. 👻 Ghosted

DeepFM: A Factorization-Machine based Neural Network for CTR Prediction

Huifeng Guo, Ruiming Tang, ... (+3 more)

cs.IR 🏛 IJCAI 📚 3.0K cites 9 years ago

R.I.P. 👻 Ghosted

BERT4Rec: Sequential Recommendation with Bidirectional Encoder Representations from Transformer

Fei Sun, Jun Liu, ... (+5 more)

cs.IR 🏛 CIKM 📚 2.9K cites 7 years ago

R.I.P. 💀 404 Not Found

Graph Neural Networks for Social Recommendation

Wenqi Fan, Yao Ma, ... (+5 more)

cs.IR 🏛 WWW 📚 2.2K cites 7 years ago

R.I.P. 👻 Ghosted

Personalized Top-N Sequential Recommendation via Convolutional Sequence Embedding

Jiaxi Tang, Ke Wang

cs.IR 🏛 WSDM 📚 2.0K cites 7 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 9 years ago