Leveraging Decoder Architectures for Learned Sparse Retrieval

April 25, 2025 · Declared Dead · 🏛 KEIR

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Jingfen Qiao, Thong Nguyen, Evangelos Kanoulas, Andrew Yates arXiv ID 2504.18151 Category cs.IR: Information Retrieval Citations 4 Venue KEIR Last Checked 4 months ago

Abstract

Learned Sparse Retrieval (LSR) has traditionally focused on small-scale encoder-only transformer architectures. With the advent of large-scale pre-trained language models, their capability to generate sparse representations for retrieval tasks across different transformer-based architectures, including encoder-only, decoder-only, and encoder-decoder models, remains largely unexplored. This study investigates the effectiveness of LSR across these architectures, exploring various sparse representation heads and model scales. Our results highlight the limitations of using large language models to create effective sparse representations in zero-shot settings, identifying challenges such as inappropriate term expansions and reduced performance due to the lack of expansion. We find that the encoder-decoder architecture with multi-tokens decoding approach achieves the best performance among the three backbones. While the decoder-only model performs worse than the encoder-only model, it demonstrates the potential to outperform when scaled to a high number of parameters.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — Information Retrieval

R.I.P. 👻 Ghosted

Graph Convolutional Neural Networks for Web-Scale Recommender Systems

Rex Ying, Ruining He, ... (+4 more)

cs.IR 🏛 KDD 📚 4.0K cites 8 years ago

🌅 🌅 Old Age

Neural Graph Collaborative Filtering

Xiang Wang, Xiangnan He, ... (+3 more)

cs.IR 🏛 SIGIR 📚 3.6K cites 7 years ago

R.I.P. 👻 Ghosted

DeepFM: A Factorization-Machine based Neural Network for CTR Prediction

Huifeng Guo, Ruiming Tang, ... (+3 more)

cs.IR 🏛 IJCAI 📚 3.0K cites 9 years ago

R.I.P. 👻 Ghosted

BERT4Rec: Sequential Recommendation with Bidirectional Encoder Representations from Transformer

Fei Sun, Jun Liu, ... (+5 more)

cs.IR 🏛 CIKM 📚 2.9K cites 7 years ago

R.I.P. 💀 404 Not Found

Graph Neural Networks for Social Recommendation

Wenqi Fan, Yao Ma, ... (+5 more)

cs.IR 🏛 WWW 📚 2.2K cites 7 years ago

R.I.P. 👻 Ghosted

Personalized Top-N Sequential Recommendation via Convolutional Sequence Embedding

Jiaxi Tang, Ke Wang

cs.IR 🏛 WSDM 📚 2.0K cites 7 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 9 years ago