A Review on the Applications of Transformer-based language models for Nucleotide Sequence Analysis
December 10, 2024 ยท The Cartographer ยท ๐ Computational and Structural Biotechnology Journal
"No code URL or promise found in abstract"
"Title-pattern auto-detect: A Review on the Applications of Transformer-based language models for Nucleotide Sequence Analysis"
Evidence collected by the PWNC Scanner
Authors
Nimisha Ghosh, Daniele Santoni, Indrajit Saha, Giovanni Felici
arXiv ID
2412.07201
Category
cs.CL: Computation & Language
Cross-listed
cs.AI
Citations
8
Venue
Computational and Structural Biotechnology Journal
Last Checked
3 days ago
Abstract
In recent times, Transformer-based language models are making quite an impact in the field of natural language processing. As relevant parallels can be drawn between biological sequences and natural languages, the models used in NLP can be easily extended and adapted for various applications in bioinformatics. In this regard, this paper introduces the major developments of Transformer-based models in the recent past in the context of nucleotide sequences. We have reviewed and analysed a large number of application-based papers on this subject, giving evidence of the main characterizing features and to different approaches that may be adopted to customize such powerful computational machines. We have also provided a structured description of the functioning of Transformers, that may enable even first time users to grab the essence of such complex architectures. We believe this review will help the scientific community in understanding the various applications of Transformer-based language models to nucleotide sequences. This work will motivate the readers to build on these methodologies to tackle also various other problems in the field of bioinformatics.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Computation & Language
๐
๐
Old Age
๐
๐
Old Age
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
๐
๐
Old Age
XLNet: Generalized Autoregressive Pretraining for Language Understanding
๐ฎ
๐ฎ
The Ethereal
Effective Approaches to Attention-based Neural Machine Translation
๐
๐
Old Age
A large annotated corpus for learning natural language inference
๐
๐
Old Age