Training-free Measures Based on Algorithmic Probability Identify High Nucleosome Occupancy in DNA Sequences
August 05, 2017 Β· Declared Dead Β· π arXiv.org
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Hector Zenil, Peter Minary
arXiv ID
1708.01751
Category
q-bio.QM
Cross-listed
cs.IT,
q-bio.GN
Citations
0
Venue
arXiv.org
Last Checked
3 months ago
Abstract
We introduce and study a set of training-free methods of information-theoretic and algorithmic complexity nature applied to DNA sequences to identify their potential capabilities to determine nucleosomal binding sites. We test our measures on well-studied genomic sequences of different sizes drawn from different sources. The measures reveal the known in vivo versus in vitro predictive discrepancies and uncover their potential to pinpoint (high) nucleosome occupancy. We explore different possible signals within and beyond the nucleosome length and find that complexity indices are informative of nucleosome occupancy. We compare against the gold standard (Kaplan model) and find similar and complementary results with the main difference that our sequence complexity approach. For example, for high occupancy, complexity-based scores outperform the Kaplan model for predicting binding representing a significant advancement in predicting the highest nucleosome occupancy following a training-free approach.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β q-bio.QM
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
DeepConv-DTI: Prediction of drug-target interactions via deep learning with convolution on protein sequences
R.I.P.
π»
Ghosted
ProtVec: A Continuous Distributed Representation of Biological Sequences
R.I.P.
π»
Ghosted
A Perspective on Deep Imaging
R.I.P.
π
404 Not Found
Deep learning in bioinformatics: introduction, application, and perspective in big data era
R.I.P.
π»
Ghosted
Data-driven Advice for Applying Machine Learning to Bioinformatics Problems
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted