PhyloGen: Language Model-Enhanced Phylogenetic Inference via Graph Structure Generation
December 25, 2024 · Declared Dead · Neural Information Processing Systems
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors: ChenRui Duan, Zelin Zang, Siyuan Li, Yongjie Xu, Stan Z. Li
arXiv ID: 2412.18827
Category: q-bio.PE
Cross-listed: cs.AI
Citations: 5
Venue: Neural Information Processing Systems
Last Checked: 3 months ago
Abstract
Phylogenetic trees elucidate evolutionary relationships among species, but phylogenetic inference remains challenging due to the complexity of combining continuous parameters (branch lengths) with discrete ones (tree topology). Traditional Markov Chain Monte Carlo methods face slow convergence and computational burdens. Existing Variational Inference methods, which require pre-generated topologies and typically treat tree structures and branch lengths independently, may overlook critical sequence features, limiting their accuracy and flexibility. We propose PhyloGen, a novel method leveraging a pre-trained genomic language model to generate and optimize phylogenetic trees without dependence on evolutionary models or aligned sequence constraints. PhyloGen views phylogenetic inference as a conditionally constrained tree structure generation problem, jointly optimizing tree topology and branch lengths through three core modules: (i) Feature Extraction, (ii) PhyloTree Construction, and (iii) PhyloTree Structure Modeling. We also introduce a Scoring Function that guides the model toward more stable gradient descent. We demonstrate the effectiveness and robustness of PhyloGen on eight real-world benchmark datasets. Visualization results confirm that PhyloGen provides deeper insights into phylogenetic relationships.
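To make the abstract's distinction between discrete and continuous parameters concrete, here is a minimal sketch of a tree that stores both: the nesting of children is the topology, and each edge carries a branch length. This is not PhyloGen's code. The `Node` class, `to_newick`, `tree_length`, and the toy taxa A, B, C are all hypothetical names chosen for illustration, and the only external convention assumed is the standard Newick text format.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    """One node of a rooted tree (hypothetical helper, not from the paper)."""
    name: Optional[str] = None      # leaf label; None for internal nodes
    branch_length: float = 0.0      # continuous parameter: length of edge to parent
    children: List["Node"] = field(default_factory=list)  # discrete parameter: topology


def to_newick(node: Node) -> str:
    """Serialize a subtree to Newick notation, without the trailing ';'."""
    label = node.name or ""
    if node.children:
        inner = ",".join(to_newick(child) for child in node.children)
        label = f"({inner}){label}"
    return f"{label}:{node.branch_length:g}"


def tree_length(node: Node) -> float:
    """Sum of all branch lengths in the subtree rooted at `node`."""
    return node.branch_length + sum(tree_length(c) for c in node.children)


# Toy three-taxon tree: A and B are sisters, C is the outgroup.
ab = Node(branch_length=0.05, children=[Node("A", 0.1), Node("B", 0.2)])
root = Node(children=[ab, Node("C", 0.3)])

# The root has no parent edge, so drop its ":0" suffix before terminating.
newick = to_newick(root).rsplit(":", 1)[0] + ";"
print(newick)
print(round(tree_length(root), 6))
```

Changing which nodes are siblings alters the topology, while changing a `branch_length` leaves the topology intact. A method that optimizes both jointly, as the abstract describes, has to search over both kinds of change at once.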
Similar Papers
In the same crypt: q-bio.PE
Simulating COVID-19 in a University Environment (R.I.P., Ghosted)
How morphological development can guide evolution (R.I.P., Ghosted)
Evolutionary forces in language change (R.I.P., Ghosted)
Entropy and Diversity: The Axiomatic Approach (R.I.P., Ghosted)
The evolution of conditional moral assessment in indirect reciprocity (R.I.P., Ghosted)
Died the same way: Ghosted
Federated Learning: Strategies for Improving Communication Efficiency (R.I.P., Ghosted)
In-Datacenter Performance Analysis of a Tensor Processing Unit (R.I.P., Ghosted)
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning (R.I.P., Ghosted)