A Vector Representation for Phylogenetic Trees
May 11, 2024 Β· Declared Dead Β· π arXiv.org
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Cedric Chauve, Caroline Colijn, Louxin Zhang
arXiv ID
2405.07110
Category
q-bio.PE
Cross-listed
cs.DS,
math.CO
Citations
6
Venue
arXiv.org
Last Checked
3 months ago
Abstract
Good representations for phylogenetic trees and networks are important for optimizing storage efficiency and implementation of scalable methods for the inference and analysis of evolutionary trees for genes, genomes and species. We introduce a new representation for rooted phylogenetic trees that encodes a binary tree on n taxa as a vector of length 2n in which each taxon appears exactly twice. Using this new tree representation, we introduce a novel tree rearrangement operator, called a HOP, that results in a tree space of diameter n and a quadratic neighbourhood size. We also introduce a novel metric, the HOP distance, which is the minimum number of HOPs to transform a tree into another tree. The HOP distance can be computed in near-linear time, a rare instance of a tree rearrangement distance that is tractable. Our experiments show that the HOP distance is better correlated to the Subtree-Prune-and-Regraft distance than the widely used Robinson-Foulds distance. We also describe how the novel tree representation we introduce can be further generalized to tree-child networks.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β q-bio.PE
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
Simulating COVID-19 in a University Environment
R.I.P.
π»
Ghosted
How morphological development can guide evolution
R.I.P.
π»
Ghosted
Evolutionary forces in language change
R.I.P.
π»
Ghosted
Entropy and Diversity: The Axiomatic Approach
R.I.P.
π»
Ghosted
The evolution of conditional moral assessment in indirect reciprocity
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted