WeChat Mini Program
Old Version Features

A Vector Representation for Phylogenetic Trees

Philosophical transactions of the Royal Society of London Series B, Biological sciences(2025)

Department of Mathematics

Cited 0|Views22
Abstract
Good representations for phylogenetic trees and networks are important for enhancing storage efficiency and scalability for the inference and analysis of evolutionary trees for genes, genomes and species. We propose a new representation for rooted phylogenetic trees that encodes a tree on [Formula: see text] ordered taxa as a vector of length [Formula: see text] in which each taxon appears exactly twice. Using this new tree representation, we introduce a novel tree rearrangement operator, termed an HOP, that results in a tree space of linear diameter and quadratic neighbourhood size. We also introduce a novel metric, the HOP distance, which is the minimum number of HOPs to transform a tree into another tree. The HOP distance can be computed in near-linear time-a rare instance of tree rearrangement distance that is tractable. Our experiments show that the HOP distance is better correlated to the Subtree-Prune-and-Regraft distance than the widely used Robinson-Foulds distance. We also describe how the proposed tree representation can be further generalized to tree-child networks, showcasing its versatility and potential applications in broader evolutionary analyses.This article is part of the theme issue '"A mathematical theory of evolution": phylogenetic models dating back 100 years'.
More
Translated text
PDF
Bibtex
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn
Chat Paper
Summary is being generated by the instructions you defined