4 research outputs found

    Sequence length requirements for phylogenetic methods

    Get PDF

    Recovering Evolutionary Trees Through Harmonic Greedy Triplets (Extended Abstract)

    No full text
    . We give a greedy learning algorithm for reconstructing an evolutionary tree based on a harmonic average on triplets of taxa. This algorithm runs in polynomial time in the input size. Using the Jukes-Cantor model of evolution, our algorithm is mathematically proven to require sample sequences of only polynomial lengths in the number of taxa in order to recover the correct tree topology with high probability. In addition to recovering the topology, the algorithm also estimates the tree edge lengths with high accuracy. Our theoretical analysis is supported by simulated experiments, in which the algorithm has demonstrated high success rates in reconstructing a large tree from short sequences. 1. Introduction. Algorithms for reconstructing evolutionary trees are principal tools in biology [11]. These algorithms usually compare aligned character sequences for the taxa in question to infer their evolutionary relationships [19]. In the past, such characters were often categorical variables o..
    corecore