8,208 research outputs found

    Defining a Phylogenetic Tree with the Minimum Number of rr-State Characters

    Full text link

    Capturing a phylogenetic tree when the number of character states varies with the number of leaves

    Full text link
    We show that for any two values α,β>0\alpha, \beta >0 for which α+β>1\alpha+\beta>1 then there is a value NN so that for all n≥Nn \geq N the following holds. For any binary phylogenetic tree TT on nn leaves there is a set of ⌊nα⌋\lfloor n^\alpha \rfloor characters that capture TT, and for which each character takes at most ⌊nβ⌋\lfloor n^\beta \rfloor distinct states. Here `capture' means that TT is the unique perfect phylogeny for these characters. Our short proof of this combinatorial result is based on the probabilistic method.Comment: 3 pages, 0 figure

    Constructing computer virus phylogenies

    Get PDF
    There has been much recent algorithmic work on the problem of reconstructing the evolutionary history of biological species. Computer virus specialists are interested in finding the evolutionary history of computer viruses - a virus is often written using code fragments from one or more other viruses, which are its immediate ancestors. A phylogeny for a collection of computer viruses is a directed acyclic graph whose nodes are the viruses and whose edges map ancestors to descendants and satisfy the property that each code fragment is "invented" only once. To provide a simple explanation for the data, we consider the problem of constructing such a phylogeny with a minimum number of edges. In general this optimization problem is NP-complete; some associated approximation problems are also hard, but others are easy. When tree solutions exist, they can be constructed and randomly sampled in polynomial time

    Multivariate Approaches to Classification in Extragalactic Astronomy

    Get PDF
    Clustering objects into synthetic groups is a natural activity of any science. Astrophysics is not an exception and is now facing a deluge of data. For galaxies, the one-century old Hubble classification and the Hubble tuning fork are still largely in use, together with numerous mono-or bivariate classifications most often made by eye. However, a classification must be driven by the data, and sophisticated multivariate statistical tools are used more and more often. In this paper we review these different approaches in order to situate them in the general context of unsupervised and supervised learning. We insist on the astrophysical outcomes of these studies to show that multivariate analyses provide an obvious path toward a renewal of our classification of galaxies and are invaluable tools to investigate the physics and evolution of galaxies.Comment: Open Access paper. http://www.frontiersin.org/milky\_way\_and\_galaxies/10.3389/fspas.2015.00003/abstract\>. \<10.3389/fspas.2015.00003 \&g
    • …
    corecore