229 research outputs found

    Improved Lower Bounds on the Compatibility of Multi-State Characters

    Full text link
    We study a long standing conjecture on the necessary and sufficient conditions for the compatibility of multi-state characters: There exists a function f(r)f(r) such that, for any set CC of rr-state characters, CC is compatible if and only if every subset of f(r)f(r) characters of CC is compatible. We show that for every r2r \ge 2, there exists an incompatible set CC of r2r2+1\lfloor\frac{r}{2}\rfloor\cdot\lceil\frac{r}{2}\rceil + 1 rr-state characters such that every proper subset of CC is compatible. Thus, f(r)r2r2+1f(r) \ge \lfloor\frac{r}{2}\rfloor\cdot\lceil\frac{r}{2}\rceil + 1 for every r2r \ge 2. This improves the previous lower bound of f(r)rf(r) \ge r given by Meacham (1983), and generalizes the construction showing that f(4)5f(4) \ge 5 given by Habib and To (2011). We prove our result via a result on quartet compatibility that may be of independent interest: For every integer n4n \ge 4, there exists an incompatible set QQ of n22n22+1\lfloor\frac{n-2}{2}\rfloor\cdot\lceil\frac{n-2}{2}\rceil + 1 quartets over nn labels such that every proper subset of QQ is compatible. We contrast this with a result on the compatibility of triplets: For every n3n \ge 3, if RR is an incompatible set of more than n1n-1 triplets over nn labels, then some proper subset of RR is incompatible. We show this upper bound is tight by exhibiting, for every n3n \ge 3, a set of n1n-1 triplets over nn taxa such that RR is incompatible, but every proper subset of RR is compatible

    On strongly chordal graphs that are not leaf powers

    Full text link
    A common task in phylogenetics is to find an evolutionary tree representing proximity relationships between species. This motivates the notion of leaf powers: a graph G = (V, E) is a leaf power if there exist a tree T on leafset V and a threshold k such that uv is an edge if and only if the distance between u and v in T is at most k. Characterizing leaf powers is a challenging open problem, along with determining the complexity of their recognition. This is in part due to the fact that few graphs are known to not be leaf powers, as such graphs are difficult to construct. Recently, Nevries and Rosenke asked if leaf powers could be characterized by strong chordality and a finite set of forbidden subgraphs. In this paper, we provide a negative answer to this question, by exhibiting an infinite family \G of (minimal) strongly chordal graphs that are not leaf powers. During the process, we establish a connection between leaf powers, alternating cycles and quartet compatibility. We also show that deciding if a chordal graph is \G-free is NP-complete, which may provide insight on the complexity of the leaf power recognition problem

    Reconstructing a phylogenetic level-1 network from quartets

    Get PDF
    We describe a method that will reconstruct an unrooted binary phylogenetic level-1 network on n taxa from the set of all quartets containing a certain fixed taxon, in O(n^3) time. We also present a more general method which can handle more diverse quartet data, but which takes O(n^6) time. Both methods proceed by solving a certain system of linear equations over GF(2). For a general dense quartet set (containing at least one quartet on every four taxa) our O(n^6) algorithm constructs a phylogenetic level-1 network consistent with the quartet set if such a network exists and returns an (O(n^2) sized) certificate of inconsistency otherwise. This answers a question raised by Gambette, Berry and Paul regarding the complexity of reconstructing a level-1 network from a dense quartet set

    Data incongruence and the problem of avian louse phylogeny

    Get PDF
    Recent studies based on different types of data (i.e. morphological and molecular) have supported conflicting phylogenies for the genera of avian feather lice (Ischnocera: Phthiraptera). We analyse new and published data from morphology and from mitochondrial (12S rRNA and COI) and nuclear (EF1-) genes to explore the sources of this incongruence and explain these conflicts. Character convergence, multiple substitutions at high divergences, and ancient radiation over a short period of time have contributed to the problem of resolving louse phylogeny with the data currently available. We show that apparent incongruence between the molecular datasets is largely attributable to rate variation and nonstationarity of base composition. In contrast, highly significant character incongruence leads to topological incongruence between the molecular and morphological data. We consider ways in which biases in the sequence data could be misleading, using several maximum likelihood models and LogDet corrections. The hierarchical structure of the data is explored using likelihood mapping and SplitsTree methods. Ultimately, we concede there is strong discordance between the molecular and morphological data and apply the conditional combination approach in this case. We conclude that higher level phylogenetic relationships within avian Ischnocera remain extremely problematic. However, consensus between datasets is beginning to converge on a stable phylogeny for avian lice, at and below the familial rank

    Contributions to computational phylogenetics and algorithmic self-assembly

    Get PDF
    This dissertation addresses some of the algorithmic and combinatorial problems at the interface between biology and computation. In particular, it focuses on problems in both computational phylogenetics, an area of study in which computation is used to better understand evolutionary relationships, and algorithmic self-assembly, an area of study in which biological processes are used to perform computation. The first set of results investigate inferring phylogenetic trees from multi-state character data. We give a novel characterization of when a set of three-state characters has a perfect phylogeny and make progress on a long-standing conjecture regarding the compatibility of multi-state characters. The next set of results investigate inferring phylogenetic supertrees from collections of smaller input trees when the input trees do not fully agree on the relative positions of the taxa. Two approaches to dealing with such conflicting input trees are considered. The first is to contract a set of edges in the input trees so that the resulting trees have an agreement supertree. The second is to remove a set of taxa from the input trees so that the resulting trees have an agreement supertree. We give fixed-parameter tractable algorithms for both approaches. We then turn to the algorithmic self-assembly of fractal structures from DNA tiles and investigate approximating the Sierpinski triangle and the Sierpinski carpet with strict self-assembly. We prove tight bounds on approximating the Sierpinski triangle and exhibit a class of fractals that are generalizations of the Sierpinski carpet that can approximately self-assemble. We conclude by discussing some ideas for further research

    Polynomial supertree methods in phylogenomics: algorithms, simulations and software

    Get PDF
    One of the objectives in modern biology, especially phylogenetics, is to build larger clades of the Tree of Life. Large-scale phylogenetic analysis involves several serious challenges. The aim of this thesis is to contribute to some of the open problems in this context. In computational phylogenetics, supertree methods provide a way to reconstruct larger clades of the Tree of Life. We present a novel polynomial time approach for the computation of supertrees called FlipCut supertree. Our method combines the computation of minimum cuts from graph-based methods with a matrix representation method, namely Minimum Flip Supertrees. Here, the input trees are encoded in a 0/1/?-matrix. We present a heuristic to search for a minimum set of 0/1-flips such that the resulting matrix admits a directed perfect phylogeny. In contrast to other polynomial time approaches, our results can be interpreted in the sense that we try to minimize a global objective function, namely the number of flips in the input matrix. We extend our approach by using edge weights to weight the columns of the 0/1/?-matrix. In order to compare our new FlipCut supertree method with other recent polynomial supertree methods and matrix representation methods, we present a large scale simulation study using two different data sets. Our findings illustrate the trade-off between accuracy and running time in supertree construction, as well as the pros and cons of different supertree approaches. Furthermore, we present EPoS, a modular software framework for phylogenetic analysis and visualization. It fills the gap between command line-based algorithmic packages and visual tools without sufficient support for computational methods. By combining a powerful graphical user interface with a plugin system that allows simple integration of new algorithms, visualizations and data structures, we created a framework that is easy to use, to extend and that covers all important steps of a phylogenetic analysis

    CHAMBER MUSIC FOR THE PRE-COLLEGIATE STUDENT: VIOLIN QUARTETS AND TRIOS

    Get PDF
    In both private string studios and school orchestra programs, pre-collegiate students need more opportunities to study chamber music. One of the barriers string teachers face when establishing a chamber music program for children is instrumentation as there are often more students who play the violin than the viola and cello. While the string quartet dominates the strings chamber music genre, there is a large body of underperformed repertoire written for violin quartets and trios that provide a variety of pedagogical benefits to students and teachers alike. This project establishes the pedagogical value of some of these pieces and places each piece within several of the major strings grading systems. Each piece is graded based on the American String Teachers Association Grading Scale, Suzuki Volume Level, and Royal Conservatory Certification Grade Level. These methods were chosen to provide recognizable scale systems for public school directors and private teachers alike. The project also contains a brief description of the three pedagogical methods as well as charts comparing these grading systems to one another for reference
    corecore