229 research outputs found
Improved Lower Bounds on the Compatibility of Multi-State Characters
We study a long standing conjecture on the necessary and sufficient
conditions for the compatibility of multi-state characters: There exists a
function such that, for any set of -state characters, is
compatible if and only if every subset of characters of is
compatible. We show that for every , there exists an incompatible set
of -state
characters such that every proper subset of is compatible. Thus, for every .
This improves the previous lower bound of given by Meacham (1983),
and generalizes the construction showing that given by Habib and
To (2011). We prove our result via a result on quartet compatibility that may
be of independent interest: For every integer , there exists an
incompatible set of
quartets over
labels such that every proper subset of is compatible. We contrast this
with a result on the compatibility of triplets: For every , if is
an incompatible set of more than triplets over labels, then some
proper subset of is incompatible. We show this upper bound is tight by
exhibiting, for every , a set of triplets over taxa such
that is incompatible, but every proper subset of is compatible
On strongly chordal graphs that are not leaf powers
A common task in phylogenetics is to find an evolutionary tree representing
proximity relationships between species. This motivates the notion of leaf
powers: a graph G = (V, E) is a leaf power if there exist a tree T on leafset V
and a threshold k such that uv is an edge if and only if the distance between u
and v in T is at most k. Characterizing leaf powers is a challenging open
problem, along with determining the complexity of their recognition. This is in
part due to the fact that few graphs are known to not be leaf powers, as such
graphs are difficult to construct. Recently, Nevries and Rosenke asked if leaf
powers could be characterized by strong chordality and a finite set of
forbidden subgraphs.
In this paper, we provide a negative answer to this question, by exhibiting
an infinite family \G of (minimal) strongly chordal graphs that are not leaf
powers. During the process, we establish a connection between leaf powers,
alternating cycles and quartet compatibility. We also show that deciding if a
chordal graph is \G-free is NP-complete, which may provide insight on the
complexity of the leaf power recognition problem
Reconstructing a phylogenetic level-1 network from quartets
We describe a method that will reconstruct an unrooted binary phylogenetic
level-1 network on n taxa from the set of all quartets containing a certain
fixed taxon, in O(n^3) time. We also present a more general method which can
handle more diverse quartet data, but which takes O(n^6) time. Both methods
proceed by solving a certain system of linear equations over GF(2).
For a general dense quartet set (containing at least one quartet on every
four taxa) our O(n^6) algorithm constructs a phylogenetic level-1 network
consistent with the quartet set if such a network exists and returns an (O(n^2)
sized) certificate of inconsistency otherwise. This answers a question raised
by Gambette, Berry and Paul regarding the complexity of reconstructing a
level-1 network from a dense quartet set
Data incongruence and the problem of avian louse phylogeny
Recent studies based on different types of data (i.e. morphological and molecular) have supported conflicting phylogenies for the genera of avian feather lice (Ischnocera: Phthiraptera). We analyse new and published data from morphology and from mitochondrial (12S rRNA and COI) and nuclear (EF1-) genes to explore the sources of this incongruence and explain these conflicts. Character convergence, multiple substitutions at high divergences, and ancient radiation over a short period of time have contributed to the problem of resolving louse phylogeny with the data currently available. We show that apparent incongruence between the molecular datasets is largely attributable to rate variation and nonstationarity of base composition. In contrast, highly significant character incongruence leads to topological incongruence between the molecular and morphological data. We consider ways in which biases in the sequence data could be misleading, using several maximum likelihood models and LogDet corrections. The hierarchical structure of the data is explored using likelihood mapping and SplitsTree methods. Ultimately, we concede there is strong discordance between the molecular and morphological data and apply the conditional combination approach in this case. We conclude that higher level phylogenetic relationships within avian Ischnocera remain extremely problematic. However, consensus between datasets is beginning to converge on a stable phylogeny for avian lice, at and below the familial rank
Contributions to computational phylogenetics and algorithmic self-assembly
This dissertation addresses some of the algorithmic and combinatorial problems at the interface between biology and computation.
In particular, it focuses on problems in both computational phylogenetics, an area of study in which computation is used to better understand evolutionary relationships, and algorithmic self-assembly, an area of study in which biological processes are used to perform computation.
The first set of results investigate inferring phylogenetic trees from multi-state character data. We give a novel characterization of when a set of three-state characters has a perfect phylogeny and make progress on a long-standing conjecture regarding the compatibility of multi-state characters.
The next set of results investigate inferring phylogenetic supertrees from collections of smaller input trees when the input trees do not fully agree on the relative positions of the taxa. Two approaches to dealing with such conflicting input trees are considered. The first is to contract a set of edges in the input trees so that the resulting trees have an agreement supertree. The second is to remove a set of taxa from the input trees so that the resulting trees have an agreement supertree. We give fixed-parameter tractable algorithms for both approaches.
We then turn to the algorithmic self-assembly of fractal structures from DNA tiles and investigate approximating the Sierpinski triangle and the Sierpinski carpet with strict self-assembly. We prove tight bounds on approximating the Sierpinski triangle and exhibit a class of fractals that are generalizations of the Sierpinski carpet that can approximately self-assemble.
We conclude by discussing some ideas for further research
Polynomial supertree methods in phylogenomics: algorithms, simulations and software
One of the objectives in modern biology, especially phylogenetics, is to build larger clades of the Tree of Life. Large-scale phylogenetic analysis involves several serious challenges. The aim of this thesis is to contribute to some of the open problems in this context. In computational phylogenetics, supertree methods provide a way to reconstruct larger clades of the Tree of Life. We present a novel polynomial time approach for the computation of supertrees called FlipCut supertree. Our method combines the computation of minimum cuts from graph-based methods with a matrix representation method, namely Minimum Flip Supertrees. Here, the input trees are encoded in a 0/1/?-matrix. We present a heuristic to search for a minimum set of 0/1-flips such that the resulting matrix admits a directed perfect phylogeny. In contrast to other polynomial time approaches, our results can be interpreted in the sense that we try to minimize a global objective function, namely the number of flips in the input matrix. We extend our approach by using edge weights to weight the columns of the 0/1/?-matrix. In order to compare our new FlipCut supertree method with other recent polynomial supertree methods and matrix representation methods, we present a large scale simulation study using two different data sets. Our findings illustrate the trade-off between accuracy and running time in supertree construction, as well as the pros and cons of different supertree approaches. Furthermore, we present EPoS, a modular software framework for phylogenetic analysis and visualization. It fills the gap between command line-based algorithmic packages and visual tools without sufficient support for computational methods. By combining a powerful graphical user interface with a plugin system that allows simple integration of new algorithms, visualizations and data structures, we created a framework that is easy to use, to extend and that covers all important steps of a phylogenetic analysis
CHAMBER MUSIC FOR THE PRE-COLLEGIATE STUDENT: VIOLIN QUARTETS AND TRIOS
In both private string studios and school orchestra programs, pre-collegiate students need more opportunities to study chamber music. One of the barriers string teachers face when establishing a chamber music program for children is instrumentation as there are often more students who play the violin than the viola and cello. While the string quartet dominates the strings chamber music genre, there is a large body of underperformed repertoire written for violin quartets and trios that provide a variety of pedagogical benefits to students and teachers alike. This project establishes the pedagogical value of some of these pieces and places each piece within several of the major strings grading systems. Each piece is graded based on the American String Teachers Association Grading Scale, Suzuki Volume Level, and Royal Conservatory Certification Grade Level. These methods were chosen to provide recognizable scale systems for public school directors and private teachers alike. The project also contains a brief description of the three pedagogical methods as well as charts comparing these grading systems to one another for reference
- …