32 research outputs found

    Accuracy of phylogeny reconstruction methods combining overlapping gene data sets

    Background: The availability of many gene alignments with overlapping taxon sets raises the question of which strategy is best for inferring species phylogenies from multiple gene information. Methods and programs abound that use the gene alignments in different ways to reconstruct the species tree. In particular, different methods combine the original data at different points along the way from the underlying sequences to the final tree. Accordingly, they are classified into superalignment, supertree and medium-level approaches. Here, we present a simulation study to compare different methods from each of these three approaches. Results: We observe that superalignment methods usually outperform the other approaches over a wide range of parameters, including sparse data and gene-specific evolutionary parameters. In the presence of high incongruency among gene trees, however, other combination methods show better performance than the superalignment approach. Surprisingly, some supertree and medium-level methods exhibit, on average, worse results than a single gene phylogeny with complete taxon information. Conclusions: For some methods, using the reconstructed gene tree as an estimate of the species tree is superior to the combination of incomplete information. Superalignment usually performs best since it is less susceptible to stochastic error. Supertree methods can outperform superalignment in the presence of gene-tree conflict.

    Horizontally transmitted symbiont populations in deep-sea mussels are genetically isolated

    Eukaryotes are habitats for bacterial organisms, where host colonization and dispersal among individual hosts have consequences for bacterial ecology and evolution. Vertical symbiont transmission leads to geographic isolation of the microbial population and consequently to genetic isolation of microbiotas from individual hosts. In contrast, the extent of geographic and genetic isolation of horizontally transmitted microbiota is poorly characterized. Here we show that chemosynthetic symbionts of individual Bathymodiolus brooksi mussels constitute genetically isolated subpopulations. The reconstruction of core genome-wide strains from high-resolution metagenomes revealed distinct phylogenetic clades. Nucleotide diversity and strain composition vary along the mussel life span, and individual hosts show a high degree of genetic isolation. Our results suggest that the uptake of environmental bacteria is a restricted process in B. brooksi, where self-infection of the gill tissue results in serial founder effects during symbiont evolution. We conclude that bacterial colonization dynamics over the host life cycle are thus an important determinant of population structure and genome evolution of horizontally transmitted symbionts.

    Complete genome sequence of the novel phage MG-B1 infecting Bacillus weihenstephanensis

    Here, we describe a novel virulent bacteriophage that infects Bacillus weihenstephanensis, isolated from soil in Austria. It is the first phage discovered that infects this species. We present the complete genome sequence of this podovirus.

    Characterization of Blf4, an Archaeal Lytic Virus Targeting a Member of the Methanomicrobiales

    Today, the number of known viruses infecting methanogenic archaea is limited. Here, we report on a novel lytic virus, designated Blf4, and its host strain Methanoculleus bourgensis E02.3, a methanogenic archaeon belonging to the Methanomicrobiales, both isolated from a commercial biogas plant in Germany. The virus consists of an icosahedral head 60 nm in diameter and a long non-contractile tail 125 nm in length, which is consistent with the new isolate belonging to the Siphoviridae family. Electron microscopy revealed that Blf4 attaches to the vegetative cells of M. bourgensis E02.3 as well as to cellular appendages. Apart from M. bourgensis E02.3, none of the tested Methanoculleus strains were lysed by Blf4, indicating a narrow host range. The complete 37 kb dsDNA genome of Blf4 contains 63 open reading frames (ORFs), all organized in the same transcriptional direction. For most of the ORFs, potential functions were predicted. In addition, the genome of the host M. bourgensis E02.3 was sequenced and assembled, resulting in a 2.6 Mbp draft genome consisting of nine contigs. All genes required for a hydrogenotrophic lifestyle were predicted. A CRISPR/Cas system (type I-U) was identified with six spacers directed against Blf4, indicating that this defense system might not be very efficient in fending off the invading Blf4 virus.

    Do we still need supertrees?

    The updated species-level phylogeny for the carnivores, built using a supertree approach, provides new insights into the evolutionary origin and relationships of carnivores. While the gain in biological knowledge is substantial, the supertree approach is not undisputed. I discuss the principles of supertree methods and of the competing supermatrix approaches. I argue that both methods are important for inferring phylogenetic relationships.

    Split-based computation of majority-rule supertrees

    Background: Supertree methods combine overlapping input trees into a larger supertree. Here, I consider split-based supertree methods that first extract the split information of the input trees and subsequently combine this split information into a phylogeny. Well-known split-based supertree methods are matrix representation with parsimony and matrix representation with compatibility. Combining input trees on the same taxon set, as in the consensus setting, is a well-studied task, and it is thus desirable to generalize consensus methods to supertree methods. Results: Here, three variants of majority-rule (MR) supertrees that generalize majority-rule consensus trees are investigated. I provide simple formulas for computing the respective score for bifurcating input trees and supertrees. These score computations, together with a heuristic tree search minimizing the scores, were implemented in the Python program PluMiST (Plus- and Minus SuperTrees), available from http://www.cibiv.at/software/plumist. The different MR methods were tested by simulation and on real data sets. The search heuristic was successful in combining compatible input trees. When combining incompatible input trees, one variant in particular, MR(-) supertrees, performed well. Conclusions: The presented framework allows for an efficient score computation of three majority-rule supertree variants and input trees. I combined the score computation with a heuristic search over the supertree space. The implementation was tested by simulation and on real data sets and showed promising results. The MR(-) variant in particular seems to be a reasonable score for supertree reconstruction. Generalizing these computations to multifurcating trees is an open problem, which may be tackled using this framework.
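To make the consensus setting mentioned in this abstract concrete, here is a minimal sketch of majority-rule consensus on a shared taxon set: extract the nontrivial splits of each input tree and keep those found in more than half of the trees. It is not the PluMiST implementation and does not compute any of the MR supertree scores; the nested-tuple tree encoding and the function names `leaf_set`, `splits` and `majority_rule_splits` are assumptions made for illustration.

```python
from collections import Counter


def leaf_set(tree):
    """Collect the taxon labels of a tree given as nested tuples of strings."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(leaf_set(child) for child in tree))


def splits(tree, taxa):
    """Return the nontrivial splits of a tree over the taxon set `taxa`."""
    ref = min(taxa)  # fixed reference taxon used to normalize each split
    found = set()

    def visit(node):
        if isinstance(node, str):
            return frozenset([node])
        below = frozenset().union(*(visit(child) for child in node))
        # Represent the split by the side that does not contain `ref`.
        side = below if ref not in below else taxa - below
        # Nontrivial means both sides hold at least two taxa.
        if 1 < len(side) < len(taxa) - 1:
            found.add(side)
        return below

    visit(tree)
    return found


def majority_rule_splits(trees):
    """Keep the splits that occur in more than half of the input trees."""
    taxa = leaf_set(trees[0])  # assumes all trees share this taxon set
    counts = Counter()
    for tree in trees:
        counts.update(splits(tree, taxa))
    return {s for s, c in counts.items() if c > len(trees) / 2}


trees = [
    ((("A", "B"), "C"), ("D", "E")),
    (("A", "B"), ("C", ("D", "E"))),
    ((("A", "C"), "B"), ("D", "E")),
]
consensus = majority_rule_splits(trees)
```

In this toy input, the split separating D and E from the rest appears in all three trees and the split separating A and B from the rest appears in two, so both survive; the split grouping A with C appears only once and is dropped.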

    On the use of cartographic projections in visualizing phylogenetic tree space

    Phylogenetic analysis is becoming an increasingly important tool for biological research. Applications include epidemiological studies, drug development, and evolutionary analysis. Phylogenetic search is a known NP-hard problem. The size of the data sets that can be analyzed is limited by the exponential growth in the number of trees that must be considered as the problem size increases. A better understanding of the problem space could lead to better methods, which in turn could make the analysis of more data sets feasible. We present a definition of phylogenetic tree space and a visualization of this space that shows significant exploitable structure. This structure can be used to develop search methods capable of handling much larger data sets.
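As a rough illustration of the growth this abstract refers to, the sketch below counts unrooted, fully bifurcating, leaf-labeled trees using the standard double-factorial formula (2n - 5)!! for n taxa. The abstract does not say which class of trees its tree space covers, so this choice and the function name `num_unrooted_binary_trees` are assumptions for illustration.

```python
def num_unrooted_binary_trees(n_taxa):
    """Count unrooted bifurcating trees on n labeled taxa: (2n - 5)!!."""
    if n_taxa < 3:
        raise ValueError("need at least 3 taxa")
    count = 1
    # Multiply the odd numbers 3, 5, ..., 2n - 5.
    for k in range(3, 2 * n_taxa - 4, 2):
        count *= k
    return count


for n in (4, 5, 10, 20):
    print(n, num_unrooted_binary_trees(n))
```

Five taxa already give 15 candidate trees and ten taxa give 2,027,025, which is why exhaustive search stops being practical for all but very small data sets.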