22 research outputs found

    Hybrid capture of 964 nuclear genes resolves evolutionary relationships in the mimosoid legumes and reveals the polytomous origins of a large pantropical radiation

    Get PDF
    PREMISE Targeted enrichment methods facilitate sequencing of hundreds of nuclear loci to enhance phylogenetic resolution and elucidate why some parts of the “tree of life” are difficult (if not impossible) to resolve. The mimosoid legumes are a prominent pantropical clade of ~3300 species of woody angiosperms for which previous phylogenies have shown extensive lack of resolution, especially among the species‐rich and taxonomically challenging ingoids. METHODS We generated transcriptomes to select low‐copy nuclear genes, enrich these via hybrid capture for representative species of most mimosoid genera, and analyze the resulting data using de novo assembly and various phylogenomic tools for species tree inference. We also evaluate gene tree support and conflict for key internodes and use phylogenetic network analysis to investigate phylogenetic signal across the ingoids. RESULTS Our selection of 964 nuclear genes greatly improves phylogenetic resolution across the mimosoid phylogeny and shows that the ingoid clade can be resolved into several well‐supported clades. However, nearly all loci show lack of phylogenetic signal for some of the deeper internodes within the ingoids. CONCLUSIONS Lack of resolution in the ingoid clade is most likely the result of hyperfast diversification, potentially causing a hard polytomy of six or seven lineages. The gene set for targeted sequencing presented here offers great potential to further enhance the phylogeny of mimosoids and the wider Caesalpinioideae with denser taxon sampling, to provide a framework for taxonomic reclassification, and to study the ingoid radiation

    The evolution of the pan-genome of Shiga-toxin (Stx) producing Escherichia coli and the Stx₂ bacteriophage

    Get PDF
    xvi, 206 leaves : ill. (some col.) ; 29 cmHuman infections with Shiga-toxin (Stx)-producing E. coli (STEC) vary in severity of illness. The pan-genome of a bacterial species contains a shared, essential core genome, and a variably distributed accessory genome. While single nucleotide changes likely influence virulence in STEC, horizontal gene transfer (HGT) on elements such as bacteriophage are thought to be most important. My thesis objectives were to: 1) develop tools for the pangenomic analyses of bacterial genomes; 2) describe the phylogeny of STEC and; 3) determine if the evolution of the Stx2-bacteriophage parallels that of its bacterial host. For this thesis, the software program Panseq was created and used to identify pan-genomic differences among STEC. Whole-genome phylogenies showed all serotypes as discrete clusters, with O157:H7 having three distinct lineages and grouping separately from all other STEC. Finally, the phylogenies of Stx2-bacteriophage and their bacterial hosts were largely concordant, with occasional instances of HGT having led to novel pathogen emergence

    SOCIAL MEDIA ANALYTICS − A UNIFYING DEFINITION, COMPREHENSIVE FRAMEWORK, AND ASSESSMENT OF ALGORITHMS FOR IDENTIFYING INFLUENCERS IN SOCIAL MEDIA

    Get PDF
    Given its relative infancy, there is a dearth of research on a comprehensive view of business social media analytics (SMA). This dissertation first examines current literature related to SMA and develops an integrated, unifying definition of business SMA, providing a nuanced starting point for future business SMA research. This dissertation identifies several benefits of business SMA, and elaborates on some of them, while presenting recent empirical evidence in support of foregoing observations. The dissertation also describes several challenges facing business SMA today, along with supporting evidence from the literature, some of which also offer mitigating solutions in particular contexts. The second part of this dissertation studies one SMA implication focusing on identifying social influencer. Growing social media usage, accompanied by explosive growth in SMA, has resulted in increasing interest in finding automated ways of discovering influencers in online social interactions. Beginning 2008, many variants of multiple basic approaches have been proposed. Yet, there is no comprehensive study investigating the relative efficacy of these methods in specific settings. This dissertation investigates and reports on the relative performance of multiple methods on Twitter datasets containing between them tens of thousands to hundreds of thousands of tweets. Accordingly, the second part of the dissertation helps further an understanding of business SMA and its many aspects, grounded in recent empirical work, and is a basis for further research and development. This dissertation provides a relatively comprehensive understanding of SMA and the implementation SMA in influencer identification

    Genome Characterization of natural Saccharomyces hybrids of biotechnological interest

    Get PDF
    At present, the genus Saccharomyces comprises seven species according to their patterns of breeding. The species boundaries are not clear due to the description of several reticulate events due to introgression and hybridization. In the last decade, new natural hybrids have been described in wine and brewing, such as S. cerevisiae x S. kudriavzevii. Due to new practices in wine and beer production, together with consequences in grape properties due to climatic change, led biotechnological companies to search for new yeast strains. In this context, hybrids have become of importance to biotechnological industries because they show good fermentative performance at low temperatures and produce new organoleptic compounds of industrial interest. This doctoral thesis explores the evolution of the natural S. cerevisiae x S. kudriavzevii hybrids and the importance of hybridization in the evolution of the Saccharomyces species. This study was performed by using different molecular approaches combined with bioinformatic tools for phylogenetic tree/networks reconstruction and data analysis. Understanding the origin and genome characteristics of natural S. cerevisiae x S. kudriavzevii hybrids are our priority for obtaining, in the future, personalized yeasts with new properties of biotechnological interest

    Phylogeny and chloroplast evolution in Brassicaceae

    Get PDF
    Brassicaceae is a large family of flowering plants, characterized by cruciform corolla, tetradynamous stamen and capsular fruit. In light of the important economic and scientific values of Brassicaceae, many phylogenetic and systematic studies were carried out. One recent and important phylogenetic analysis revealed three major lineages (I, II and III), however, classification at different taxonomic levels (tribe, genus, and species) remained problematic and evolutionary relationships among and within these lineages were still largely unclear. This is partly due to the fact that the past studies lacked information, as they mainly utilized the morphological data, nuclear DNA, partial chloroplast (cp) genes and so on. Nowadays, next generation sequencing (NGS) technology provides the possibility to make use of big data in phylogeny and evolutionary studies. Thus, we sequenced the chloroplast genomes of 80 representative species, using additional 15 reference chloroplast genomes from the NCBI database, and carried out both the phylogenetic reconstruction and the study of protein coding genes evolution in this novel dataset with different methods. Several novel results were obtained. 1 Successful application of NGS technology in chloroplast genome sequencing. During the final assembly, I could reconstruct full chloroplast genomes and the structure maps for 14 out of 80 sampled species, while the remaining were assembled nearly completely with only few gaps remaining. 2 Characterization of chloroplast genome structure. Gene number and order, single sequence repeat (SSR) as well as variety and distribution of large repeat sequence were characterized. 3 The difference of codon usage frequency was calculated between Cardamine resedifolia and Cardamine impatiens. Twelve genes with signatures of positive selection were identified at a family-wide level. 4 Three major lineages (I – III) were confirmed with high support values. Besides, the positions of various tribes were reclassified. Relationships among and within these lineages were highly resolved and supported in the final tree. Most of the tribes in the analyses were inferred to be monophyletic, only Thlaspideae was paraphyletic. Anastaticeae was for the first time classified into position of expanded lineage II, and position of tribe Lepidieae was delimited with relatively low support values in the final phylogenetic tree. This study was a new and successful application of NGS in large-scale Brassicaceae phylogeny and evolution, which offered the chance to look in details of the structural and functional features of the chloroplast genome. These results provided a paradigm on how to proceed towards the full elucidation of the evolutionary relationships among various biological species in the tree of life

    Genome evolution in Prochlorococcus and marine Synechococcus

    Get PDF

    Integrating genomics with the fossil record to explore the evolutionary history of Echinoidea

    Get PDF
    Echinoidea constitutes one of five major clades of living echinoderms, marine animals uniquely characterized by a pentaradial symmetry. Approximately 1,000 living and 10,000 extinct species have been described, including many commonly known as sea urchins, heart urchins and sand dollars. Today, echinoids are ubiquitous in benthic marine environments, where they strongly affect the functioning of biodiverse communities such as coral reefs and kelp forests. Given the quality of their fossil record, their remarkable morphological complexity and our thorough understanding of their development, echinoids provide unparalleled opportunities to explore evolutionary questions in deep-time, providing access to the developmental and morphological underpinnings of evolutionary innovation. These questions cannot be addressed without first resolving the phylogenetic relationships among living and extinct lineages. The goal of this dissertation is to advance our understanding of echinoid relationships and evolutionary history, as well as to explore more broadly the integration of phylogenomic, morphological and paleontological data in phylogenetic reconstruction and macroevolutionary inference.In Chapter 1, I report the results of the first phylogenomic analysis of echinoids based on the sequencing of 17 novel echinoid transcriptomes. Phylogenetic analyses of this data resolve the position of several clades—including the sand dollars—in disagreement with traditional morphological hypotheses. I demonstrate the presence of a strong phylogenetic signal for these novel resolutions, and explore scenarios to reconcile these findings with morphological evidence. In Chapter 7, I extend this approach with a more thorough taxon sampling, resulting in a robust topology with a near-complete sampling of major echinoid lineages. This effort reveals that apatopygids, a clade of three species with previously unclear affinities, represent the only living descendants of a once diverse Mesozoic clade. I also perform a thorough time calibration analysis, quantifying the relative effects of choosing among alternative models of molecular evolution, gene samples and clock priors. I introduce the concept of a chronospace and use it to reveal that only the last among the aforementioned choices affects significantly our understanding of echinoid diversification. Molecular clocks unambiguously support late Permian and late Cretaceous origins for crown group echinoids and sand dollars, respectively, implying long ghost ranges for both. Fossils have been shown to improve the accuracy of phylogenetic comparative methods, warranting their inclusion alongside extant terminals when exploring evolutionary processes across deep timescales. However, their impact on topological inference remains controversial. I explore this topic in Chapter 3 with the use of simulations, which show that morphological phylogenies are more accurate when fossil taxa are incorporated. I also show that tip-dated Bayesian inference, which takes stratigraphic information from fossils into account, outperforms uncalibrated methods. This approach is complemented in Chapter 2 with the analysis of empirical datasets, confirming that incorporating fossils reshapes phylogenies in a manner that is entirely distinct from increased sampling of extant taxa, a result largely attributable to the occurrence of distinctive character combinations among fossils. Even though phylogenomic and paleontological data are complementary resources for unraveling the relationships and divergence times of lineages, few studies have attempted to fully integrate them. Chapter 4 revisits the phylogeny of crown group Echinoidea using a total-evidence dating approach combining phylogenomic, morphological and stratigraphic information. To this end, I develop a method (genesortR) for subsampling molecular datasets that selects loci with high phylogenetic signal and low systematic biases. The results demonstrate that combining different data sources increases topological accuracy and helps resolve phylogenetic conflicts. Notably, I present a new hypothesis for the origin and early morphological evolution of the sand dollars and close allies. In Chapter 6, I compare the behavior of genesortR against alternative subsampling strategies across a sample of phylogenomic matrices. I find this method to systematically outperform random loci selection, unlike commonly-used approaches that target specific evolutionary rates or minimize sources of systematic error. I conclude that these methods should not be used indiscriminately, and that multivariate methods of phylogenomic subsampling should be favored. Finally, in Chapter 5, I explore the macroevolutionary dynamics of echinoid body size across 270 million years using data for more than 5,000 specimens in a phylogenetically explicit context. I also develop a method (extendedSurface) for parameterizing adaptive landscapes that overcomes issues with existing approaches and finds better fitting models. While echinoid body size has been largely constrained to evolve within a single adaptive peak, the disparity of the clade was generated by regime shifts driving the repeated evolution of miniaturized and gigantic forms. Most innovations occurred during the latter half of the Mesozoic, and were followed by a drastic slowdown in the aftermath of the Cretaceous-Paleogene mass extinction

    Phylogenetics, morpohology, and genomic evolution in Physarieae (Brassicaceae)

    Get PDF
    The mustard family (Brassicaceae) is economically important but the evolution of its morphology is not well understood. I investigate the evolution of morphological and genomic characters and calculate ancestral trait values in the Brassicaceae within a phylogenetic context using the tribe Physarieae as a model system. Physarieae are a unique and diverse group of American mustards characterized by multi-aperturate pollen. Phylogenetic analyses of DNA sequences (chloroplast ndhF and nuclear ITS and LUMINIDEPENDENS) were used to test the monophyly and explore evolutionary relationships of Physarieae. The phylogenetic inferences were used to identify morphological traits to delimit the tribe, to interpret the evolution of selected morphological and genomic characters, and to test alternative hypotheses related to the covariation of traits. Results show that Physarieae are monophyletic and most closely related to three tribes (Halimolobeae, Boechereae, Camelineae) based on analyses of parsimony, likelihood, and Bayesian analyses. Two well-supported monophyletic clades in the tribe are recovered: the DDNLS clade, including Dithyrea, Dimorphocarpa, Nerisyrenia, Lyrocarpa, and Synthlipsis, and the PP clade, comprising Paysonia and Physaria. Character optimization of discrete and continuous morphological data suggest that enlarged fruits and replums, wide seeds, and long fruiting styles are potential synapomorphies of Physariae, whereas traits related to fruits, seed, trichomes, and pollen are useful to distinguish groups and genera within the tribe

    Large scale plastomics approaches for the study of evolution and adaptive signatures in angiosperms

    Get PDF
    corecore