183 research outputs found
Genomic signatures of sex, selection and speciation in the microbial world
Thesis (Ph. D.)--Massachusetts Institute of Technology, Computational and Systems Biology Program, 2010.This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.Cataloged from student-submitted PDF version of thesis.Includes bibliographical references (p. 218-228).Understanding the microbial world is key to understanding global biogeochemistry, human health and disease, yet this world is largely inaccessible. Microbial genomes, an increasingly accessible data source, provide an ideal entry point. The genome sequences of different microbes may be compared using the tools of population genetics to infer important genetic changes allowing them to diversify ecologically and adapt to distinct ecological niches. Yet the toolkit of population genetics was developed largely with sexual eukaryotes in mind. In this work, I assess and develop tools for inferring natural selection in microbial genomes. Many tools rely on population genetics theory, and thus require defining distinct populations, or species, of bacteria. Because sex (recombination) is not required for reproduction, some bacteria recombine only rarely, while others are extremely promiscuous, exchanging genes across great genetic distances. This behavior poses a challenge for defining microbial population boundaries. This thesis begins with a discussion of how recombination and positive selection interact to promote ecological adaptation. I then describe a general pipeline for quantifying the impacts of mutation, recombination and selection on microbial genomes, and apply it to two closely related, yet ecologically distinct populations of Vibrio splendidus, each with its own microhabitat preference. I introduce a new tool, STARRInIGHTS, for inferring homologous recombination events. By assessing rates of recombination within and between ecological populations, I conclude that ecological differentiation is driven by small number of habitat-specific alleles, while most loci are shared freely across habitats. The remainder of the thesis focuses on lineage-specific changes in natural selection among anciently diverged species of gamma proteobacteria. I develop two new metrics, selective signatures and slow:fast, for detecting deviations from the expected rate of evolution in 'core' proteins (present in single copy in most species). Because they rely on empirical distributions of evolutionary rates across species, these methods should become increasingly powerful as more and more microbial genomes are sampled. Overall, the methods described here significantly expand the repertoire of tools available for microbial population genomics, both for investigating the process of ecological differentiation at the finest of time scales, and over billions of years of microbial evolution.by B. Jesse Shapiro.Ph.D
Recommended from our members
A phylogeny-based sampling strategy and power calculator informs genome-wide associations study design for microbial pathogens
Whole genome sequencing is increasingly used to study phenotypic variation among infectious pathogens and to evaluate their relative transmissibility, virulence, and immunogenicity. To date, relatively little has been published on how and how many pathogen strains should be selected for studies associating phenotype and genotype. There are specific challenges when identifying genetic associations in bacteria which often comprise highly structured populations. Here we consider general methodological questions related to sampling and analysis focusing on clonal to moderately recombining pathogens. We propose that a matched sampling scheme constitutes an efficient study design, and provide a power calculator based on phylogenetic convergence. We demonstrate this approach by applying it to genomic datasets for two microbial pathogens: Mycobacterium tuberculosis and Campylobacter species. Electronic supplementary material The online version of this article (doi:10.1186/s13073-014-0101-7) contains supplementary material, which is available to authorized users
Recommended from our members
Comparing Patterns of Natural Selection Across Species Using Selective Signatures
Comparing gene expression profiles over many different conditions has led to insights that were not obvious from single experiments. In the same way, comparing patterns of natural selection across a set of ecologically distinct species may extend what can be learned from individual genome-wide surveys. Toward this end, we show how variation in protein evolutionary rates, after correcting for genome-wide effects such as mutation rate and demographic factors, can be used to estimate the level and types of natural selection acting on genes across different species. We identify unusually rapidly and slowly evolving genes, relative to empirically derived genome-wide and gene family-specific background rates for 744 core protein families in 30 gamma-proteobacterial species. We describe the pattern of fast or slow evolution across species as the 'selective signature' of a gene. Selective signatures represent a profile of selection across species that is predictive of gene function: pairs of genes with correlated selective signatures are more likely to share the same cellular function, and genes in the same pathway can evolve in concert. For example, glycolysis and phenylalanine metabolism genes evolve rapidly in Idiomarina loihiensis, mirroring an ecological shift in carbon source from sugars to amino acids. In a broader context, our results suggest that the genomic landscape is organized into functional modules even at the level of natural selection, and thus it may be easier than expected to understand the complex evolutionary pressures on a cell
Comparing Patterns of Natural Selection across Species Using Selective Signatures
Comparing gene expression profiles over many different conditions has led to insights that were not obvious from single experiments. In the same way, comparing patterns of natural selection across a set of ecologically distinct species may extend what can be learned from individual genome-wide surveys. Toward this end, we show how variation in protein evolutionary rates, after correcting for genome-wide effects such as mutation rate and demographic factors, can be used to estimate the level and types of natural selection acting on genes across different species. We identify unusually rapidly and slowly evolving genes, relative to empirically derived genome-wide and gene family-specific background rates for 744 core protein families in 30 Îł-proteobacterial species. We describe the pattern of fast or slow evolution across species as the âselective signatureâ of a gene. Selective signatures represent a profile of selection across species that is predictive of gene function: pairs of genes with correlated selective signatures are more likely to share the same cellular function, and genes in the same pathway can evolve in concert. For example, glycolysis and phenylalanine metabolism genes evolve rapidly in Idiomarina loihiensis, mirroring an ecological shift in carbon source from sugars to amino acids. In a broader context, our results suggest that the genomic landscape is organized into functional modules even at the level of natural selection, and thus it may be easier than expected to understand the complex evolutionary pressures on a cell
Recommended from our members
Evolutionary consequences of intra-patient phage predation on microbial populations
The impact of phage predation on bacterial pathogens in the context of human disease is not currently appreciated. Here, we show that predatory interactions of a phage with an important environmentally transmitted pathogen, Vibrio cholerae, can modulate the evolutionary trajectory of this pathogen during the natural course of infection within individual patients. We analyzed geographically and temporally disparate cholera patient stool samples from Haiti and Bangladesh and found that phage predation can drive the genomic diversity of intra-patient V. cholerae populations. Intra-patient phage-sensitive and phage-resistant isolates were isogenic except for mutations conferring phage resistance, and moreover, phage-resistant V. cholerae populations were composed of a heterogeneous mix of many unique mutants. We also observed that phage predation can significantly alter the virulence potential of V. cholerae shed from cholera patients. We provide the first molecular evidence for predatory phage shaping microbial community structure during the natural course of infection in humans. DOI: http://dx.doi.org/10.7554/eLife.03497.00
Can cyanobacterial diversity in the source predict the diversity in sludge and the risk of toxin release in a drinking water treatment plant?
ABSTRACT: Conventional processes (coagulation, flocculation, sedimentation, and filtration) are widely used in drinking water treatment plants and are considered a good treatment strategy to eliminate cyanobacterial cells and cell-bound cyanotoxins. The diversity of cyanobacteria was investigated using taxonomic cell counts and shotgun metagenomics over two seasons in a drinking water treat- ment plant before, during, and after the bloom. Changes in the community structure over time at the phylum, genus, and species levels were monitored in samples retrieved from raw water (RW), sludge in the holding tank (ST), and sludge supernatant (SST). Aphanothece clathrata brevis, Microcystis aeruginosa, Dolichospermum spiroides, and Chroococcus minimus were predominant species detected in RW by taxonomic cell counts. Shotgun metagenomics revealed that Proteobacteria was the pre- dominant phylum in RW before and after the cyanobacterial bloom. Taxonomic cell counts and shotgun metagenomic showed that the Dolichospermum bloom occurred inside the plant. Cyanobac- teria and Bacteroidetes were the major bacterial phyla during the bloom. Shotgun metagenomics also showed that Synechococcus, Microcystis, and Dolichospermum were the predominant detected cyanobacterial genera in the samples. Conventional treatment removed more than 92% of cyanobac- terial cells but led to cell accumulation in the sludge up to 31 times more than in the RW influx. Coagulation/sedimentation selectively removed more than 96% of Microcystis and Dolichospermum. Cyanobacterial community in the sludge varied from raw water to sludge during sludge storage (1â13 days). This variation was due to the selective removal of coagulation/sedimentation as well as the accumulation of captured cells over the period of storage time. However, the prediction of the cyanobacterial community composition in the SST remained a challenge. Among nutrient parameters, orthophosphate availability was related to community profile in RW samples, whereas communities in ST were influenced by total nitrogen, Kjeldahl nitrogen (N- Kjeldahl), total and particulate phos- phorous, and total organic carbon (TOC). No trend was observed on the impact of nutrients on SST communities. This study profiled new health-related, environmental, and technical challenges for the production of drinking water due to the complex fate of cyanobacteria in cyanobacteria-laden sludge and supernatant
Sympatric Speciation: When Is It Possible in Bacteria?
This study investigated a potential auditory illusion in duration perception induced by rhythmic temporal contexts. Listeners with or without musical training performed a duration discrimination task for a silent period in a rhythmic auditory sequence. The critical temporal interval was presented either within a perceptual group or between two perceptual groups. We report the just-noticeable difference (difference limen, DL) for temporal intervals and the point of subjective equality (PSE) derived from individual psychometric functions based on performance of a two-alternative forced choice task. In musically untrained individuals, equal temporal intervals were perceived as significantly longer when presented between perceptual groups than within a perceptual group (109.25% versus 102.5% of the standard duration). Only the perceived duration of the between-group interval was significantly longer than its objective duration. Musically trained individuals did not show this effect. However, in both musically trained and untrained individuals, the relative difference limens for discriminating the comparison interval from the standard interval were larger in the between-groups condition than in the within-group condition (7.3% vs. 5.6% of the standard duration). Thus, rhythmic grouping affected sensitivity to duration changes in all listeners, with duration differences being harder to detect at boundaries of rhythm groups than within rhythm groups. Our results show for the first time that temporal Gestalt induces auditory duration illusions in typical listeners, but that musical experts are not susceptible to this effect of rhythmic grouping.Ellison Medical FoundationSwiss National Science Foundation (PA00P1_131448/1
Age specific aetiological agents of diarrhoea in hospitalized children aged less than five years in Dar es Salaam, Tanzania
\ud
This study aimed to determine the age-specific aetiologic agents of diarrhoea in children aged less than five years. The study also assessed the efficacy of the empiric treatment of childhood diarrhoea using Integrated Management of Childhood Illness (IMCI) guidelines. This study included 280 children aged less than 5 years, admitted with diarrhoea to any of the four major hospitals in Dar es Salaam. Bacterial pathogens were identified using conventional methods. Enzyme Linked Immunosorbent Assay (ELISA) and agglutination assay were used to detect viruses and intestinal protozoa, respectively. Antimicrobial susceptibility was determined using Kirby-Bauer disk diffusion method. At least one of the searched pathogens was detected in 67.1% of the cases, and mixed infections were detected in 20.7% of cases. Overall, bacteria and viruses contributed equally accounting for 33.2% and 32.2% of all the cases, respectively, while parasites were detected in 19.2% patients. Diarrhoeagenic Escherichia coli (DEC) was the most common enteric pathogen, isolated in 22.9% of patients, followed by Cryptosporidium parvum (18.9%), rotavirus (18.1%) and norovirus (13.7%). The main cause of diarrhoea in children aged 0 to 6 months were bacteria, predominantly DEC, while viruses predominated in the 7-12 months age group. Vibrio cholerae was isolated mostly in children above two years. Shigella spp, V. cholerae and DEC showed moderate to high rates of resistance to erythromycin, ampicillin, chloramphenicol and tetracycline (56.2-100%). V. cholerae showed full susceptibility to co-trimoxazole (100%), while DEC and Shigella showed high rate of resistance to co-trimoxazole; 90.6% and 93.3% respectively. None of the bacterial pathogens isolated showed resistance to ciprofloxacin which is not recommended for use in children. Cefotaxime resistance was found only in 4.7% of the DEC. During the dry season, acute watery diarrhoea is the most common type of diarrhoea in children under five years in Dar es Salaam and is predominantly due to DEC, C. parvum, rotaviruses and noroviruses. Constant antibiotic surveillance is warranted as bacteria were highly resistant to various antimicrobial agents including co-trimoxazole and erythromycin which are currently recommended for empiric treatment of diarrhoea.\u
Recommended from our members
Vibrio cholerae genomic diversity within and between patients
Cholera is a severe, water-borne diarrhoeal disease caused by toxin-producing strains of the bacterium Vibrio cholerae. Comparative genomics has revealed âwavesâ of cholera transmission and evolution, in which clones are successively replaced over decades and centuries. However, the extent of V. cholerae genetic diversity within an epidemic or even within an individual patient is poorly understood. Here, we characterized V. cholerae genomic diversity at a micro-epidemiological level within and between individual patients from Bangladesh and Haiti. To capture within-patient diversity, we isolated multiple (8 to 20) V. cholerae colonies from each of eight patients, sequenced their genomes and identified point mutations and gene gain/loss events. We found limited but detectable diversity at the level of point mutations within hosts (zero to three single nucleotide variants within each patient), and comparatively higher gene content variation within hosts (at least one gain/loss event per patient, and up to 103 events in one patient). Much of the gene content variation appeared to be due to gain and loss of phage and plasmids within the V. cholerae population, with occasional exchanges between V. cholerae and other members of the gut microbiota. We also show that certain intra-host variants have phenotypic consequences. For example, the acquisition of a Bacteroides plasmid and non-synonymous mutations in a sensor histidine kinase gene both reduced biofilm formation, an important trait for environmental survival. Together, our results show that V. cholerae is measurably evolving within patients, with possible implications for disease outcomes and transmission dynamics
Swift follow-up observations of candidate gravitational-wave transient events
We present the first multi-wavelength follow-up observations of two candidate
gravitational-wave (GW) transient events recorded by LIGO and Virgo in their
2009-2010 science run. The events were selected with low latency by the network
of GW detectors and their candidate sky locations were observed by the Swift
observatory. Image transient detection was used to analyze the collected
electromagnetic data, which were found to be consistent with background.
Off-line analysis of the GW data alone has also established that the selected
GW events show no evidence of an astrophysical origin; one of them is
consistent with background and the other one was a test, part of a "blind
injection challenge". With this work we demonstrate the feasibility of rapid
follow-ups of GW transients and establish the sensitivity improvement joint
electromagnetic and GW observations could bring. This is a first step toward an
electromagnetic follow-up program in the regime of routine detections with the
advanced GW instruments expected within this decade. In that regime
multi-wavelength observations will play a significant role in completing the
astrophysical identification of GW sources. We present the methods and results
from this first combined analysis and discuss its implications in terms of
sensitivity for the present and future instruments.Comment: Submitted for publication 2012 May 25, accepted 2012 October 25,
published 2012 November 21, in ApJS, 203, 28 (
http://stacks.iop.org/0067-0049/203/28 ); 14 pages, 3 figures, 6 tables;
LIGO-P1100038; Science summary at
http://www.ligo.org/science/Publication-S6LVSwift/index.php ; Public access
area to figures, tables at
https://dcc.ligo.org/cgi-bin/DocDB/ShowDocument?docid=p110003
- âŠ