71 research outputs found
Methods for selecting fixed-effect models for heterogeneous codon evolution, with comments on their application to gene and genome data
BACKGROUND: Models of codon evolution have proven useful for investigating the strength and direction of natural selection. In some cases, a priori biological knowledge has been used successfully to model heterogeneous evolutionary dynamics among codon sites. These are called fixed-effect models, and they require that all codon sites are assigned to one of several partitions which are permitted to have independent parameters for selection pressure, evolutionary rate, transition to transversion ratio or codon frequencies. For single gene analysis, partitions might be defined according to protein tertiary structure, and for multiple gene analysis partitions might be defined according to a gene's functional category. Given a set of related fixed-effect models, the task of selecting the model that best fits the data is not trivial. RESULTS: In this study, we implement a set of fixed-effect codon models which allow for different levels of heterogeneity among partitions in the substitution process. We describe strategies for selecting among these models by a backward elimination procedure, Akaike information criterion (AIC) or a corrected Akaike information criterion (AICc). We evaluate the performance of these model selection methods via a simulation study, and make several recommendations for real data analysis. Our simulation study indicates that the backward elimination procedure can provide a reliable method for model selection in this setting. We also demonstrate the utility of these models by application to a single-gene dataset partitioned according to tertiary structure (abalone sperm lysin), and a multi-gene dataset partitioned according to the functional category of the gene (flagellar-related proteins of Listeria). CONCLUSION: Fixed-effect models have advantages and disadvantages. Fixed-effect models are desirable when data partitions are known to exhibit significant heterogeneity or when a statistical test of such heterogeneity is desired. They have the disadvantage of requiring a priori knowledge for partitioning sites. We recommend: (i) selection of models by using backward elimination rather than AIC or AICc, (ii) use a stringent cut-off, e.g., p = 0.0001, and (iii) conduct sensitivity analysis of results. With thoughtful application, fixed-effect codon models should provide a useful tool for large scale multi-gene analyses
Antibiotic and antifungal use in pediatric leukemia and lymphoma patients are associated with increasing opportunistic pathogens and decreasing bacteria responsible for activities that enhance colonic defense
Due to decreased immunity, both antibiotics and antifungals are regularly used in pediatric hematologic-cancer patients as a means to prevent severe infections and febrile neutropenia. The general effect of antibiotics on the human gut microbiome is profound, yielding decreased diversity and changes in community structure. However, the specific effect on pediatric oncology patients is not well-studied. The effect of antifungal use is even less understood, having been studied only in mouse models. Because the composition of the gut microbiome is associated with regulation of hematopoiesis, immune function and gastrointestinal integrity, changes within the patient gut can have implications for the clinical management of hematologic malignancies. The pediatric population is particularly challenging because the composition of the microbiome is age dependent, with some of the most pronounced changes occurring in the first three years of life. We investigated how antibiotic and antifungal use shapes the taxonomic composition of the stool microbiome in pediatric patients with leukemia and lymphoma, as inferred from both 16S rRNA and metagenome data. Associations with age, antibiotic use and antifungal use were investigated using multiple analysis methods. In addition, multivariable differential abundance was used to identify and assess specific taxa that were associated with multiple variables. Both antibiotics and antifungals were linked to a general decline in diversity in stool samples, which included a decrease in relative abundance in butyrate producers that play a critical role in host gut physiology (e.g., Faecalibacterium, Anaerostipes, Dorea, Blautia),. Furthermore, antifungal use was associated with a significant increase in relative abundance of opportunistic pathogens. Collectively, these findings have important implications for the treatment of leukemia and lymphoma patients. Butyrate is important for gastrointestinal integrity; it inhibits inflammation, reinforces colonic defense, mucosal immunity. and decreases oxidative stress. The routine use of broad-spectrum anti-infectives in pediatric oncology patients could simultaneously contribute to a decline in gastrointestinal integrity and colonic defense while promoting increases in opportunistic pathogens within the patient gut. Because the gut microbiome has been linked to both short-term clinical outcomes, and longer-lasting health effects, systematic characterization of the gut microbiome in pediatric patients during, and beyond, treatment is warranted
Portal protein diversity and phage ecology
© 2008 The Authors. This article is distributed under the terms of the Creative Commons License, Attribution 2.5. The definitive version was published in Environmental Microbiology 10 (2008): 2810-2823, doi:10.1111/j.1462-2920.2008.01702.x.Oceanic phages are critical components of the global ecosystem, where they play a role in microbial mortality and evolution. Our understanding of phage diversity is greatly limited by the lack of useful genetic diversity measures. Previous studies, focusing on myophages that infect the marine cyanobacterium Synechococcus, have used the coliphage T4 portal-protein-encoding homologue, gene 20 (g20), as a diversity marker. These studies revealed 10 sequence clusters, 9 oceanic and 1 freshwater, where only 3 contained cultured representatives. We sequenced g20 from 38 marine myophages isolated using a diversity of Synechococcus and Prochlorococcus hosts to see if any would fall into the clusters that lacked cultured representatives. On the contrary, all fell into the three clusters that already contained sequences from cultured phages. Further, there was no obvious relationship between host of isolation, or host range, and g20 sequence similarity. We next expanded our analyses to all available g20 sequences (769 sequences), which include PCR amplicons from wild uncultured phages, non-PCR amplified sequences identified in the Global Ocean Survey (GOS) metagenomic database, as well as sequences from cultured phages, to evaluate the relationship between g20 sequence clusters and habitat features from which the phage sequences were isolated. Even in this meta-data set, very few sequences fell into the sequence clusters without cultured representatives, suggesting that the latter are very rare, or sequencing artefacts. In contrast, sequences most similar to the culture-containing clusters, the freshwater cluster and two novel clusters, were more highly represented, with one particular culture-containing cluster representing the dominant g20 genotype in the unamplified GOS sequence data. Finally, while some g20 sequences were non-randomly distributed with respect to habitat, there were always numerous exceptions to general patterns, indicating that phage portal proteins are not good predictors of a phage's host or the habitat in which a particular phage may thrive.This research was supported in part by funding from NSF (CMORE contribution #87), DOE, The Seaver Foundation and the Gordon and Betty Moore Foundation Marine Microbiology Program to S.W.C.; an NIH Bioinformatics Training Grant supported M.B.S.; MIT Undergraduate Research Opportunities Program supported V.Q., J.A.L., G.T., R.F. and J.E.R.; Howard Hughes Medical Institute funded MIT Biology Department Undergraduate Research Opportunities Program supported A.S.D.; NSERC (Canada) Discovery Grant (DG 298394) and a Grant from the Canadian Foundation for Innovation (NOF10394) to J.P.B.; NSF Graduate Fellowship funding supported M.L.C
Corrigendum "Portal protein diversity and phage ecology"
Author Posting. © The Author(s), 2011. This is the author's version of the work. It is posted here by permission of John Wiley & Sons for personal use, not for redistribution. The definitive version was published in Environmental Microbiology 13 (2011): 2832, doi:10.1111/j.1462-2920.2011.02616.x
Positive Darwinian Selection in the Piston That Powers Proton Pumps in Complex I of the Mitochondria of Pacific Salmon
The mechanism of oxidative phosphorylation is well understood, but evolution of the proteins involved is not. We combined phylogenetic, genomic, and structural biology analyses to examine the evolution of twelve mitochondrial encoded proteins of closely related, yet phenotypically diverse, Pacific salmon. Two separate analyses identified the same seven positively selected sites in ND5. A strong signal was also detected at three sites of ND2. An energetic coupling analysis revealed several structures in the ND5 protein that may have co-evolved with the selected sites. These data implicate Complex I, specifically the piston arm of ND5 where it connects the proton pumps, as important in the evolution of Pacific salmon. Lastly, the lineage to Chinook experienced rapid evolution at the piston arm
Successful dietary therapy in paediatric Crohn’s disease is associated with shifts in bacterial dysbiosis and inflammatory metabotype towards healthy controls
Background and aims:
Nutritional therapy with the Crohn’s Disease Exclusion Diet + Partial Enteral Nutrition [CDED+PEN] or Exclusive Enteral Nutrition [EEN] induces remission and reduces inflammation in mild-to-moderate paediatric Crohn’s disease [CD]. We aimed to assess if reaching remission with nutritional therapy is mediated by correcting compositional or functional dysbiosis.
Methods:
We assessed metagenome sequences, short chain fatty acids [SCFA] and bile acids [BA] in 54 paediatric CD patients reaching remission after nutritional therapy [with CDED + PEN or EEN] [NCT01728870], compared to 26 paediatric healthy controls.
Results:
Successful dietary therapy decreased the relative abundance of Proteobacteria and increased Firmicutes towards healthy controls. CD patients possessed a mixture of two metabotypes [M1 and M2], whereas all healthy controls had metabotype M1. M1 was characterised by high Bacteroidetes and Firmicutes, low Proteobacteria, and higher SCFA synthesis pathways, and M2 was associated with high Proteobacteria and genes involved in SCFA degradation. M1 contribution increased during diet: 48%, 63%, up to 74% [Weeks 0, 6, 12, respectively.]. By Week 12, genera from Proteobacteria reached relative abundance levels of healthy controls with the exception of E. coli. Despite an increase in SCFA synthesis pathways, remission was not associated with increased SCFAs. Primary BA decreased with EEN but not with CDED+PEN, and secondary BA did not change during diet.
Conclusion:
Successful dietary therapy induced correction of both compositional and functional dysbiosis. However, 12 weeks of diet was not enough to achieve complete correction of dysbiosis. Our data suggests that composition and metabotype are important and change quickly during the early clinical response to dietary intervention. Correction of dysbiosis may therefore be an important future treatment goal for CD
Prevalence and Evolution of Core Photosystem II Genes in Marine Cyanobacterial Viruses and Their Hosts
Cyanophages (cyanobacterial viruses) are important agents of horizontal gene transfer among marine cyanobacteria, the numerically dominant photosynthetic organisms in the oceans. Some cyanophage genomes carry and express host-like photosynthesis genes, presumably to augment the host photosynthetic machinery during infection. To study the prevalence and evolutionary dynamics of this phenomenon, 33 cultured cyanophages of known family and host range and viral DNA from field samples were screened for the presence of two core photosystem reaction center genes, psbA and psbD. Combining this expanded dataset with published data for nine other cyanophages, we found that 88% of the phage genomes contain psbA, and 50% contain both psbA and psbD. The psbA gene was found in all myoviruses and Prochlorococcus podoviruses, but could not be amplified from Prochlorococcus siphoviruses or Synechococcus podoviruses. Nearly all of the phages that encoded both psbA and psbD had broad host ranges. We speculate that the presence or absence of psbA in a phage genome may be determined by the length of the latent period of infection. Whether it also carries psbD may reflect constraints on coupling of viral- and host-encoded PsbA–PsbD in the photosynthetic reaction center across divergent hosts. Phylogenetic clustering patterns of these genes from cultured phages suggest that whole genes have been transferred from host to phage in a discrete number of events over the course of evolution (four for psbA, and two for psbD), followed by horizontal and vertical transfer between cyanophages. Clustering patterns of psbA and psbD from Synechococcus cells were inconsistent with other molecular phylogenetic markers, suggesting genetic exchanges involving Synechococcus lineages. Signatures of intragenic recombination, detected within the cyanophage gene pool as well as between hosts and phages in both directions, support this hypothesis. The analysis of cyanophage psbA and psbD genes from field populations revealed significant sequence diversity, much of which is represented in our cultured isolates. Collectively, these findings show that photosynthesis genes are common in cyanophages and that significant genetic exchanges occur from host to phage, phage to host, and within the phage gene pool. This generates genetic diversity among the phage, which serves as a reservoir for their hosts, and in turn influences photosystem evolution
A maximum likelihood method for detecting functional divergence at individual codon sites, with application to gene family evolution
Abstract. The tailoring of existing genetic systems to new uses is called genetic co-option. Mechanisms of genetic co-option have been difficult to study because of difficulties in identifying functionally important changes. One way to study genetic co-option in protein-coding genes is to identify those amino acid sites that have experienced changes in selective pressure following a genetic co-option event. In this paper we present a maximum likelihood method useful for measuring divergent selective pressures and identifying the amino acid sites affected by divergent selection. The method is based on a codon model of evolution and uses the nonsynonymous-to-synonymous rate ratio (x) as a measure of selection on the protein, with x = 1, <1, and>1 indicating neutral evolution, purifying selection, and positive selection, respectively. The model allows variation in x among sites, with a fraction of sites evolving under divergent selective pressures. Divergent selection is indicated by different x’s between clades, such as between paralogous clades of a gene family. We applied the codon model to duplication followed by functional divergence of (i) the e and c globin genes and (ii) the eosinophil cationic protein (ECP) and eosinophilderived neurotoxin (EDN) genes. In both cases likelihood ratio tests suggested the presence of sites evolving under divergent selective pressures. Results of the e and c globin analysis suggested that divergent selective pressures might be a consequence of a weakened relationship between fetal hemoglobin an
- …