75 research outputs found

    Why Should We Care About Molecular Coevolution?

    Get PDF
    Non-independent evolution of amino acid sites has become a noticeable limitation of most methods aimed at identifying selective constraints at functionally important amino acid sites or protein regions. The need for a generalised framework to account for non-independence of amino acid sites has fuelled the design and development of new mathematical models and computational tools centred on resolving this problem. Molecular coevolution is one of the most active areas of research, with an increasing rate of new models and methods being developed everyday. Both parametric and non-parametric methods have been developed to account for correlated variability of amino acid sites. These methods have been utilised for detecting phylogenetic, functional and structural coevolution as well as to identify surfaces of amino acid sites involved in protein-protein interactions. Here we discuss and briefly describe these methods, and identify their advantages and limitations

    Reducing the false positive rate in the non-parametric analysis of molecular coevolution

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The strength of selective constraints operating on amino acid sites of proteins has a multifactorial nature. In fact, amino acid sites within proteins coevolve due to their functional and/or structural relationships. Different methods have been developed that attempt to account for the evolutionary dependencies between amino acid sites. Researchers have invested a significant effort to increase the sensitivity of such methods. However, the difficulty in disentangling functional co-dependencies from historical covariation has fuelled the scepticism over their power to detect biologically meaningful results. In addition, the biological parameters connecting linear sequence evolution to structure evolution remain elusive. For these reasons, most of the evolutionary studies aimed at identifying functional dependencies among protein domains have focused on the structural properties of proteins rather than on the information extracted from linear multiple sequence alignments (MSA). Non-parametric methods to detect coevolution have been reported to be especially susceptible to produce false positive results based on the properties of MSAs. However, no formal statistical analysis has been performed to definitively test the differential effects of these properties on the sensitivity of such methods.</p> <p>Results</p> <p>Here we test the effect that variations on the MSA properties have over the sensitivity of non-parametric methods to detect coevolution. We test the effect that the size of the MSA (number of sequences), mean pairwise amino acid distance per site and the strength of the coevolution signal have on the ability of non-parametric methods to detect coevolution. Our results indicate that all three factors have significant effects on the accuracy of non-parametric methods. Further, introducing statistical filters improves the sensitivity and increases the statistical power of the methods to detect functional coevolution. Statistical analysis of the physico-chemical properties of amino acid sites in the context of the protein structure reveals striking dependencies among amino acid sites. Results indicate a covariation trend in the hydrophobicities and molecular weight characteristics of amino acid sites when analysing a non-redundant set of 8000 protein structures. Using this biological information as filter in coevolutionary analyses minimises the false positive rate of these methods. Application of these filters to three different proteins with known functional domains supports the importance of using biological filters to detect coevolution.</p> <p>Conclusion</p> <p>Coevolutionary analyses using non-parametric methods have proved difficult and highly prone to provide spurious results depending on the properties of MSAs and on the strength of coevolution between amino acid sites. The application of statistical filters to the number of pairs detected as coevolving reduces significantly the number of artifactual results. Analysis of the physico-chemical properties of amino acid sites in the protein structure context reveals their structure-dependent covariation. The application of this known biological information to the analysis of covariation greatly enhances the functional coevolutionary signal and removes historical covariation. Simultaneous use of statistical and biological data is instrumental in the detection of functional amino acid sites dependencies and compensatory changes at the protein level.</p

    Mutational dynamics of murine angiogenin duplicates

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Angiogenin (Ang) is a protein involved in angiogenesis by inducing the formation of blood vessels. The biomedical importance of this protein has come from findings linking mutations in Ang to cancer progression and neurodegenerative diseases. These findings highlight the evolutionary constrain on Ang amino acid sequence. However, previous studies comparing human Angiogenin with homologs from other phylogenetically related organisms have led to the conclusion that Ang presents a striking variability. Whether this variability has an adaptive value <it>per se </it>remains elusive. Understanding why many functional Ang paralogs have been preserved in mouse and rat and identifying functional divergence mutations at these copies may explain the relationship between mutations and function. In spite of the importance of testing this hypothesis from the evolutionarily and biomedical perspectives, this remains yet unaccomplished. Here we test the main mutational dynamics driving the evolution and function of Ang paralogs in mammals.</p> <p>Results</p> <p>We analysed the phylogenetic asymmetries between the different Ang gene copies in mouse and rat in the context of vertebrate Ang phylogeny. This analysis shows strong evidence in support of accelerated evolution in some Ang murine copies (mAng). This acceleration is not due to non-functionalisation because constraints on amino acid replacements remain strong. We identify many of the amino acid sites involved in signal localization and nucleotide binding by Ang to have evolved under diversifying selection. Compensatory effects of many of the mutations at these paralogs and their key structural location in or nearby important functional regions support a possible functional shift (functional divergence) in many Ang copies. Similarities between 3D-structural models for mAng copies suggest that their divergence is mainly functional.</p> <p>Conclusions</p> <p>We identify the main evolutionary dynamics shaping the variability of Angiogenin in vertebrates and highlight the plasticity of this protein after gene duplication. Our results suggest functional divergence among mAng paralogs. This puts forward mAng as a good system candidate for testing functional plasticity of such an important protein while stresses caution when using mouse as a model to infer the consequences of mutations in the single Ang copy of humans.</p

    Microbiota Variation Across Life Stages of European Field-Caught Anopheles atroparvus and During Laboratory Colonization: New Insights for Malaria Research

    Get PDF
    The potential use of bacteria for developing novel vector control approaches has awakened new interests in the study of the microbiota associated with vector species. To set a baseline for future malaria research, a high-throughput sequencing of the bacterial 16S ribosomal gene V3-V4 region was used to profile the microbiota associated with late-instar larvae, newly emerged females, and wild-caught females of a sylvan Anopheles atroparvus population from a former malaria transmission area of Spain. Field-acquired microbiota was then assessed in non-blood-fed laboratory-reared females from the second, sixth, and 10th generations. Diversity analyses revealed that bacterial communities varied and clustered differently according to origin with sylvan larvae and newly emerged females distributing closer to laboratory-reared females than to their field counterparts. Inter-sample variation was mostly observed throughout the different developmental stages in the sylvan population. Larvae harbored the most diverse bacterial communities; wild-caught females, the poorest. In the transition from the sylvan environment to the first time point of laboratory breeding, a significant increase in diversity was observed, although this did decline under laboratory conditions. Despite diversity differences between wild-caught and laboratory-reared females, a substantial fraction of the bacterial communities was transferred through transstadial transmission and these persisted over 10 laboratory generations. Differentially abundant bacteria were mostly identified between breeding water and late-instar larvae, and in the transition from wild-caught to laboratory-reared females from the second generation. Our findings confirmed the key role of the breeding environment in shaping the microbiota of An. atroparvus. Gram-negative bacteria governed the microbiota of An. atroparvus with the prevalence of proteobacteria. Pantoea, Thorsellia, Serratia, Asaia, and Pseudomonas dominating the microbiota associated with wild-caught females, with the latter two governing the communities of laboratory-reared females. A core microbiota was identified with Pseudomonas and Serratia being the most abundant core genera shared by all sylvan and laboratory specimens. Overall, understanding the microbiota composition of An. atroparvus and how this varies throughout the mosquito life cycle and laboratory colonization paves the way when selecting potential bacterial candidates for use in microbiota-based intervention strategies against mosquito vectors, thereby improving our knowledge of laboratory-reared An. atroparvus mosquitoes for research purposes.info:eu-repo/semantics/publishedVersio

    Understanding pseudo-albinism in sole (Solea senegalensis): a transcriptomics and metagenomics approach

    Get PDF
    Pseudo-albinism is a pigmentation disorder observed in flatfish aquaculture with a complex, multi-factor aetiology. We tested the hypothesis that pigmentation abnormalities are an overt signal of more generalised modifications in tissue structure and function, using as a model the Senegalese sole and two important innate immune barriers, the skin and intestine, and their microbiomes. Stereological analyses in pseudo-albino sole revealed a significantly increased mucous cell number in skin (P < 0.001) and a significantly thicker muscle layer and lamina propria in gut (P < 0.001). RNA-seq transcriptome analysis of the skin and gut identified 573 differentially expressed transcripts (DETs, FDR < 0.05) between pseudo-albino and pigmented soles (one pool/tissue from 4 individuals/phenotype). DETs were mainly linked to pigment production, skin structure and regeneration and smooth muscle contraction. The microbiome (16 S rRNA analysis) was highly diverse in pigmented and pseudo-albino skin but in gut had low complexity and diverged between the two pigmentation phenotypes. Quantitative PCR revealed significantly lower loads of Mycoplasma (P < 0.05) and Vibrio bacteria (P < 0.01) in pseudo-albino compared to the control. The study revealed that pseudo-albinism in addition to pigmentation changes was associated with generalised changes in the skin and gut structure and a modification in the gut microbiome.Agência financiadora H2020 European Funds MSCA-RISE project 691102 Portuguese national funds from FCT - Foundation for Science and Technology UID/Multi/04326/2019 Portuguese national funds from the operational programme CRESC Algarve 2020 EMBRC. PT ALG-01-0145-FEDER-022121 Portuguese national funds from the operational programme COMPETE 2020 EMBRC. PT ALG-01-0145-FEDER-022121 European Union (EU) 654008 Fundacao para a Ciencia e a Tecnologia (FCT) SFRH/BPD/84033/2012 Portuguese Institute for Employment and Vocational Training 0068/ET/18info:eu-repo/semantics/publishedVersio

    Systematic Production of Inactivating and NonInactivating Suppressor Mutations at the relA Locus That Compensate the Detrimental Effects of Complete spoT Loss and Affect Glycogen Content in Escherichia coli

    Get PDF
    In Escherichia coli, ppGpp is a major determinant of growth and glycogen accumulation. Levels of this signaling nucleotide are controlled by the balanced activities of the ppGpp RelA synthetase and the dual-function hydrolase/synthetase SpoT. Here we report the construction of spoT null (DspoT) mutants obtained by transducing a DspoT allele from DrelADspoT double mutants into relA+ cells. Iodine staining of randomly selected transductants cultured on a rich complex medium revealed differences in glycogen content among them. Sequence and biochemical analyses of 8 DspoT clones displaying glycogen-deficient phenotypes revealed different inactivating mutations in relA and no detectable ppGpp when cells were cultured on a rich complex medium. Remarkably, although the co-existence of DspoT with relA proficient alleles has generally been considered synthetically lethal, we found that 11 DspoT clones displaying high glycogen phenotypes possessed relA mutant alleles with non-inactivating mutations that encoded stable RelA proteins and ppGpp contents reaching 45–85% of those of wild type cells. None of the DspoT clones, however, could grow on M9-glucose minimal medium. Both Sanger sequencing of specific genes and high-throughput genome sequencing of the DspoT clones revealed that suppressor mutations were restricted to the relA locus. The overall results (a) defined in around 4 nmoles ppGpp/g dry weight the threshold cellular levels that suffice to trigger net glycogen accumulation, (b) showed that mutations in relA, but not necessarily inactivating mutations, can be selected to compensate total SpoT function(s) loss, and (c) provided useful tools for studies of the in vivo regulation of E. coli RelA ppGpp synthetaseFil: Montero, Manuel. Gobierno de Navarra. Instituto de Agrobiotecnología; EspañaFil: Rahimpour, Mehdi. Gobierno de Navarra. Instituto de Agrobiotecnología; EspañaFil: Viale, Alejandro Miguel. Gobierno de Navarra. Instituto de Agrobiotecnología; España. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Rosario. Instituto de Biología Molecular y Celular de Rosario. Universidad Nacional de Rosario. Facultad de Ciencias Bioquímicas y Farmacéuticas. Instituto de Biología Molecular y Celular de Rosario; ArgentinaFil: Almagro, Goizeder. Gobierno de Navarra. Instituto de Agrobiotecnología; EspañaFil: Eydallin, Gustavo. Gobierno de Navarra. Instituto de Agrobiotecnología; EspañaFil: Sevilla, Angel. Universidad de Murcia; EspañaFil: Canovas, Manuel. Universidad de Murcia; EspañaFil: Bernal, Cristina. Universidad de Murcia; EspañaFil: Lozano, Ana Belen. Universidad de Murcia; EspañaFil: Muñoz, Francisco Jose. Gobierno de Navarra. Instituto de Agrobiotecnología; EspañaFil: Bora Fernandez, Edurne. Gobierno de Navarra. Instituto de Agrobiotecnología; EspañaFil: Bahaji, Abdellatif. Gobierno de Navarra. Instituto de Agrobiotecnología; EspañaFil: Mori, Hirotada. Nara Institute of Science and Technology. Graduate School of Biological Sciences; JapónFil: Codoñer, Francisco M.. Lifesequencing SL. Valencia; EspañaFil: Potueza Romeo, Javier. Gobierno de Navarra. Instituto de Agrobiotecnología; Españ

    Endometrial receptivity revisited: endometrial transcriptome adjusted for tissue cellular heterogeneity

    Get PDF
    STUDY QUESTION Does cellular composition of the endometrial biopsy affect the gene expression profile of endometrial whole-tissue samples? SUMMARY ANSWER The differences in epithelial and stromal cell proportions in endometrial biopsies modify the whole-tissue gene expression profiles and affect the results of differential expression analyses. WHAT IS ALREADY KNOWN Each cell type has its unique gene expression profile. The proportions of epithelial and stromal cells vary in endometrial tissue during the menstrual cycle, along with individual and technical variation due to the method and tools used to obtain the tissue biopsy. STUDY DESIGN, SIZE, DURATION Using cell-population specific transcriptome data and computational deconvolution approach, we estimated the epithelial and stromal cell proportions in whole-tissue biopsies taken during early secretory and mid-secretory phases. The estimated cellular proportions were used as covariates in whole-tissue differential gene expression analysis. Endometrial transcriptomes before and after deconvolution were compared and analysed in biological context. PARTICIPANTS/MATERIAL, SETTING, METHODS Paired early- and mid-secretory endometrial biopsies were obtained from 35 healthy, regularly cycling, fertile volunteers, aged 23–36 years, and analysed by RNA sequencing. Differential gene expression analysis was performed using two approaches. In one of them, computational deconvolution was applied as an intermediate step to adjust for the proportions of epithelial and stromal cells in the endometrial biopsy. The results were then compared to conventional differential expression analysis. Ten paired endometrial samples were analysed with qPCR to validate the results. MAIN RESULTS AND THE ROLE OF CHANCE The estimated average proportions of stromal and epithelial cells in early secretory phase were 65% and 35%, and during mid-secretory phase, 46% and 54%, respectively, correlating well with the results of histological evaluation (r = 0.88, P = 1.1 × 10−6). Endometrial tissue transcriptomic analysis showed that approximately 26% of transcripts (n = 946) differentially expressed in receptive endometrium in cell-type unadjusted analysis also remain differentially expressed after adjustment for biopsy cellular composition. However, the other 74% (n = 2645) become statistically non-significant after adjustment for biopsy cellular composition, underlining the impact of tissue heterogeneity on differential expression analysis. The results suggest new mechanisms involved in endometrial maturation, involving genes like LINC01320, SLC8A1 and GGTA1P, described for the first time in context of endometrial receptivity. LARGE-SCALE DATA The RNA-seq data presented in this study is deposited in the Gene Expression Omnibus database with accession number GSE98386. LIMITATIONS REASONS FOR CAUTION Only dominant endometrial cell types were considered in gene expression profile deconvolution; however, other less frequent endometrial cell types also contribute to the whole-tissue gene expression profile. WIDER IMPLICATIONS OF THE FINDINGS The better understanding of molecular processes during transition from pre-receptive to receptive endometrium serves to improve the effectiveness and personalization of assisted reproduction protocols. Biopsy cellular composition should be taken into account in future endometrial ‘omics’ studies, where tissue heterogeneity could potentially influence the results. STUDY FUNDING/COMPETING INTEREST(S) This study was funded by: Estonian Ministry of Education and Research (grant IUT34-16); Enterprise Estonia (EU48695); the EU-FP7 Eurostars program (NOTED, EU41564); the EU-FP7 Marie Curie Industry-Academia Partnerships and Pathways (SARM, EU324509); Horizon 2020 innovation program (WIDENLIFE, EU692065); MSCA-RISE-2015 project MOMENDO (No 691058) and the Miguel Servet Program Type I of Instituto de Salud Carlos III (CP13/00038); Spanish Ministry of Economy, Industry and Competitiveness (MINECO) and European Regional Development Fund (FEDER): grants RYC-2016-21199 and ENDORE SAF2017-87526. Authors confirm no competing interests
    corecore