101 research outputs found

    Detection of new protein domains using co-occurrence: application to Plasmodium falciparum

    Get PDF
    International audienceMotivation: Hidden Markov Models (HMMs) have proved to be a powerful tool for protein domain identification in newly sequenced organisms. However, numerous domains may be missed in highly divergent proteins. This is the case for Plasmodium falciparum proteins, the main causal agent of human malaria. Results: We propose a method to improve the sensitivity of HMM domain detection by exploiting the tendency of the domains to appear preferentially with a few other favorite domains in a protein. When sequence information alone is not sufficient to warrant the presence of a particular domain, our method enables its detection on the basis of the presence of other Pfam or InterPro domains. Moreover, a shuffling procedure allows us to estimate the false discovery rate associated with the results. Applied to P. falciparum, our method identifies 585 new Pfam domains (versus the 3683 already known domains in the Pfam database) with an estimated error rate below 20%. These new domains provide 387 new Gene Ontology annotations to the P. falciparum proteome. Analogous and congruent results are obtained when applying the method to related Plasmodium species, P. vivax and P. yoelii. Availability: Supplementary Material and a database of the new domains and GO predictions achieved on Plasmodium proteins are available at http://www.lirmm.fr/~terrapon/codd

    EuPathDomains: The Divergent Domain Database for Eukaryotic Pathogens

    Get PDF
    International audienceEukaryotic pathogens (e.g. Plasmodium, Leishmania, Trypanosomes, etc.) are a major source of morbidity and mortality worldwide. In Africa, one of the most impacted continents, they cause millions of deaths and constitute an immense economic burden. While the genome sequence of several of these organisms is now available, the biological functions of more than half of their proteins are still unknown. This is a serious issue for bringing to the foreground the expected new therapeutic targets. In this context, the identification of protein domains is a key step to improve the functional annotation of the proteins. However, several domains are missed in eukaryotic pathogens because of the high phylogenetic distance of these organisms from the classical eukaryote models. We recently proposed a method, co-occurrence domain detection (CODD), that improves the sensitivity of Pfam domain detection by exploiting the tendency of domains to appear preferentially with a few other favorite domains in a protein. In this paper, we present EuPathDomains (http://www.atgc-montpellier.fr/EuPathDomains/), an extended database of protein domains belonging to ten major eukaryotic human pathogens. EuPathDomains gathers known and new domains detected by CODD, along with the associated confidence measurements and the GO annotations that can be deduced from the new domains. This database significantly extends the Pfam domain coverage of all selected genomes, by proposing new occurrences of domains as well as new domain families that have never been reported before. For example, with a false discovery rate lower than 20%, EuPathDomains increases the number of detected domains by 13% in Toxoplasma gondii genome and up to 28% in Cryptospordium parvum, and the total number of domain families by 10% in Plasmodium falciparum and up to 16% in C. parvum genome. The database can be queried by protein names, domain identifiers, Pfam or Interpro identifiers, or organisms, and should become a valuable resource to decipher the protein functions of eukaryotic pathogens

    Phenotypic and Genomic Diversification in Complex Carbohydrate-Degrading Human Gut Bacteria

    Get PDF
    Symbiotic bacteria are responsible for the majority of complex carbohydrate digestion in the human colon. Since the identities and amounts of dietary polysaccharides directly impact the gut microbiota, determining which microorganisms consume specific nutrients is central for defining the relationship between diet and gut microbial ecology. Using a custom phenotyping array, we determined carbohydrate utilization profiles for 354 members of the Bacteroidetes, a dominant saccharolytic phylum. There was wide variation in the numbers and types of substrates degraded by individual bacteria, but phenotype-based clustering grouped members of the same species indicating that each species performs characteristic roles. The ability to utilize dietary polysaccharides and endogenous mucin glycans was negatively correlated, suggesting exclusion between these niches. By analyzing related Bacteroides ovatus/Bacteroides xylanisolvens strains that vary in their ability to utilize mucin glycans, we addressed whether gene clusters that confer this complex, multilocus trait are being gained or lost in individual strains. Pangenome reconstruction of these strains revealed a remarkably mosaic architecture in which genes involved in polysaccharide metabolism are highly variable and bioinformatics data provide evidence of interspecies gene transfer that might explain this genomic heterogeneity. Global transcriptomic analyses suggest that the ability to utilize mucin has been lost in some lineages of B. ovatus and B. xylanisolvens, which harbor residual gene clusters that are involved in mucin utilization by strains that still actively express this phenotype. Our data provide insight into the breadth and complexity of carbohydrate metabolism in the microbiome and the underlying genomic events that shape these behaviors

    How members of the human gut microbiota overcome the sulfation problem posed by glycosaminoglycans

    Get PDF
    The human microbiota, which plays an important role in health and disease, uses complex carbohydrates as a major source of nutrients. Utilization hierarchy indicates that the host glycosaminoglycans heparin (Hep) and heparan sulfate (HS) are high-priority carbohydrates for Bacteroides thetaiotaomicron, a prominent member of the human microbiota. The sulfation patterns of these glycosaminoglycans are highly variable, which presents a significant enzymatic challenge to the polysaccharide lyases and sulfatases that mediate degradation. It is possible that the bacterium recruits lyases with highly plastic specificities and expresses a repertoire of enzymes that target substructures of the glycosaminoglycans with variable sulfation or that the glycans are desulfated before cleavage by the lyases. To distinguish between these mechanisms, the components of the B. thetaiotaomicron Hep/HS degrading apparatus were analyzed. The data showed that the bacterium expressed a single-surface endo-acting lyase that cleaved HS, reflecting its higher molecular weight compared with Hep. Both Hep and HS oligosaccharides imported into the periplasm were degraded by a repertoire of lyases, with each enzyme displaying specificity for substructures within these glycosaminoglycans that display a different degree of sulfation. Furthermore, the crystal structures of a key surface glycan binding protein, which is able to bind both Hep and HS, and periplasmic sulfatases reveal the major specificity determinants for these proteins. The locus described here is highly conserved within the human gut Bacteroides, indicating that the model developed is of generic relevance to this important microbial community

    Microbiota-directed fibre activates both targeted and secondary metabolic shifts in the distal gut

    Get PDF
    Beneficial modulation of the gut microbiome has high-impact implications not only in humans, but also in livestock that sustain our current societal needs. In this context, we have tailored an acetylated galactoglucomannan (AcGGM) fibre to match unique enzymatic capabilities of Roseburia and Faecalibacterium species, both renowned butyrate-producing gut commensals. Here, we test the accuracy of AcGGM within the complex endogenous gut microbiome of pigs, wherein we resolve 355 metagenome-assembled genomes together with quantitative metaproteomes. In AcGGM-fed pigs, both target populations differentially express AcGGM-specific polysaccharide utilization loci, including novel, mannan-specific esterases that are critical to its deconstruction. However, AcGGM-inclusion also manifests a “butterfly effect”, whereby numerous metabolic changes and interdependent cross-feeding pathways occur in neighboring non-mannanolytic populations that produce short-chain fatty acids. Our findings show how intricate structural features and acetylation patterns of dietary fibre can be customized to specific bacterial populations, with potential to create greater modulatory effects at large

    Glycan complexity dictates microbial resource allocation in the large intestine.

    Get PDF
    The structure of the human gut microbiota is controlled primarily through the degradation of complex dietary carbohydrates, but the extent to which carbohydrate breakdown products are shared between members of the microbiota is unclear. We show here, using xylan as a model, that sharing the breakdown products of complex carbohydrates by key members of the microbiota, such as Bacteroides ovatus, is dependent on the complexity of the target glycan. Characterization of the extensive xylan degrading apparatus expressed by B. ovatus reveals that the breakdown of the polysaccharide by the human gut microbiota is significantly more complex than previous models suggested, which were based on the deconstruction of xylans containing limited monosaccharide side chains. Our report presents a highly complex and dynamic xylan degrading apparatus that is fine-tuned to recognize the different forms of the polysaccharide presented to the human gut microbiota.This work was supported in part by grants to D.N.B. (BBSRC BB/G016186/1) and H.J.G. (Wellcome Trust WT097907AIA).This is the final version. It was first published by NPG at http://dx.doi.org/10.1038/ncomms848

    Dietary pectic glycans are degraded by coordinated enzyme pathways in human colonic Bacteroides.

    Get PDF
    The major nutrients available to human colonic Bacteroides species are glycans, exemplified by pectins, a network of covalently linked plant cell wall polysaccharides containing galacturonic acid (GalA). Metabolism of complex carbohydrates by the Bacteroides genus is orchestrated by polysaccharide utilization loci (PULs). In Bacteroides thetaiotaomicron, a human colonic bacterium, the PULs activated by different pectin domains have been identified; however, the mechanism by which these loci contribute to the degradation of these GalA-containing polysaccharides is poorly understood. Here we show that each PUL orchestrates the metabolism of specific pectin molecules, recruiting enzymes from two previously unknown glycoside hydrolase families. The apparatus that depolymerizes the backbone of rhamnogalacturonan-I is particularly complex. This system contains several glycoside hydrolases that trim the remnants of other pectin domains attached to rhamnogalacturonan-I, and nine enzymes that contribute to the degradation of the backbone that makes up a rhamnose-GalA repeating unit. The catalytic properties of the pectin-degrading enzymes are optimized to protect the glycan cues that activate the specific PULs ensuring a continuous supply of inducing molecules throughout growth. The contribution of Bacteroides spp. to metabolism of the pectic network is illustrated by cross-feeding between organisms.This work was supported in part by an Advanced Grant from the European Research Council (Grant No. 322820) awarded to H.J.G. and B.H. supporting A.S.L., D.N., A.C. and N.T., a Wellcome Trust Senior Investigator Award to H.J.G. (grant No. WT097907MA) that supported J.B. and E.C.L. a European Union Seventh Framework Initial Training Network Programme entitled the “WallTraC project” (Grant Agreement number 263916) awarded to M-C.R. and H.J.G, which supported X.Z. and J.S. The Biotechnology and Biological Research Council project ‘Ricefuel’ (grant numbers BB/K020358/1) awarded to H.J.G. supported A.L

    A catalog of microbial genes from the bovine rumen unveils a specialized and diverse biomass-degrading environment

    Get PDF
    Background The rumen microbiota provides essential services to its host and, through its role in ruminant production, contributes to human nutrition and food security. A thorough knowledge of the genetic potential of rumen microbes will provide opportunities for improving the sustainability of ruminant production systems. The availability of gene reference catalogs from gut microbiomes has advanced the understanding of the role of the microbiota in health and disease in humans and other mammals. In this work, we established a catalog of reference prokaryote genes from the bovine rumen. Results Using deep metagenome sequencing we identified 13,825,880 non-redundant prokaryote genes from the bovine rumen. Compared to human, pig, and mouse gut metagenome catalogs, the rumen is larger and richer in functions and microbial species associated with the degradation of plant cell wall material and production of methane. Genes encoding enzymes catalyzing the breakdown of plant polysaccharides showed a particularly high richness that is otherwise impossible to infer from available genomes or shallow metagenomics sequencing. The catalog expands the dataset of carbohydrate-degrading enzymes described in the rumen. Using an independent dataset from a group of 77 cattle fed 4 common dietary regimes, we found that only <0.1% of genes were shared by all animals, which contrast with a large overlap for functions, i.e., 63% for KEGG functions. Different diets induced differences in the relative abundance rather than the presence or absence of genes, which explains the great adaptability of cattle to rapidly adjust to dietary changes. Conclusions These data bring new insights into functions, carbohydrate-degrading enzymes, and microbes of the rumen to complement the available information on microbial genomes. The catalog is a significant biological resource enabling deeper understanding of phenotypes and biological processes and will be expanded as new data are made available.info:eu-repo/semantics/publishedVersio

    Ninety-nine de novo assembled genomes from the moose (Alces alces) rumen microbiome provide new insights into microbial plant biomass degradation.

    Get PDF
    The moose (Alces alces) is a ruminant that harvests energy from fiber-rich lignocellulose material through carbohydrate-active enzymes (CAZymes) produced by its rumen microbes. We applied shotgun metagenomics to rumen contents from six moose to obtain insights into this microbiome. Following binning, 99 metagenome-assembled genomes (MAGs) belonging to 11 prokaryotic phyla were reconstructed and characterized based on phylogeny and CAZyme profile. The taxonomy of these MAGs reflected the overall composition of the metagenome, with dominance of the phyla Bacteroidetes and Firmicutes. Unlike in other ruminants, Spirochaetes constituted a significant proportion of the community and our analyses indicate that the corresponding strains are primarily pectin digesters. Pectin-degrading genes were also common in MAGs of Ruminococcus, Fibrobacteres and Bacteroidetes and were overall overrepresented in the moose microbiome compared with other ruminants. Phylogenomic analyses revealed several clades within the Bacteriodetes without previously characterized genomes. Several of these MAGs encoded a large numbers of dockerins, a module usually associated with cellulosomes. The Bacteroidetes dockerins were often linked to CAZymes and sometimes encoded inside polysaccharide utilization loci, which has never been reported before. The almost 100 CAZyme-annotated genomes reconstructed in this study provide an in-depth view of an efficient lignocellulose-degrading microbiome and prospects for developing enzyme technology for biorefineries
    • 

    corecore