210 research outputs found

    Etude de la diversité métabolique dans l'espèce Escherichia coli (a l'aide de réseaux et de modèles du métabolisme à l'échelle de l'organisme)

    Get PDF
    Il existe plusieurs façons de concevoir l étude des différences métaboliques chez les microorganismes. On peut s intéresser à des variations des capacités métaboliques des organismes. Derrière cette analyse se cache niveau d étude à l échelle la cellule qui permet d avoir une vue d ensemble des capacités de croissance de l organisme dans différentes conditions. Dans cette thèse, nous proposons une nouvelle stratégie de reconstruction de réseaux et de modèles du métabolisme à l échelle globale. Cette stratégie s applique à un nombre quelconque d organismes à condition qu ils soient de la même espèce et qu il existe un réseau métabolique de référence de bonne qualité pour au moins l un d entre eux. Le point clé de cette stratégie repose sur l utilisation et la propagation automatisée des connaissances déjà acquises sur les organismes étudiés. Nous avons appliqué cette stratégie pour reconstruire et étudier les réseaux métaboliques de 23 Escherichia coli et 6 Shigellas. Nous avons ensuite converti ces réseaux en modèles métaboliques pour explorer les capacités physiologiques des différentes souches. Nous avons comparé nos prédictions de croissance à des expériences de croissance ainsi qu aux résultats du modèle de référence. Au final le travail réalisé propose une nouvelle stratégie de reconstruction de réseaux et de modèles du métabolisme à l échelle de la cellule, qui permet d étudier le lien entre l évolution et les capacités métaboliques des organismes étudiés.Metabolic differences in microorganisms can be focused on the variations in metabolic capacities. This analysis is at genome scale, and provides a global view of the organism s growth capacities under various conditions. In this project, we propose a new strategy to reconstruct metabolic networks and models at genome scale. Our strategy can be applied to any number of organisms as long as they are members of the same species and a curated metabolic network for one of the strains of this species is already reconstructed. The keystone of the strategy is the automatic utilization and propagation of both specific knowledge of the species and general knowledge in metabolic databases. This strategy was applied, to study the metabolic network of 23 strains of Escherichia coli and 6 of Shigella. Next, we have converted these networks into metabolic models to explore the metabolic capacities of these organisms. We compared the growth predictions of the models to experimental growth observations and also to the reference model. Finally, the studies carried out for this thesis consist of a new strategy, which allows the reconstruction of metabolic networks and models at cell scale. The models give us the opportunity to study the link between evolution, genomes and metabolic capacities of these organisms.EVRY-Bib. électronique (912289901) / SudocSudocFranceF

    Automatically exploiting genomic and metabolic contexts to aid the functional annotation of prokaryote genomes

    Get PDF
    Cette thèse porte sur le développement d'approches bioinformatiques exploitant de l'information de contextes génomiques et métaboliques afin de générer des annotations fonctionnelles de gènes prokaryotes, et comporte deux projets principaux. Le premier projet focalise sur les activités enzymatiques orphelines de séquence. Environ 27% des activités définies par le International Union of Biochemistry and Molecular Biology sont encore aujourd'hui orphelines. Pour celles-ci, les méthodes bioinformatiques traditionnelles ne peuvent proposer de gènes candidats; il est donc impératif d'utiliser des méthodes exploitant des informations contextuelles dans ces cas. La stratégie CanOE (fishingCandidate genes for Orphan Enzymes) a été développée et rajoutée à la plateforme MicroScope dans ce but, intégrant des informations génomiques et métaboliques sur des milliers d'organismes prokaryotes afin de localiser des gènes probants pour des activités orphelines. Le projet miroir au précédent est celui des protéines de fonction inconnue. Un projet collaboratif a été initié au Genoscope afin de formaliser les stratégies d'exploration des fonctions de familles protéiques prokaryotes. Une version pilote du projet a été mise en place sur la famille DUF849 dont une fonction enzymatique avait été récemment découverte. Des stratégies de proposition d'activités enzymatiques alternatives et d'établissement de sous familles isofonctionnelles ont été mises en place dans le cadre de cette thèse, afin de guider les expérimentations de paillasse et d'analyser leurs résultats.The subject of this thesis concerns the development of bioinformatic strategies exploiting genomic and metabolic contextual information in order to generate functional annotations for prokaryote genes. Two main projects were involved during this work: the first focuses on sequence-orphan enzymatic activities. Today, roughly 27% of activities defined by International Union of Biochemistry and Molecular Biology are sequence-orphans. For these, traditional bioinformatic approaches cannot propose candidate genes. It is thus imperative to use alternative, context-based approaches in such cases. The CanOE strategy fishing Candidate genes for Orphan Enzymes) was developed and added to the MicroScope bioinformatics platform in this aim. It integrates genomic and metabolic information across thousands of prokaryote genomes in order to locate promising gene candidates for orphan activities. The mirror project focuses on protein families of unknown function. A collaborative project has been set up at the Genoscope in hope of formalising functional exploration strategies for prokaryote protein families. A pilot version was created on the DUF849 Pfam family, for which a single activity had recently been elucidated. Strategies for proposing novel functions and activities and creating isofunctional sub-families were researched, so as to guide biochemical experimentations and to analyse their results.EVRY-Bib. électronique (912289901) / SudocSudocFranceF

    Backbone and side chain 1H, 15N and 13C assignments for a thiol-disulphide oxidoreductase from the Antarctic bacterium Pseudoalteromonas haloplanktis TAC125

    Get PDF
    Enzymes produced by psychrophilic organisms have successfully overcome the low temperature challenge and evolved to maintain high catalytic rates in their permanently cold environments. As an initial step in our attempt to elucidate the cold-adaptation strategies used by these enzymes we report here the 1H, 15N and 13C assignments for the reduced form of a thiol-disulphide oxidoreductase from the Antarctic bacterium Pseudoalteromonas haloplanktis TAC125.The NMR spectrometers are part of The National NMR Network (REDE/1517/RMN/2005), supported by ‘‘Programa Operacional Ciência e Inovação (POCTI) 2010’’ and Fundação para a Ciência e a Tecnologia (FCT). This work was funded by FCT, POCTI and FEDER; Projects POCI/BIA-PRO/57263/2004 and PTDC/BIO/70806/2006. TC is holder of a long term EMBO fellowship. MM is thankful to the Fundação para a Ciência e Tecnologia for its support through Programa Ciência 2007.info:eu-repo/semantics/publishedVersio

    GenoList: an integrated environment for comparative analysis of microbial genomes

    Get PDF
    The multitude of bacterial genome sequences being determined has generated new requirements regarding the development of databases and graphical interfaces: these are needed to organize and retrieve biological information from the comparison of large sets of genomes. GenoList (http://genolist.pasteur.fr/GenoList) is an integrated environment dedicated to querying and analyzing genome data from bacterial species. GenoList inherits from the SubtiList database and web server, the reference data resource for the Bacillus subtilis genome. The data model was extended to hold information about relationships between genomes (e.g. protein families). The web user interface was designed to primarily take into account biologists’ needs and modes of operation. Along with standard query and browsing capabilities, comparative genomics facilities are available, including subtractive proteome analysis. One key feature is the integration of the many tools accessible in the environment. As an example, it is straightforward to identify the genes that are specific to a group of bacteria, export them as a tab-separated list, get their protein sequences and run a multiple alignment on a subset of these sequences

    Infectious Disease Ontology

    Get PDF
    Technological developments have resulted in tremendous increases in the volume and diversity of the data and information that must be processed in the course of biomedical and clinical research and practice. Researchers are at the same time under ever greater pressure to share data and to take steps to ensure that data resources are interoperable. The use of ontologies to annotate data has proven successful in supporting these goals and in providing new possibilities for the automated processing of data and information. In this chapter, we describe different types of vocabulary resources and emphasize those features of formal ontologies that make them most useful for computational applications. We describe current uses of ontologies and discuss future goals for ontology-based computing, focusing on its use in the field of infectious diseases. We review the largest and most widely used vocabulary resources relevant to the study of infectious diseases and conclude with a description of the Infectious Disease Ontology (IDO) suite of interoperable ontology modules that together cover the entire infectious disease domain

    MOSAIC: an online database dedicated to the comparative genomics of bacterial strains at the intra-species level

    Get PDF
    BACKGROUND: The recent availability of complete sequences for numerous closely related bacterial genomes opens up new challenges in comparative genomics. Several methods have been developed to align complete genomes at the nucleotide level but their use and the biological interpretation of results are not straightforward. It is therefore necessary to develop new resources to access, analyze, and visualize genome comparisons. DESCRIPTION: Here we present recent developments on MOSAIC, a generalist comparative bacterial genome database. This database provides the bacteriologist community with easy access to comparisons of complete bacterial genomes at the intra-species level. The strategy we developed for comparison allows us to define two types of regions in bacterial genomes: backbone segments (i.e., regions conserved in all compared strains) and variable segments (i.e., regions that are either specific to or variable in one of the aligned genomes). Definition of these segments at the nucleotide level allows precise comparative and evolutionary analyses of both coding and non-coding regions of bacterial genomes. Such work is easily performed using the MOSAIC Web interface, which allows browsing and graphical visualization of genome comparisons. CONCLUSION: The MOSAIC database now includes 493 pairwise comparisons and 35 multiple maximal comparisons representing 78 bacterial species. Genome conserved regions (backbones) and variable segments are presented in various formats for further analysis. A graphical interface allows visualization of aligned genomes and functional annotations. The MOSAIC database is available online at http://genome.jouy.inra.fr/mosaic

    Rare Codons Cluster

    Get PDF
    Most amino acids are encoded by more than one codon. These synonymous codons are not used with equal frequency: in every organism, some codons are used more commonly, while others are more rare. Though the encoded protein sequence is identical, selective pressures favor more common codons for enhanced translation speed and fidelity. However, rare codons persist, presumably due to neutral drift. Here, we determine whether other, unknown factors, beyond neutral drift, affect the selection and/or distribution of rare codons. We have developed a novel algorithm that evaluates the relative rareness of a nucleotide sequence used to produce a given protein sequence. We show that rare codons, rather than being randomly scattered across genes, often occur in large clusters. These clusters occur in numerous eukaryotic and prokaryotic genomes, and are not confined to unusual or rarely expressed genes: many highly expressed genes, including genes for ribosomal proteins, contain rare codon clusters. A rare codon cluster can impede ribosome translation of the rare codon sequence. These results indicate additional selective pressures govern the use of synonymous codons, and specifically that local pauses in translation can be beneficial for protein biogenesis

    EcoliWiki: a wiki-based community resource for Escherichia coli

    Get PDF
    EcoliWiki is the community annotation component of the PortEco (http://porteco.org; formerly EcoliHub) project, an online data resource that integrates information on laboratory strains of Escherichia coli, its phages, plasmids and mobile genetic elements. As one of the early adopters of the wiki approach to model organism databases, EcoliWiki was designed to not only facilitate community-driven sharing of biological knowledge about E. coli as a model organism, but also to be interoperable with other data resources. EcoliWiki content currently covers genes from five laboratory E. coli strains, 21 bacteriophage genomes, F plasmid and eight transposons. EcoliWiki integrates the Mediawiki wiki platform with other open-source software tools and in-house software development to extend how wikis can be used for model organism databases. EcoliWiki can be accessed online at http://ecoliwiki.net

    Comparative Geno-Plasticity Analysis of Mycoplasma bovis HB0801 (Chinese Isolate)

    Get PDF
    Mycoplasma bovis pneumonia in cattle has been epidemic in China since 2008. To investigate M. bovis pathogenesis, we completed genome sequencing of strain HB0801 isolated from a lesioned bovine lung from Hubei, China. The genomic plasticity was determined by comparing HB0801 with M. bovis strain ATCC® 25523™/PG45 from cow mastitis milk, Chinese strain Hubei-1 from lesioned lung tissue, and 16 other Mycoplasmas species. Compared to PG45, the genome size of HB0801 was reduced by 11.7 kb. Furthermore, a large chromosome inversion (580 kb) was confirmed in all Chinese isolates including HB0801, HB1007, a strain from cow mastitis milk, and Hubei-1. In addition, the variable surface lipoproteins (vsp) gene cluster existed in HB0801, but contained less than half of the genes, and had poor identity to that in PG45, but they had conserved structures. Further inter-strain comparisons revealed other mechanisms of gene acquisition and loss in HB0801 that primarily involved insertion sequence (IS) elements, integrative conjugative element, restriction and modification systems, and some lipoproteins and transmembrane proteins. Subsequently, PG45 and HB0801 virulence in cattle was compared. Results indicated that both strains were pathogenic to cattle. The scores of gross pathological assessment for the control group, and the PG45- and HB0801-infected groups were 3, 13 and 9, respectively. Meanwhile the scores of lung lesion for these three groups were 36, 70, and 69, respectively. In addition, immunohistochemistry detection demonstrated that both strains were similarly distributed in lungs and lymph nodes. Although PG45 showed slightly higher virulence in calves than HB0801, there was no statistical difference between the strains (P>0.05). Compared to Hubei-1, a total of 122 SNP loci were disclosed in HB0801. In conclusion, although genomic plasticity was thought to be an evolutionary advantage, it did not apparently affect virulence of M. bovis strains in cattle
    corecore