387 research outputs found

    PanCGHweb: a web tool for genotype calling in pangenome CGH data

    Get PDF
    Summary: A pangenome is the total of genes present in strains of the same species. Pangenome microarrays allow determining the genomic content of bacterial strains more accurately than conventional comparative genome hybridization microarrays. PanCGHweb is the first tool that effectively calls genotype based on pangenome microarray data

    The proteolytic system of lactic acid bacteria revisited: a genomic comparison

    Get PDF
    Contains fulltext : 87750.pdf (publisher's version ) (Open Access)BACKGROUND: Lactic acid bacteria (LAB) are a group of gram-positive, lactic acid producing Firmicutes. They have been extensively used in food fermentations, including the production of various dairy products. The proteolytic system of LAB converts proteins to peptides and then to amino acids, which is essential for bacterial growth and also contributes significantly to flavor compounds as end-products. Recent developments in high-throughput genome sequencing and comparative genomics hybridization arrays provide us with opportunities to explore the diversity of the proteolytic system in various LAB strains. RESULTS: We performed a genome-wide comparative genomics analysis of proteolytic system components, including cell-wall bound proteinase, peptide transporters and peptidases, in 22 sequenced LAB strains. The peptidase families PepP/PepQ/PepM, PepD and PepI/PepR/PepL are described as examples of our in silico approach to refine the distinction of subfamilies with different enzymatic activities. Comparison of protein 3D structures of proline peptidases PepI/PepR/PepL and esterase A allowed identification of a conserved core structure, which was then used to improve phylogenetic analysis and functional annotation within this protein superfamily.The diversity of proteolytic system components in 39 Lactococcus lactis strains was explored using pangenome comparative genome hybridization analysis. Variations were observed in the proteinase PrtP and its maturation protein PrtM, in one of the Opp transport systems and in several peptidases between strains from different Lactococcus subspecies or from different origin. CONCLUSIONS: The improved functional annotation of the proteolytic system components provides an excellent framework for future experimental validations of predicted enzymatic activities. The genome sequence data can be coupled to other "omics" data e.g. transcriptomics and metabolomics for prediction of proteolytic and flavor-forming potential of LAB strains. Such an integrated approach can be used to tune the strain selection process in food fermentations

    A predicted physicochemically distinct sub-proteome associated with the intracellular organelle of the anammox bacterium Kuenenia stuttgartiensis

    Get PDF
    Background: Anaerobic ammonium-oxidizing (anammox) bacteria perform a key step in global nitrogen cycling. These bacteria make use of an organelle to oxidize ammonia anaerobically to nitrogen (N2) and so contribute approximately 50% of the nitrogen in the atmosphere. It is currently unknown which proteins constitute the organellar proteome and how anammox bacteria are able to specifically target organellar and cell-envelope proteins to their correct final destinations. Experimental approaches are complicated by the absence of pure cultures and genetic accessibility. However, the genome of the anammox bacterium Candidatus "Kuenenia stuttgartiensis" has recently been sequenced. Here, we make use of these genome data to predict the organellar sub-proteome and address the molecular basis of protein sorting in anammox bacteria. Results: Two training sets representing organellar (30 proteins) and cell envelope (59 proteins) proteins were constructed based on previous experimental evidence and comparative genomics. Random forest (RF) classifiers trained on these two sets could differentiate between organellar and cell envelope proteins with ~89% accuracy using 400 features consisting of frequencies of two adjacent amino acid combinations. A physicochemically distinct organellar sub-proteome containing 562 proteins was predicted with the best RF classifier. This set included almost all catabolic and respiratory factors encoded in the genome. Apparently, the cytoplasmic membrane performs no catabolic functions. We predict that the Tat-translocation system is located exclusively in the organellar membrane, whereas the Sec-translocation system is located on both the organellar and cytoplasmic membranes. Canonical signal peptides were predicted and validated experimentally, but a specific (N- or C-terminal) signal that could be used for protein targeting to the organelle remained elusive. Conclusions: A physicochemically distinct organellar sub-proteome was predicted from the genome of the anammox bacterium K. stuttgartiensis. This result provides strong in silico support for the existing experimental evidence for the existence of an organelle in this bacterium, and is an important step forward in unravelling a geochemically relevant case of cytoplasmic differentiation in bacteria. The predicted dual location of the Sec-translocation system and the apparent absence of a specific N- or C-terminal signal in the organellar proteins suggests that additional chaperones may be necessary that act on an as-yet unknown property of the targeted proteins

    The cytoplasm of living cells: A functional mixture of thousands of components

    Full text link
    Inside every living cell is the cytoplasm: a fluid mixture of thousands of different macromolecules, predominantly proteins. This mixture is where most of the biochemistry occurs that enables living cells to function, and it is perhaps the most complex liquid on earth. Here we take an inventory of what is actually in this mixture. Recent genome-sequencing work has given us for the first time at least some information on all of these thousands of components. Having done so we consider two physical phenomena in the cytoplasm: diffusion and possible phase separation. Diffusion is slower in the highly crowded cytoplasm than in dilute solution. Reasonable estimates of this slowdown can be obtained and their consequences explored, for example, monomer-dimer equilibria are established approximately twenty times slower than in a dilute solution. Phase separation in all except exceptional cells appears not to be a problem, despite the high density and so strong protein-protein interactions present. We suggest that this may be partially a byproduct of the evolution of other properties, and partially a result of the huge number of components present.Comment: 11 pages, 1 figure, 1 tabl

    PhenoLink - a web-tool for linking phenotype to ~omics data for bacteria: application to gene-trait matching for Lactobacillus plantarum strains

    Get PDF
    Contains fulltext : 109042.pdf (publisher's version ) (Open Access)BACKGROUND: Linking phenotypes to high-throughput molecular biology information generated by ~omics technologies allows revealing cellular mechanisms underlying an organism's phenotype. ~Omics datasets are often very large and noisy with many features (e.g., genes, metabolite abundances). Thus, associating phenotypes to ~omics data requires an approach that is robust to noise and can handle large and diverse data sets. RESULTS: We developed a web-tool PhenoLink (http://bamics2.cmbi.ru.nl/websoftware/phenolink/) that links phenotype to ~omics data sets using well-established as well new techniques. PhenoLink imputes missing values and preprocesses input data (i) to decrease inherent noise in the data and (ii) to counterbalance pitfalls of the Random Forest algorithm, on which feature (e.g., gene) selection is based. Preprocessed data is used in feature (e.g., gene) selection to identify relations to phenotypes. We applied PhenoLink to identify gene-phenotype relations based on the presence/absence of 2847 genes in 42 Lactobacillus plantarum strains and phenotypic measurements of these strains in several experimental conditions, including growth on sugars and nitrogen-dioxide production. Genes were ranked based on their importance (predictive value) to correctly predict the phenotype of a given strain. In addition to known gene to phenotype relations we also found novel relations. CONCLUSIONS: PhenoLink is an easily accessible web-tool to facilitate identifying relations from large and often noisy phenotype and ~omics datasets. Visualization of links to phenotypes offered in PhenoLink allows prioritizing links, finding relations between features, finding relations between phenotypes, and identifying outliers in phenotype data. PhenoLink can be used to uncover phenotype links to a multitude of ~omics data, e.g., gene presence/absence (determined by e.g.: CGH or next-generation sequencing), gene expression (determined by e.g.: microarrays or RNA-seq), or metabolite abundance (determined by e.g.: GC-MS)

    Complete genome sequence of Geobacillus thermoglucosidans TNO-09.020, a thermophilic sporeformer associated with a dairy-processing environment

    Get PDF
    Item does not contain fulltextThermophilic spore-forming bacteria are a common cause of contamination in dairy products. We isolated the thermophilic strain Geobacillus thermoglucosidans TNO-09.020 from a milk processing plant and report the complete genome of a dairy plant isolate consisting of a single chromosome of 3.75 Mb.1 augustus 201

    Making sense of quorum sensing in lactobacilli: a special focus on Lactobacillus plantarum WCFS1

    Get PDF
    In silico identification criteria were defined to predict if genes encoding histidine protein kinases (HPKs) and response regulators (RRs) could be part of peptide-based quorum sensing (QS) two-component regulatory systems (QS-TCSs) in Firmicutes. These criteria were used to screen HPKs and RRs annotated on the completed genome sequences of Lactobacillus species, and several (putative) QS-TCSs were identified in this way. The five peptide-based QS-TCSs that were predicted on the Lactobacillus plantarum WCFS1 genome were further analysed to test their (QS) functionality. Four of these systems contained an upstream gene encoding a putative autoinducing peptide (AIP), of which two were preceded by a double-glycine-type leader peptide. One of these was identical to the plnABCD regulatory system of L. plantarum C11 and was shown to regulate plantaricin production in L. plantarum WCFS1. The third TCS was designated lamBDCA for Lactobacillus agr-like module, where the lamD gene was shown to encode a cyclic thiolactone peptide. The fourth TCS was paralogous to the lam system and contained a putative AIP-encoding gene but lacked the lamB gene. Finally, a genetically separated orphan HPK and RR that showed clear peptide-based QS characteristics could form a fifth peptide-based QS-TCS. The predicted presence of multiple (peptide-based) QS-TCSs in some lactobacilli and in particular in L. plantarum might be a reflection of the ability of these species to persist in a diverse range of ecological niches

    Complete Genome Sequence and Comparative Analysis of the Fish Pathogen Lactococcus garvieae

    Get PDF
    Lactococcus garvieae causes fatal haemorrhagic septicaemia in fish such as yellowtail. The comparative analysis of genomes of a virulent strain Lg2 and a non-virulent strain ATCC 49156 of L. garvieae revealed that the two strains shared a high degree of sequence identity, but Lg2 had a 16.5-kb capsule gene cluster that is absent in ATCC 49156. The capsule gene cluster was composed of 15 genes, of which eight genes are highly conserved with those in exopolysaccharide biosynthesis gene cluster often found in Lactococcus lactis strains. Sequence analysis of the capsule gene cluster in the less virulent strain L. garvieae Lg2-S, Lg2-derived strain, showed that two conserved genes were disrupted by a single base pair deletion, respectively. These results strongly suggest that the capsule is crucial for virulence of Lg2. The capsule gene cluster of Lg2 may be a genomic island from several features such as the presence of insertion sequences flanked on both ends, different GC content from the chromosomal average, integration into the locus syntenic to other lactococcal genome sequences, and distribution in human gut microbiomes. The analysis also predicted other potential virulence factors such as haemolysin. The present study provides new insights into understanding of the virulence mechanisms of L. garvieae in fish

    The pre- and post-authorisation data published by the European Medicines Agency on the use of Biologics during pregnancy and lactation

    Get PDF
    Aims: The effects of biologics on reproduction/lactation are mostly unknown although many patients that receive biologics are women of reproductive age. The first objective of this study was to investigate the publicly available data on pregnancy/lactation before and after marketing authorization in Europe of biologics for the indications of rheumatologic inflammatory autoimmune diseases and inflammatory bowel disease. Secondary objectives included the assessment of the clinical relevance of the provided data and comparison of initial and post-authorization data. Methods: Initial and post-authorization data were extracted from the European Public Assessment Reports and the latest versions of Summary of Product Characteristics using publicly available documents on the European Medicines Agency's website. Four sections were categorized regarding pregnancy outcomes: pre-clinical/animal studies, human female fertility, pregnancy-related outcomes and congenital malformations in the human fetus. Three sections were categorized regarding lactation outcomes: pre-clinical/animal studies, excretion in human breast milk and absorption in children through breastfeeding. The clinical applicability of each category was scored by specified criteria, based on scientific literature, and further as defined by the authors. Results: For the 16 included biologics, post-authorization data were delivered only for adalimumab, certolizumab pegol, etanercept and infliximab. For the 12 remaining biologics limited data on pregnancy and lactation during the post-marketing period of 2–21 years were available. Conclusions: In this article several suggestions are provided for improving a multidisciplinary approach to these issues. The initiation of suitable registries by marketing authorization holders and data transparency for clinicians and academics are highly endorsed
    corecore