10 research outputs found

    Improving phage genome annotation to understand phage biology: the case of Pseudomonas aeruginosa LES prophages

    Get PDF
    Pseudomonas aeruginosais an important opportunistic pathogen, causing nosocomial infections. The Liverpool Epidemic Strain (LES), a major cause of mortality and morbidity in cystic fibrosis patients, harbours five prophages associated with increased fitness and survival in models of infection. However, ~76.5% of the LES prophage genes are hypothetical proteins. Also, little is known about the LES prophage interactions with the lysogen and other prophages. In this study, we used the VIral Genome Annotation (VIGA) pipeline to re-annotate the LES prophage genomes and improve the prediction of gene function. The RefSeq viral proteins database was used for homology-based gene prediction and the Prokaryotic Virus Orthologous Genes (PVOGs) and Reference Viral DataBase (RVDB) were used for HMM-based protein function prediction. InterProScan 5.47-5.82 and Infernal 1.1.3 (with Rfam 14.3) were applied to enrich the gene function prediction for all genes and ncRNA elements, respectively. The number of putative coding sequences had increased by 1.17-1.43 times. Multiple genes related to DNA recombination and host cell lysis were identified in this reannotation. Also, we have identified new ncRNA elements in these prophages, such astRNAs in prophages 2 and 5 and a putative regulatory ncRNA, a Hammerhead-II ribozyme, in prophage 4. All this new information will be combined with data from future RNAseq experiments to map the expression profiles of each LES prophage under inducing and non-inducing conditions to characterise interactions between the prophages and their lysogen host

    Comparative analysis of gene prediction tools for viral genome annotation

    Get PDF
    The number of newly available viral genomes and metagenomes has increased exponentially since the development of high throughput sequencing platforms and genome analysis tools. Bioinformatic annotation pipelines are largely based on open reading frame (ORF) calling software, which identifies genes independently of the sequence taxonomical background. Although ORF-calling programs provide a rapid genome annotation, they can misidentify ORFs and start codons; errors that might be perpetuated and propagated over time. This study evaluated the performance of multiple ORF-calling programs for viral genome annotation against the complete RefSeq viral database. Programs outputs varied when considering the viral nucleic acid type versus the viral host. According to the number of ORFs, Prodigal and Metaprodigal were the most accurate programs for DNA viruses, while FragGeneScan and Prodigal generated the most accurate outputs for RNA viruses. Similarly, Prodigal outperformed the benchmark for viruses infecting prokaryotes, and GLIMMER and GeneMarkS produced the most accurate annotations for viruses infecting eukaryotes. When the coordinates of the ORFs were considered, Prodigal scored high for all scenarios except for RNA viruses, where GeneMarkS generated the most reliable results. Overall, the quality of the coordinates predicted for RNA viruses was poorer than for DNA viruses, suggesting the need for improved ORF-calling programs to deal with RNA viruses. Moreover, none of the ORF-calling programs reached 90% accuracy for annotation of DNA viruses. Any automatic annotation can still be improved by manual curation, especially when the presence of ORFs is validated with wet-lab experiments. However, our evaluation of the current ORF-calling programs is expected to be useful for the improvement of viral genome annotation pipelines and highlights the need for more expression data to improve the rigor of reference genomes

    Clay-induced DNA breaks as a path for genetic diversity, antibiotic resistance, and asbestos carcinogenesis

    Get PDF
    Natural clays and synthetic nanofibres can have a severe impact on human health. After several decades of research, the molecular mechanism of how asbestos induces cancer is not well understood. Different fibres, including asbestos, can penetrate cell membranes and introduce foreign DNA in bacterial and eukaryotic cells. Incubating Escherichia coli under friction forces with sepiolite, a clayey material, or with asbestos, causes double-strand DNA breaks. Antibiotics and clays are used together in animal husbandry, the mutagenic effect of these fibres could be a pathway to antibiotic resistance due to the friction provided by peristalsis of the gut from farm animals in addition to horizontal gene transfer. Moreover, we raise the possibility that the same mechanism could generate bacteria diversity in natural scenarios, playing a role in the evolution of species. Finally, we provide a new model on how asbestos may promote mutagenesis and cancer based on the observed mechanical genotoxicity

    Heat-killed lactobacilli alter both microbiota composition and behaviour

    Get PDF
    Recently it has been proposed to expand the definition of psychobiotics (beneficial bacteria (probiotics) or support for such bacteria (prebiotics) that positively impact mental health) to any exogenous influence whose effect on the brain is bacterially-mediated. This definition would include inactivated microorganisms with anxiolytic and antidepressant effects. The use of inactivated microorganisms has several advantages over living organisms, including no risk of infection in vulnerable individuals and ease of use in terms of storage and delivery options. It has been reported that consumption of inactivated microorganisms can affect behaviour, particularly in chronic or prolonged stress situations, but effects on healthy populations have not been investigated to the same extent. Also, only limited data is available on the effects of inactivated microorganisms on the microbiota of healthy individuals (animal or human). Therefore, we investigated the effect of feeding a standard mouse chow which incorporates ADR-159, a heat-killed fermentate generated by two Lactobacillus strains, on the behaviour and microbiota of healthy mice. Prolonged consumption of ADR-159 diet had no adverse effect on anthropometrics or general health, but the ADR-159 fed animals demonstrated increased sociability and lower baseline corticosterone levels (stress hormone). The diet also led to subtle but significant changes in the microbiota, with less abundant taxa being most affected. The behavioural, biochemical and microbiological results provide a new light on the impact of inactivated microorganisms and their metabolites on the social behaviour and microbiota of healthy mice

    Seawater salt-trapped Pseudomonas aeruginosa survives for years and gets primed for salinity tolerance

    Get PDF
    Background In nature, microorganisms have to adapt to long-term stressful conditions often with growth limitations. However, little is known about the evolution of the adaptability of new bacteria to such environments. Pseudomonas aeruginosa, an opportunistic pathogen, after natural evaporation of seawater, was shown to be trapped in laboratory-grown halite crystals and to remain viable after entrapment for years. However, how this bacterium persists and survives in such hypersaline conditions is not understood. Results In this study, we aimed to understand the basis of survival, and to characterise the physiological changes required to develop salt tolerance using P. aeruginosa as a model. Several clones of P. aeruginosa were rescued after 14 years in naturally evaporated marine salt crystals. Incubation of samples in nutrient-rich broth allowed re-growth and subsequent plating yielded observable colonies. Whole genome sequencing of the P. aeruginosa isolates confirmed the recovery of the original strain. The re-grown strains, however, showed a new phenotype consisting of an enhanced growth in growing salt concentration compared to the ancestor strain. The intracellular accumulation of K+ was elicited by high concentration of Na+ in the external medium to maintain the homeostasis. Whole transcriptomic analysis by microarray indicated that 78 genes had differential expression between the parental strain and its derivative clones. Sixty-one transcripts were up-regulated, while 17 were down-regulated. Based on a collection of single-gene knockout mutants and gene ontology analysis, we suggest that the adaptive response in P. aeruginosa to hyper-salinity relies on multiple gene product interactions. Conclusions The individual gene contributions build up the observed phenotype, but do not ease the identification of salinity-related metabolic pathways. The long-term inclusion of P. aeruginosa in salt crystals primes the bacteria, mediating a readjustment of the bacterial physiology to growth in higher salt concentrations. Our findings provide a starting point to understand how P. aeruginosa, a relevant environmental and pathogenic bacterium, survives to long-term salt stress

    BSGatlas : a unified Bacillus subtilis genome and transcriptome annotation atlas with enhanced information access

    Get PDF
    A large part of our current understanding of gene regulation in Gram-positive bacteria is based on Bacillus subtilis , as it is one of the most well studied bacterial model systems. The rapid growth in data concerning its molecular and genomic biology is distributed across multiple annotation resources. Consequently, the interpretation of data from further B. subtilis experiments becomes increasingly challenging in both low- and large-scale analyses. Additionally, B. subtilis annotation of structured RNA and non-coding RNA (ncRNA), as well as the operon structure, is still lagging behind the annotation of the coding sequences. To address these challenges, we created the B. subtilis genome atlas, BSGatlas, which integrates and unifies multiple existing annotation resources. Compared to any of the individual resources, the BSGatlas contains twice as many ncRNAs, while improving the positional annotation for 70 % of the ncRNAs. Furthermore, we combined known transcription start and termination sites with lists of known co-transcribed gene sets to create a comprehensive transcript map. The combination with transcription start/termination site annotations resulted in 717 new sets of co-transcribed genes and 5335 untranslated regions (UTRs). In comparison to existing resources, the number of 5′ and 3′ UTRs increased nearly fivefold, and the number of internal UTRs doubled. The transcript map is organized in 2266 operons, which provides transcriptional annotation for 92 % of all genes in the genome compared to the at most 82 % by previous resources. We predicted an off-target-aware genome-wide library of CRISPR–Cas9 guide RNAs, which we also linked to polycistronic operons. We provide the BSGatlas in multiple forms: as a website (https://rth.dk/resources/bsgatlas/), an annotation hub for display in the UCSC genome browser, supplementary tables and standardized GFF3 format, which can be used in large scale -omics studies. By complementing existing resources, the BSGatlas supports analyses of the B. subtilis genome and its molecular biology with respect to not only non-coding genes but also genome-wide transcriptional relationships of all genes

    The quantification of representative sequences pipeline for amplicon sequencing : case study on within-population ITS1 sequence variation in a microparasite infecting Daphnia

    No full text
    Next generation sequencing (NGS) platforms are replacing traditional molecular biology protocols like cloning and Sanger sequencing. However, accuracy of NGS platforms has rarely been measured when quantifying relative frequencies of genotypes or taxa within populations. Here we developed a new bioinformatic pipeline (QRS) that pools similar sequence variants and estimates their frequencies in NGS data sets from populations or communities. We tested whether the estimated frequency of representative sequences, generated by 454 amplicon sequencing, differs significantly from that obtained by Sanger sequencing of cloned PCR products. This was performed by analysing sequence variation of the highly variable first internal transcribed spacer (ITS1) of the ichthyosporean Caullerya mesnili, a microparasite of cladocerans of the genus Daphnia. This analysis also serves as a case example of the usage of this pipeline to study within‐population variation. Additionally, a public Illumina data set was used to validate the pipeline on community‐level data. Overall, there was a good correspondence in absolute frequencies of C. mesnili ITS1 sequences obtained from Sanger and 454 platforms. Furthermore, analyses of molecular variance (amova) revealed that population structure of C. mesnili differs across lakes and years independently of the sequencing platform. Our results support not only the usefulness of amplicon sequencing data for studies of within‐population structure but also the successful application of the QRS pipeline on Illumina‐generated data. The QRS pipeline is freely available together with its documentation under GNU Public Licence version 3 at http://code.google.com/p/quantification-representative-sequences</a

    Biology and taxonomy of crAss-like Bacteriophages, the most abundant virus in the human gut

    Get PDF
    CrAssphages represent the most abundant virus in the human gut microbiota, but the lack of available genome sequences for comparison has kept them enigmatic. Recently, sequence-based classification of distantly related crAss-like phages from multiple environments was reported, leading to a proposed familial-level taxonomic group. Here, we assembled the metagenomic sequencing reads from 702 human fecal virome/phageome samples and analyzed 99 complete circular crAss-like phage genomes and 150 contigs ≥70 kb. In silico comparative genomics and taxonomic analysis enabled a classification scheme of crAss-like phages from human fecal microbiomes into four candidate subfamilies composed of ten candidate genera. Laboratory analysis was performed on fecal samples from an individual harboring seven distinct crAss-like phages. We achieved crAss-like phage propagation in ex vivo human fecal fermentations and visualized short-tailed podoviruses by electron microscopy. Mass spectrometry of a crAss-like phage capsid protein could be linked to metagenomic sequencing data, confirming crAss-like phage structural annotations

    The International Virus Bioinformatics Meeting 2022

    Get PDF
    The International Virus Bioinformatics Meeting 2022 took place online, on 23-25 March 2022, and has attracted about 380 participants from all over the world. The goal of the meeting was to provide a meaningful and interactive scientific environment to promote discussion and collaboration and to inspire and suggest new research directions and questions. The participants created a highly interactive scientific environment even without physical face-to-face interactions. This meeting is a focal point to gain an insight into the state-of-the-art of the virus bioinformatics research landscape and to interact with researchers in the forefront as well as aspiring young scientists. The meeting featured eight invited and 18 contributed talks in eight sessions on three days, as well as 52 posters, which were presented during three virtual poster sessions. The main topics were: SARS-CoV-2, viral emergence and surveillance, virus-host interactions, viral sequence analysis, virus identification and annotation, phages, and viral diversity. This report summarizes the main research findings and highlights presented at the meeting
    corecore