11 research outputs found

    Thousands of small, novel genes predicted in global phage genomes

    Get PDF
    Small genes (40,000 small-gene families in ∼2.3 million phage genome contigs. We find that small genes in phage genomes are approximately 3-fold more prevalent than in host prokaryotic genomes. Our approach enriches for small genes that are translated in microbiomes, suggesting the small genes identified are coding. More than 9,000 families encode potentially secreted or transmembrane proteins, more than 5,000 families encode predicted anti-CRISPR proteins, and more than 500 families encode predicted antimicrobial proteins. By combining homology and genomic-neighborhood analyses, we reveal substantial novelty and diversity within phage biology, including small phage genes found in multiple host phyla, small genes encoding proteins that play essential roles in host infection, and small genes that share genomic neighborhoods and whose encoded proteins may share related functions

    Identifying candidate structured RNAs in CRISPR operons.

    No full text
    Noncoding RNAs with secondary structures play important roles in CRISPR-Cas systems. Many of these structures likely remain undiscovered. We used a large-scale comparative genomics approach to predict 156 novel candidate structured RNAs from 36,111 CRISPR-Cas systems. A number of these were found to overlap with coding genes, including palindromic candidates that overlapped with a variety of Cas genes in type I and III systems. Among these 156 candidates, we identified 46 new models of CRISPR direct repeats and 1 tracrRNA. This tracrRNA model occasionally overlapped with predicted cas9 coding regions, emphasizing the importance of expanding our search windows for novel structure RNAs in coding regions. We also demonstrated that the antirepeat sequence in this tracrRNA model can be used to accurately assign thousands of predicted CRISPR arrays to type II-C systems. This study highlights the importance of unbiased identification of candidate structured RNAs across CRISPR-Cas systems

    Identification of over ten thousand candidate structured RNAs in viruses and phages

    No full text
    Structured RNAs play crucial roles in viruses, exerting influence over both viral and host gene expression. However, the extensive diversity of structured RNAs and their ability to act in cis or trans positions pose challenges for predicting and assigning their functions. While comparative genomics approaches have successfully predicted candidate structured RNAs in microbes on a large scale, similar efforts for viruses have been lacking. In this study, we screened over 5 million DNA and RNA viral sequences, resulting in the prediction of 10,006 novel candidate structured RNAs. These predictions are widely distributed across taxonomy and ecosystem. We found transcriptional evidence for 206 of these candidate structured RNAs in the human fecal microbiome. These candidate RNAs exhibited evidence of nucleotide covariation, indicative of selective pressure maintaining the predicted secondary structures. Our analysis revealed a diverse repertoire of candidate structured RNAs, encompassing a substantial number of putative tRNAs or tRNA-like structures, Rho-independent transcription terminators, and potentially cis-regulatory structures consistently positioned upstream of genes. In summary, our findings shed light on the extensive diversity of structured RNAs in viruses, offering a valuable resource for further investigations into their functional roles and implications in viral gene expression and pave the way for a deeper understanding of the intricate interplay between viruses and their hosts at the molecular level

    Thousands of small, novel genes predicted in global phage genomes

    No full text
    Small genes (\u3c150 nucleotides) have been systematically overlooked in phage genomes. We employ a large-scale comparative genomics approach to predict \u3e40,000 small-gene families in ∼2.3 million phage genome contigs. We find that small genes in phage genomes are approximately 3-fold more prevalent than in host prokaryotic genomes. Our approach enriches for small genes that are translated in microbiomes, suggesting the small genes identified are coding. More than 9,000 families encode potentially secreted or transmembrane proteins, more than 5,000 families encode predicted anti-CRISPR proteins, and more than 500 families encode predicted antimicrobial proteins. By combining homology and genomic-neighborhood analyses, we reveal substantial novelty and diversity within phage biology, including small phage genes found in multiple host phyla, small genes encoding proteins that play essential roles in host infection, and small genes that share genomic neighborhoods and whose encoded proteins may share related functions

    Epidemiologic Investigation of a Cluster of Neuroinvasive Bacillus cereus Infections in 5 Patients With Acute Myelogenous Leukemia

    No full text
    Background. Five neuroinvasive Bacillus cereus infections (4 fatal) occurred in hospitalized patients with acute myelogenous leukemia (AML) during a 9-month period, prompting an investigation by infection control and public health officials. Methods. Medical records of case-patients were reviewed and a matched case-control study was performed. Infection control practices were observed. Multiple environmental, food, and medication samples common to AML patients were cultured. Multilocus sequence typing was performed for case and environmental B cereus isolates. Results. All 5 case-patients received chemotherapy and had early-onset neutropenic fevers that resolved with empiric antibiotics. Fever recurred at a median of 17 days (range, 9–20) with headaches and abrupt neurological deterioration. Case-patients had B cereus identified in central nervous system (CNS) samples by (1) polymerase chain reaction or culture or (2) bacilli seen on CNS pathology stains with high-grade B cereus bacteremia. Two case-patients also had colonic ulcers with abundant bacilli on autopsy. No infection control breaches were observed. On case-control analysis, bananas were the only significant exposure shared by all 5 case-patients (odds ratio, 9.3; P = .04). Five environmental or food isolates tested positive for B cereus, including a homogenized banana peel isolate and the shelf of a kitchen cart where bananas were stored. Multilocus sequence typing confirmed that all case and environmental strains were genetically distinct. Multilocus sequence typing-based phylogenetic analysis revealed that the organisms clustered in 2 separate clades. Conclusions. The investigation of this neuroinvasive B cereus cluster did not identify a single point source but was suggestive of a possible dietary exposure. Our experience underscores the potential virulence of B cereus in immunocompromised hosts
    corecore