222 research outputs found

    Genome-wide transcription start site profiling in biofilm-grown Burkholderia cenocepacia J2315

    Get PDF
    Background: Burkholderia cenocepacia is a soil-dwelling Gram-negative Betaproteobacterium with an important role as opportunistic pathogen in humans. Infections with B. cenocepacia are very difficult to treat due to their high intrinsic resistance to most antibiotics. Biofilm formation further adds to their antibiotic resistance. B. cenocepacia harbours a large, multi-replicon genome with a high GC-content, the reference genome of strain J2315 includes 7374 annotated genes. This study aims to annotate transcription start sites and identify novel transcripts on a whole genome scale. Methods: RNA extracted from B. cenocepacia J2315 biofilms was analysed by differential RNA-sequencing and the resulting dataset compared to data derived from conventional, global RNA-sequencing. Transcription start sites were annotated and further analysed according to their position relative to annotated genes. Results: Four thousand ten transcription start sites were mapped over the whole B. cenocepacia genome and the primary transcription start site of 2089 genes expressed in B. cenocepacia biofilms were defined. For 64 genes a start codon alternative to the annotated one was proposed. Substantial antisense transcription for 105 genes and two novel protein coding sequences were identified. The distribution of internal transcription start sites can be used to identify genomic islands in B. cenocepacia. A potassium pump strongly induced only under biofilm conditions was found and 15 non-coding small RNAs highly expressed in biofilms were discovered. Conclusions: Mapping transcription start sites across the B. cenocepacia genome added relevant information to the J2315 annotation. Genes and novel regulatory RNAs putatively involved in B. cenocepacia biofilm formation were identified. These findings will help in understanding regulation of B. cenocepacia biofilm formation

    Deep sequencing approaches for the analysis of prokaryotic transcriptional boundaries and dynamics

    Get PDF
    The identification of the protein-coding regions of a genome is straightforward due to the universality of start and stop codons. However, the boundaries of the transcribed regions, conditional operon structures, non-coding RNAs and the dynamics of transcription, such as pausing of elongation, are non-trivial to identify, even in the comparatively simple genomes of prokaryotes. Traditional methods for the study of these areas, such as tiling arrays, are noisy, labour-intensive and lack the resolution required for densely-packed bacterial genomes. Recently, deep sequencing has become increasingly popular for the study of the transcriptome due to its lower costs, higher accuracy and single nucleotide resolution. These methods have revolutionised our understanding of prokaryotic transcriptional dynamics. Here, we review the deep sequencing and data analysis techniques that are available for the study of transcription in prokaryotes, and discuss the bioinformatic considerations of these analyses

    Identification of small RNAs abundant in Burkholderia cenocepacia biofilms reveal putative regulators with a potential role in carbon and iron metabolism

    Get PDF
    Small RNAs play a regulatory role in many central metabolic processes of bacteria, as well as in developmental processes such as biofilm formation. Small RNAs of Burkholderia cenocepacia, an opportunistic pathogenic beta-proteobacterium, are to date not well characterised. To address that, we performed genome-wide transcriptome structure analysis of biofilm grown B. cenocepacia J2315. 41 unannotated short transcripts were identified in intergenic regions of the B. cenocepacia genome. 15 of these short transcripts, highly abundant in biofilms, widely conserved in Burkholderia sp. and without known function, were selected for in-depth analysis. Expression profiling showed that most of these sRNAs are more abundant in biofilms than in planktonic cultures. Many are also highly abundant in cells grown in minimal media, suggesting they are involved in adaptation to nutrient limitation and growth arrest. Their computationally predicted targets include a high proportion of genes involved in carbon metabolism. Expression and target genes of one sRNA suggest a potential role in regulating iron homoeostasis. The strategy used for this study to detect sRNAs expressed in B. cenocepacia biofilms has successfully identified sRNAs with a regulatory function

    Dual RNA sequencing (dRNA-Seq) of bacteria and their host cells

    Full text link
    Bacterial pathogens subvert host cells by manipulating cellular pathways for survival and replication; in turn, host cells respond to the invading pathogen through cascading changes in gene expression. Deciphering these complex temporal and spatial dynamics to identify novel bacterial virulence factors or host response pathways is crucial for improved diagnostics and therapeutics. Dual RNA sequencing (dRNA-Seq) has recently been developed to simultaneously capture host and bacterial transcriptomes from an infected cell. This approach builds on the high sensitivity and resolution of RNA-Seq technology and is applicable to any bacteria that interact with eukaryotic cells, encompassing parasitic, commensal or mutualistic lifestyles. We pioneered dRNA-Seq to simultaneously capture prokaryotic and eukaryotic expression profiles of cells infected with bacteria, using in vitro Chlamydia-infected epithelial cells as proof of principle. Here we provide a detailed laboratory and bioinformatics protocol for dRNA-seq that is readily adaptable to any host-bacteria system of interest

    RNA-seq – Revealing Biological Insights in Bacteria

    Get PDF
    New technologies are constantly being released and the improvements therein bring advances not only to transcriptome, the focus of this chapter, but also to diverse areas of biological research. Since the announcement and application of the RNA-seq approach, discoveries are being made in this field, but when we consider bacterial species, this progress proceeded a few years behind. However, with the application of RNA-seq derivative approaches, we can gain biological insights into the bacterial world and aspire to uncover the mysteries involving gene expression, organization and other functional genomic features

    Deep sequencing uncovers numerous small RNAs on all four replicons of the plant pathogen Agrobacterium tumefaciens

    Get PDF
    Agrobacterium species are capable of interkingdom gene transfer between bacteria and plants. The genome of Agrobacterium tumefaciens consists of a circular and a linear chromosome, the At-plasmid and the Ti-plasmid, which harbors bacterial virulence genes required for tumor formation in plants. Little is known about promoter sequences and the small RNA (sRNA) repertoire of this and other α-proteobacteria. We used a differential RNA sequencing (dRNA-seq) approach to map transcriptional start sites of 388 annotated genes and operons. In addition, a total number of 228 sRNAs was revealed from all four Agrobacterium replicons. Twenty-two of these were confirmed by independent RNA gel blot analysis and several sRNAs were differentially expressed in response to growth media, growth phase, temperature or pH. One sRNA from the Ti-plasmid was massively induced under virulence conditions. The presence of 76 cis-antisense sRNAs, two of them on the reverse strand of virulence genes, suggests considerable antisense transcription in Agrobacterium. The information gained from this study provides a valuable reservoir for an in-depth understanding of sRNA-mediated regulation of the complex physiology and infection process of Agrobacterium

    The Conserved Dcw Gene Cluster of R. sphaeroides Is Preceded by an Uncommonly Extended 5’ Leader Featuring the sRNA UpsM

    Get PDF
    Cell division and cell wall synthesis mechanisms are similarly conserved among bacteria. Consequently some bacterial species have comparable sets of genes organized in the dcw (division and cell wall) gene cluster. Dcw genes, their regulation and their relative order within the cluster are outstandingly conserved among rod shaped and gram negative bacteria to ensure an efficient coordination of growth and division. A well studied representative is the dcw gene cluster of E. coli. The first promoter of the gene cluster (mraZ1p) gives rise to polycistronic transcripts containing a 38 nt long 5’ UTR followed by the first gene mraZ. Despite reported conservation we present evidence for a much longer 5’ UTR in the gram negative and rod shaped bacterium Rhodobacter sphaeroides and in the family of Rhodobacteraceae. This extended 268 nt long 5’ UTR comprises a Rho independent terminator, which in case of termination gives rise to a non-coding RNA (UpsM). This sRNA is conditionally cleaved by RNase E under stress conditions in an Hfq- and very likely target mRNA-dependent manner, implying its function in trans. These results raise the question for the regulatory function of this extended 5’ UTR. It might represent the rarely described case of a trans acting sRNA derived from a riboswitch with exclusive presence in the family of Rhodobacteraceae

    The architecture and ppGpp-dependent expression of the primary transcriptome of Salmonella Typhimurium during invasion gene expression

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Invasion of intestinal epithelial cells by <it>Salmonella enterica </it>serovar Typhimurium (<it>S</it>. Typhimurium) requires expression of the extracellular virulence gene expression programme (ST<sup>EX</sup>), activation of which is dependent on the signalling molecule guanosine tetraphosphate (ppGpp). Recently, next-generation transcriptomics (RNA-seq) has revealed the unexpected complexity of bacterial transcriptomes and in this report we use differential RNA sequencing (dRNA-seq) to define the high-resolution transcriptomic architecture of wild-type <it>S</it>. Typhimurium and a ppGpp null strain under growth conditions which model ST<sup>EX</sup>. In doing so we show that ppGpp plays a much wider role in regulating the <it>S</it>. Typhimurium ST<sup>EX </sup>primary transcriptome than previously recognised.</p> <p>Results</p> <p>Here we report the precise mapping of transcriptional start sites (TSSs) for 78% of the <it>S</it>. Typhimurium open reading frames (ORFs). The TSS mapping enabled a genome-wide promoter analysis resulting in the prediction of 169 alternative sigma factor binding sites, and the prediction of the structure of 625 operons. We also report the discovery of 55 new candidate small RNAs (sRNAs) and 302 candidate antisense RNAs (asRNAs). We discovered 32 ppGpp-dependent alternative TSSs and determined the extent and level of ppGpp-dependent coding and non-coding transcription. We found that 34% and 20% of coding and non-coding RNA transcription respectively was ppGpp-dependent under these growth conditions, adding a further dimension to the role of this remarkable small regulatory molecule in enabling rapid adaptation to the infective environment.</p> <p>Conclusions</p> <p>The transcriptional architecture of <it>S</it>. Typhimurium and finer definition of the key role ppGpp plays in regulating <it>Salmonella </it>coding and non-coding transcription should promote the understanding of gene regulation in this important food borne pathogen and act as a resource for future research.</p

    Deep splicing plasticity of the human adenovirus type 5 transcriptome drives virus evolution

    Get PDF
    Viral genomes have high gene densities and complex transcription strategies rendering transcriptome analysis through short-read RNA-seq approaches problematic. Adenovirus transcription and splicing is especially complex. We used long-read direct RNA sequencing to study adenovirus transcription and splicing during infection. This revealed a previously unappreciated complexity of alternative splicing and potential for secondary initiating codon usage. Moreover, we find that most viral transcripts tend to shorten polyadenylation lengths as infection progresses. Development of an open reading frame centric bioinformatics analysis pipeline provided a deeper quantitative and qualitative understanding of adenovirus’s genetic potential. Across the viral genome adenovirus makes multiple distinctly spliced transcripts that code for the same protein. Over 11,000 different splicing patterns were recorded across the viral genome, most occurring at low levels. This low-level use of alternative splicing patterns potentially enables the virus to maximise its coding potential over evolutionary timescales
    corecore