12,927 research outputs found
Insights into bacterial genome composition through variable target GC content profiling
This study presents a new computational method for guanine (G) and cytosine (C), or GC, content profiling based on the idea of multiple resolution sampling (MRS). The benefit of our new approach over existing techniques follows from its ability to locate significant regions without prior knowledge of the sequence, nor the features being sought. The use of MRS has provided novel insights into bacterial genome composition. Key findings include those that are related to the core composition of bacterial genomes, to the identification of large genomic islands (in Enterobacterial genomes), and to the identification of surface protein determinants in human pathogenic organisms (e.g., Staphylococcus genomes). We observed that bacterial surface binding proteins maintain abnormal GC content, potentially pointing to a viral origin. This study has demonstrated that GC content holds a high informational worth and hints at many underlying evolutionary processes. For online Supplementary Material, see www.liebertonline.com
Recommended from our members
Shotgun metagenome data of a defined mock community using Oxford Nanopore, PacBio and Illumina technologies.
Metagenomic sequence data from defined mock communities is crucial for the assessment of sequencing platform performance and downstream analyses, including assembly, binning and taxonomic assignment. We report a comparison of shotgun metagenome sequencing and assembly metrics of a defined microbial mock community using the Oxford Nanopore Technologies (ONT) MinION, PacBio and Illumina sequencing platforms. Our synthetic microbial community BMock12 consists of 12 bacterial strains with genome sizes spanning 3.2-7.2 Mbp, 40-73% GC content, and 1.5-7.3% repeats. Size selection of both PacBio and ONT sequencing libraries prior to sequencing was essential to yield comparable relative abundances of organisms among all sequencing technologies. While the Illumina-based metagenome assembly yielded good coverage with few misassemblies, contiguity was greatly improved by both, Illumina + ONT and Illumina + PacBio hybrid assemblies but increased misassemblies, most notably in genomes with high sequence similarity to each other. Our resulting datasets allow evaluation and benchmarking of bioinformatics software on Illumina, PacBio and ONT platforms in parallel
Capsular profiling of the Cronobacter genus and the association of specific Cronobacter sakazakii and C. malonaticus capsule types with neonatal meningitis and necrotizing enterocolitis
Background: Cronobacter sakazakii and C. malonaticus can cause serious diseases especially in infants where they are associated with rare but fatal neonatal infections such as meningitis and necrotising enterocolitis.
Methods: This study used 104 whole genome sequenced strains, covering all seven species in the genus, to analyse capsule associated clusters of genes involved in the biosynthesis of the O-antigen, colanic acid, bacterial cellulose, enterobacterial common antigen (ECA), and a previously uncharacterised K-antigen.
Results: Phylogeny of the gnd and galF genes flanking the O-antigen region enabled the defining of 38 subgroups which are potential serotypes. Two variants of the colanic acid synthesis gene cluster (CA1 and CA2) were found which differed with the absence of galE in CA2. Cellulose (bcs genes) were present in all species, but were absent in C. sakazakii sequence type (ST) 13 and clonal complex (CC) 100 strains. The ECA locus was found in all strains. The K-antigen capsular polysaccharide Region 1 (kpsEDCS) and Region 3 (kpsMT) genes were found in all Cronobacter strains. The highly variable Region 2 genes were assigned to 2 homology groups (K1 and K2). C. sakazakii and C. malonaticus isolates with capsular type [K2:CA2:Cell+] were associated with neonatal meningitis and necrotizing enterocolitis. Other capsular types were less associated with clinical infections. Conclusion: This study proposes a new capsular typing scheme which identifies a possible important virulence trait associated with severe neonatal infections. The various capsular polysaccharide structures warrant further investigation as they could be relevant to macrophage survival, desiccation resistance, environmental survival, and biofilm formation in the hospital environment, including neonatal enteral feeding tubes
Nucleic acid-based approaches to investigate microbial-related cheese quality defects
peer-reviewedThe microbial profile of cheese is a primary determinant of cheese quality. Microorganisms can contribute to aroma and taste defects, form biogenic amines, cause gas and secondary fermentation defects, and can contribute to cheese pinking and mineral deposition issues. These defects may be as a result of seasonality and the variability in the composition of the milk supplied, variations in cheese processing parameters, as well as the nature and number of the non-starter microorganisms which come from the milk or other environmental sources. Such defects can be responsible for production and product recall costs and thus represent a significant economic burden for the dairy industry worldwide. Traditional non-molecular approaches are often considered biased and have inherently slow turnaround times. Molecular techniques can provide early and rapid detection of defects that result from the presence of specific spoilage microbes and, ultimately, assist in enhancing cheese quality and reducing costs. Here we review the DNA-based methods that are available to detect/quantify spoilage bacteria, and relevant metabolic pathways in cheeses and, in the process, highlight how these strategies can be employed to improve cheese quality and reduce the associated economic burden on cheese processors.This work was funded by the Department of Agriculture, Food and the Marine under the Food Institutional Research Measure. Daniel J. O’Sullivan is in receipt of a Teagasc Walsh Fellowship,
Grant Number:2012205
Capturing the ‘ome’ : the expanding molecular toolbox for RNA and DNA library construction
All sequencing experiments and most functional genomics screens rely on the generation of libraries to comprehensively capture pools of targeted sequences. In the past decade especially, driven by the progress in the field of massively parallel sequencing, numerous studies have comprehensively assessed the impact of particular manipulations on library complexity and quality, and characterized the activities and specificities of several key enzymes used in library construction. Fortunately, careful protocol design and reagent choice can substantially mitigate many of these biases, and enable reliable representation of sequences in libraries. This review aims to guide the reader through the vast expanse of literature on the subject to promote informed library generation, independent of the application
Recommended from our members
Gene Regulatory Compatibility in Bacteria: Consequences for Synthetic Biology and Evolution
Mechanistic understanding of gene regulation is crucial for rational engineering of new genetic systems through synthetic biology. Genetic engineering efforts in new organisms are often hampered by a lack of knowledge about how regulatory components function in new host contexts. This dissertation focuses on efforts to overcome these challenges through the development of generalizable experimental methods for studying the behavior of DNA regulatory sequences in diverse species at large-scale.
Chapter 2 describes experimental approaches for quantitatively assessing the functions of thousands of diverse natural regulatory sequences through a combination of metagenomic mining, high-throughput DNA synthesis and deep sequencing. By employing these methods in three distinct bacterial species, we revealed striking functional differences in gene regulatory capacity. We identified regulatory sequences with activity levels with activity levels spanning several orders of magnitude, which will aid in efforts to engineer diverse bacterial species. We also demonstrate functional species-selective gene circuits with programmable host behaviors that may be useful for microbial community engineering. In Chapter 3 we provide evidence for the evolution of altered stringency in σ70-mediated transcriptional activation based on patterns of initiation and activity from promoters of diverse compositions. We show that the contrast in GC content between a regulatory element and the host genome dictates both the likelihood and the magnitude of expression. We also discuss the potential implications of this proposed mechanism on horizontal gene transfer.
The next two chapters focus on efforts aimed at extending the high-throughput methods described in earlier chapters to new organisms. Chapter 4 presents an in vitro approach for multiplexed gene expression profiling. Through the development and use of cell-free expression systems made from diverse bacteria, it was possible to rapidly acquire thousands of transcriptional measurements in small volume reactions, enabling functional comparisons of regulatory sequence function across multiple species. In Chapter 5 we characterize the restriction-modification system repertoires of several commensal bacterial species. We also describe ongoing efforts to develop methods for bypassing these systems in order to increase transformation efficiencies in species that are difficult or impossible to transform using current approaches
Exploring the loblolly pine (Pinus taeda L.) genome by BAC sequencing and Cot analysis.
Loblolly pine (LP; Pinus taeda L.) is an economically and ecologically important tree in the southeastern U.S. To advance understanding of the loblolly pine (LP; Pinus taeda L.) genome, we sequenced and analyzed 100 BAC clones and performed a Cot analysis. The Cot analysis indicates that the genome is composed of 57, 24, and 10% highly-repetitive, moderately-repetitive, and single/low-copy sequences, respectively (the remaining 9% of the genome is a combination of fold back and damaged DNA). Although single/low-copy DNA only accounts for 10% of the LP genome, the amount of single/low-copy DNA in LP is still 14 times the size of the Arabidopsis genome. Since gene numbers in LP are similar to those in Arabidopsis, much of the single/low-copy DNA of LP would appear to be composed of DNA that is both gene- and repeat-poor. Macroarrays prepared from a LP bacterial artificial chromosome (BAC) library were hybridized with probes designed from cell wall synthesis/wood development cDNAs, and 50 of the "targeted" clones were selected for further analysis. An additional 25 clones were selected because they contained few repeats, while 25 more clones were selected at random. The 100 BAC clones were Sanger sequenced and assembled. Of the targeted BACs, 80% contained all or part of the cDNA used to target them. One targeted BAC was found to contain fungal DNA and was eliminated from further analysis. Combinations of similarity-based and ab initio gene prediction approaches were utilized to identify and characterize potential coding regions in the 99 BACs containing LP DNA. From this analysis, we identified 154 gene models (GMs) representing both putative protein-coding genes and likely pseudogenes. Ten of the GMs (all of which were specifically targeted) had enough support to be classified as intact genes. Interestingly, the 154 GMs had statistically indistinguishable (α = 0.05) distributions in the targeted and random BAC clones (15.18 and 12.61 GM/Mb, respectively), whereas the low-repeat BACs contained significantly fewer GMs (7.08 GM/Mb). However, when GM length was considered, the targeted BACs had a significantly greater percentage of their length in GMs (3.26%) when compared to random (1.63%) and low-repeat (0.62%) BACs. The results of our study provide insight into LP evolution and inform ongoing efforts to produce a reference genome sequence for LP, while characterization of genes involved in cell wall production highlights carbon metabolism pathways that can be leveraged for increasing wood production
- …