1,357 research outputs found

    Printing Microbial Dark Matter: Using Single Cell Dispensing and Genomics to Investigate the Patescibacteria/Candidate Phyla Radiation

    Get PDF
    As of today, the majority of environmental microorganisms remain uncultured. They are therefore referred to as “microbial dark matter.” In the recent past, cultivation-independent methods like single-cell genomics (SCG) enabled the discovery of many previously unknown microorganisms, among them the Patescibacteria/Candidate Phyla Radiation (CPR). This approach was shown to be complementary to metagenomics, however, the development of additional and refined sorting techniques beyond the most commonly used fluorescence-activated cell sorting (FACS) is still desirable to enable additional downstream applications. Adding image information on the number and morphology of sorted cells would be beneficial, as would be minimizing cell stress caused by sorting conditions such as staining or pressure. Recently, a novel cell sorting technique has been developed, a microfluidic single-cell dispenser, which assesses the number and morphology of the cell in each droplet by automated light microscopic processing. Here, we report for the first time the successful application of the newly developed single-cell dispensing system for label-free isolation of individual bacteria from a complex sample retrieved from a wastewater treatment plant, demonstrating the potential of this technique for single cell genomics and other alternative downstream applications. Genome recovery success rated above 80% with this technique—out of 880 sorted cells 717 were successfully amplified. For 50.1% of these, analysis of the 16S rRNA gene was feasible and led to the sequencing of 50 sorted cells identified as Patescibacteria/CPR members. Subsequentially, 27 single amplified genomes (SAGs) of 15 novel and distinct Patescibacteria/CPR members, representing yet unseen species, genera and families could be captured and reconstructed. This phylogenetic distinctness of the recovered SAGs from available metagenome-assembled genomes (MAGs) is accompanied by the finding that these lineages—in whole or in part—have not been accessed by genome-resolved metagenomics of the same sample, thereby emphasizing the importance and opportunities of SCGs

    DNA Sequencing

    Get PDF
    This book illustrates methods of DNA sequencing and its application in plant, animal and medical sciences. It has two distinct sections. The one includes 2 chapters devoted to the DNA sequencing methods and the second includes 6 chapters focusing on various applications of this technology. The content of the articles presented in the book is guided by the knowledge and experience of the contributing authors. This book is intended to serve as an important resource and review to the researchers in the field of DNA sequencing

    A new set of ESTs and cDNA clones from full-length and normalized libraries for gene discovery and functional characterization in citrus

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Interpretation of ever-increasing raw sequence information generated by modern genome sequencing technologies faces multiple challenges, such as gene function analysis and genome annotation. Indeed, nearly 40% of genes in plants encode proteins of unknown function. Functional characterization of these genes is one of the main challenges in modern biology. In this regard, the availability of full-length cDNA clones may fill in the gap created between sequence information and biological knowledge. Full-length cDNA clones facilitate functional analysis of the corresponding genes enabling manipulation of their expression in heterologous systems and the generation of a variety of tagged versions of the native protein. In addition, the development of full-length cDNA sequences has the power to improve the quality of genome annotation.</p> <p>Results</p> <p>We developed an integrated method to generate a new normalized EST collection enriched in full-length and rare transcripts of different citrus species from multiple tissues and developmental stages. We constructed a total of 15 cDNA libraries, from which we isolated 10,898 high-quality ESTs representing 6142 different genes. Percentages of redundancy and proportion of full-length clones range from 8 to 33, and 67 to 85, respectively, indicating good efficiency of the approach employed. The new EST collection adds 2113 new citrus ESTs, representing 1831 unigenes, to the collection of citrus genes available in the public databases. To facilitate functional analysis, cDNAs were introduced in a Gateway-based cloning vector for high-throughput functional analysis of genes <it>in planta</it>. Herein, we describe the technical methods used in the library construction, sequence analysis of clones and the overexpression of <it>CitrSEP</it>, a citrus homolog to the Arabidopsis <it>SEP3 </it>gene, in Arabidopsis as an example of a practical application of the engineered Gateway vector for functional analysis.</p> <p>Conclusion</p> <p>The new EST collection denotes an important step towards the identification of all genes in the citrus genome. Furthermore, public availability of the cDNA clones generated in this study, and not only their sequence, enables testing of the biological function of the genes represented in the collection. Expression of the citrus <it>SEP3 </it>homologue, <it>CitrSEP</it>, in Arabidopsis results in early flowering, along with other phenotypes resembling the over-expression of the Arabidopsis <it>SEPALLATA </it>genes. Our findings suggest that the members of the <it>SEP </it>gene family play similar roles in these quite distant plant species.</p

    The MGX framework for microbial community analysis

    Get PDF
    Jaenicke S. The MGX framework for microbial community analysis. Bielefeld: Universität Bielefeld; 2020

    Development of a read mapping analysis software and computational pan genome analysis of 20 Pseudomonas aeruginosa strains

    Get PDF
    Hilker R. Development of a read mapping analysis software and computational pan genome analysis of 20 Pseudomonas aeruginosa strains. Bielefeld: Bielefeld University; 2015.In times of multi-resistant pathogenic bacteria, their detailed study is of utmost importance. Their comparative analysis can even aid the emerging field of personalized medicine by enabling optimized treatment depending on the presence of virulence factors and antibiotic resistances in the infection concerned. The weaknesses and functionality of these pathogenic bacteria can be investigated using modern computer science and novel sequencing technologies. One of these methods is the bioinformatics evaluation of high-throughput sequencing data. A pathogenic bacterium posing severe health care issues is the ubiquitous Pseudomonas aeruginosa. It is involved in a wide range of infections mainly affecting the pulmonary or urinary tract, open wounds and burns. The prevalence of chronic obstructive pulmonary disease cases with P. aeruginosa in Germany alone is ~600,000 per year. Within the framework of this dissertation, computational comparative genomics experiments were conducted with a panel of 20 of the most abundant Pseudomonas aeruginosa strains. 15 of these strains were isolated from clinical cases, while the remaining 5 were strains without a known infection history isolated from the environment. This division was chosen to enable direct comparison of the pathogenic potential of clinical and environmental strains and identification of their possible characteristic differences. When designing the bioinformatics experiments and searching for an efficient visualization and automatic analysis platform for read alignment (mapping) data, it became evident that no adequate solution was available that included all required functionalities. On these grounds, the decision was made to define two main subjects for this dissertation. Besides the P. aeruginosa pan genome analysis, a novel read mapping visualization and analysis software was developed and published in the journal Bioinformatics. This software - ReadXplorer - is partly based upon a prototype, which was developed during a preceding master's thesis at the Center for Biotechnology of the Bielefeld University under the name VAMP. The software was developed into a comprehensive user-friendly platform augmented with several newly developed and implemented automatic bioinformatics read mapping analyses. Two examples of these are the transcription start site detection and the single nucleotide polymorphism detection. Moreover, new intuitive visualizations were added to the existent ones and existing visualizations were greatly enhanced. ReadXplorer is designed to support not only DNA-seq data as accrued in the P. aeruginosa experiments, but also any kind of standard read mapping data as obtained from RNA-seq or ChIP-seq experiments. The data management was designed to comply with the latest performance and efficiency needs emerging from the large next generation sequencing data sets. Finally, ReadXplorer was empowered to deal with eukaryotic read mapping data as well. Amongst other software, ReadXplorer was then used to analyze different comparative genomics aspects of P. aeruginosa and to draw conclusions regarding the development of their pathogenicity. The list of conducted experiments includes phylogeny and gene set determination, analysis of regions of genomic plasticity and identification of single nucleotide polymorphisms. The achieved results were published in the journal Environmental Biology

    Focus: A Graph Approach for Data-Mining and Domain-Specific Assembly of Next Generation Sequencing Data

    Get PDF
    Next Generation Sequencing (NGS) has emerged as a key technology leading to revolutionary breakthroughs in numerous biomedical research areas. These technologies produce millions to billions of short DNA reads that represent a small fraction of the original target DNA sequence. These short reads contain little information individually but are produced at a high coverage of the original sequence such that many reads overlap. Overlap relationships allow for the reads to be linearly ordered and merged by computational programs called assemblers into long stretches of contiguous sequence called contigs that can be used for research applications. Although the assembly of the reads produced by NGS remains a difficult task, it is the process of extracting useful knowledge from these relatively short sequences that has become one of the most exciting and challenging problems in Bioinformatics. The assembly of short reads is an aggregative process where critical information is lost as reads are merged into contigs. In addition, the assembly process is treated as a black box, with generic assembler tools that do not adapt to input data set characteristics. Finally, as NGS data throughput continues to increase, there is an increasing need for smart parallel assembler implementations. In this dissertation, a new assembly approach called Focus is proposed. Unlike previous assemblers, Focus relies on a novel hybrid graph constructed from multiple graphs at different levels of granularity to represent the assembly problem, facilitating information capture and dynamic adjustment to input data set characteristics. This work is composed of four specific aims: 1) The implementation of a robust assembly and analysis tool built on the hybrid graph platform 2) The development and application of graph mining to extract biologically relevant features in NGS data sets 3) The integration of domain specific knowledge to improve the assembly and analysis process. 4) The construction of smart parallel computing approaches, including the application of energy-aware computing for NGS assembly and knowledge integration to improve algorithm performance. In conclusion, this dissertation presents a complete parallel assembler called Focus that is capable of extracting biologically relevant features directly from its hybrid assembly graph

    Optimization of Recombination Methods and Expanding the Utility of Penicillin G Acylase

    Get PDF
    Protein engineering can be performed by combinatorial techniques (directed evolution) and data-driven methods using machine-learning algorithms. The main characteristic of directed evolution (DE) is the application of an effective and efficient screen or selection on a diverse mutant library. As it is important to have a diverse mutant library for the success of DE, we compared the performance of DNA-shuffling and recombination PCR on fluorescent proteins using sequence information as well as statistical methods. We found that the diversity of the libraries DNA-shuffling and recombination PCR generates were dependent on type of skew primers used and sensitive to nucleotide identity levels between genes. DNA-shuffling and recombination PCR produced libraries with different crossover tendencies, suggesting that the two protocols could be used in combination to produce better libraries. Data-driven protein engineering uses sequence, structure and function data along with analyzed empirical activity information to guide library design. Boolean Learning Support Vector Machines (BLSVM) to identify interacting residues in fluorescent proteins and the gene templates were modified to preserve interactions post recombination. By site-directed mutagenesis, recombination and expression experiments, we validated that BLSVM can be used to identify interacting residues and increase the fraction of active proteins in the library. As an extension to the above experiments, DE was applied on monomeric Red Fluorescent Proteins to improve its spectral characteristics and structure-guided protein engineering was performed on penicillin G acylase (PGA), an industrially relevant catalyst, to change its substrate specificity.Ph.D.Committee Chair: Bommarius, Andreas; Committee Member: Hu, Wei-Shou; Committee Member: Lee, Jay; Committee Member: Lutz, Stefan; Committee Member: Prausnitz, Mar

    Development of fluorogenic RNA aptamers for cellular imaging of RNA and genomic loci

    Get PDF
    In recent years, there has been an explosion of SELEX-evolved fluorescent RNA aptamers, such as Spinach, Broccoli, Corn and Mango. Fluorogenic RNA aptamers have sparked a lot of interest and hold great potential to enable background-free visualisation of RNA molecules in cellular environments. However, their application has been limited by relatively inefficient folding in vivo and fluorescent stability. Therefore, evolving new RNA aptamers with improved brightness and stability should better their use in cellular imaging. Three new Mango-based aptamers have recently been selected from the original Mango RNA SELEX pool using microfluidic- assisted in vitro compartmentalization and fluorescence-activated sorting. This thesis demonstrates the use of these new aptamer variants to image small non-coding RNAs (such as 5S rRNA, U6 snRNA and mgU2-47 scaRNA) in both fixed and live human cells with improved sensitivity and resolution. Upon expression the modified RNAs subcellular localisation pattern is conserved, as validated using immunofluoresence. Recent work with tandem Mango arrays shows increased sensitivity, which enables the visualization of single mRNA molecules in live and fixed cells. Furthermore, it is shown that the tandem Mango arrays don’t affect the expected localization of a cytoplasmic mRNA (β-actin) and the nuclear long non- coding RNA (NEAT-1). Furthermore, these RNA aptamers can also be used to label genomic loci via CRISPR/Cas9 mediated genome targeting with improved contrast. This allows for the targeting of short genomic repeats in a less invasive manner with regards to current methodologies. Taken together this data shows that new Mango aptamers are vastly improved for cellular imaging over previous RNA aptamers, and can in principle be incorporated into a wide range of coding and non-coding RNAs.Open Acces

    Mechanisms and design of Tc1/mariner transposons for genome engineering

    Get PDF
    Transposons are DNA segments that autonomously move within and between genomes across the tree of life. Tc1/mariners in particular have frequently crossed species boundaries in nature and provide powerful broad-host-range genetic vectors. Among them, the Sleeping Beauty (SB) transposon inserts DNA in vertebrate genomes with extraordinarily high efficiency, making it a prime genetic tool with applications expanding to gene therapy clinical trials. Nevertheless, the molecular principles of SB’s distinctive activity remain elusive, greatly hampering its further development. In the first part of this thesis, I investigated the molecular mechanisms of the SB transposon in comparison to Human mariner 1 (Hsmar1), a representative transposon of the same superfamily. Using biochemical and biophysical techniques together with fluorescence-based assays, I have characterized the initial steps of SB and Hsmar1 transposition and shown that the two transposons assemble their molecular machineries (or transpososomes) differently. By combining crystallographic data and SAXS-based modelling, I visualized the structural basis of these differences and explained how transpososome assembly is coupled to catalysis in the Hsmar1 transposon. Moreover, the data demonstrated that the unique assembly pathway of SB largely contributes to its exceptional efficiency and that it can be chemically modulated to control insertion rates in living cells. I have further reconstituted in vitro the ordered series of events comprising SB transposition, including transposon end binding, cleavage, and integration, and dissected previously unrevealed molecular features of the process. In the second part of my work, building on these mechanistic insights, I developed a novel SB transposase variant (hsSB) by employing a structure-based protein design approach. Using hsSB allowed for establishing a new genome engineering method based on the direct delivery of recombinant SB protein to cells. We showed that this new method, named SBprotAct, provides safer and more controlled genome modification of several cell types (including stem cells and human T cells), as compared to the state-of-art technology. This work sheds first light on the molecular determinants of SB transposition and its hyper-activity, providing a unique resource for the rational design of improved genome engineering platforms for research and medicine

    Spatiotemporal analysis of transcription dynamics

    Get PDF
    • …
    corecore