25,747 research outputs found

    BATCH-GE : batch analysis of next-generation sequencing data for genome editing assessment

    Get PDF
    Targeted mutagenesis by the CRISPR/Cas9 system is currently revolutionizing genetics. The ease of this technique has enabled genome engineering in-vitro and in a range of model organisms and has pushed experimental dimensions to unprecedented proportions. Due to its tremendous progress in terms of speed, read length, throughput and cost, Next-Generation Sequencing (NGS) has been increasingly used for the analysis of CRISPR/Cas9 genome editing experiments. However, the current tools for genome editing assessment lack flexibility and fall short in the analysis of large amounts of NGS data. Therefore, we designed BATCH-GE, an easy-to-use bioinformatics tool for batch analysis of NGS-generated genome editing data, available from https://github.com/WouterSteyaert/BATCH-GE.git. BATCH-GE detects and reports indel mutations and other precise genome editing events and calculates the corresponding mutagenesis efficiencies for a large number of samples in parallel. Furthermore, this new tool provides flexibility by allowing the user to adapt a number of input variables. The performance of BATCH-GE was evaluated in two genome editing experiments, aiming to generate knock-out and knock-in zebrafish mutants. This tool will not only contribute to the evaluation of CRISPR/Cas9-based experiments, but will be of use in any genome editing experiment and has the ability to analyze data from every organism with a sequenced genome

    Analysis of nucleosome positioning landscapes enables gene discovery in the human malaria parasite Plasmodium falciparum.

    Get PDF
    BackgroundPlasmodium falciparum, the deadliest malaria-causing parasite, has an extremely AT-rich (80.7 %) genome. Because of high AT-content, sequence-based annotation of genes and functional elements remains challenging. In order to better understand the regulatory network controlling gene expression in the parasite, a more complete genome annotation as well as analysis tools adapted for AT-rich genomes are needed. Recent studies on genome-wide nucleosome positioning in eukaryotes have shown that nucleosome landscapes exhibit regular characteristic patterns at the 5'- and 3'-end of protein and non-protein coding genes. In addition, nucleosome depleted regions can be found near transcription start sites. These unique nucleosome landscape patterns may be exploited for the identification of novel genes. In this paper, we propose a computational approach to discover novel putative genes based exclusively on nucleosome positioning data in the AT-rich genome of P. falciparum.ResultsUsing binary classifiers trained on nucleosome landscapes at the gene boundaries from two independent nucleosome positioning data sets, we were able to detect a total of 231 regions containing putative genes in the genome of Plasmodium falciparum, of which 67 highly confident genes were found in both data sets. Eighty-eight of these 231 newly predicted genes exhibited transcription signal in RNA-Seq data, indicative of active transcription. In addition, 20 out of 21 selected gene candidates were further validated by RT-PCR, and 28 out of the 231 genes showed significant matches using BLASTN against an expressed sequence tag (EST) database. Furthermore, 108 (47%) out of the 231 putative novel genes overlapped with previously identified but unannotated long non-coding RNAs. Collectively, these results provide experimental validation for 163 predicted genes (70.6%). Finally, 73 out of 231 genes were found to be potentially translated based on their signal in polysome-associated RNA-Seq representing transcripts that are actively being translated.ConclusionOur results clearly indicate that nucleosome positioning data contains sufficient information for novel gene discovery. As distinct nucleosome landscapes around genes are found in many other eukaryotic organisms, this methodology could be used to characterize the transcriptome of any organism, especially when coupled with other DNA-based gene finding and experimental methods (e.g., RNA-Seq)

    Understanding diversity of human innate immunity receptors: analysis of surface features of leucine-rich repeat domains in NLRs and TLRs.

    Get PDF
    BackgroundThe human innate immune system uses a system of extracellular Toll-like receptors (TLRs) and intracellular Nod-like receptors (NLRs) to match the appropriate level of immune response to the level of threat from the current environment. Almost all NLRs and TLRs have a domain consisting of multiple leucine-rich repeats (LRRs), which is believed to be involved in ligand binding. LRRs, found also in thousands of other proteins, form a well-defined "horseshoe"-shaped structural scaffold that can be used for a variety of functions, from binding specific ligands to performing a general structural role. The specific functional roles of LRR domains in NLRs and TLRs are thus defined by their detailed surface features. While experimental crystal structures of four human TLRs have been solved, no structure data are available for NLRs.ResultsWe report a quantitative, comparative analysis of the surface features of LRR domains in human NLRs and TLRs, using predicted three-dimensional structures for NLRs. Specifically, we calculated amino acid hydrophobicity, charge, and glycosylation distributions within LRR domain surfaces and assessed their similarity by clustering. Despite differences in structural and genomic organization, comparison of LRR surface features in NLRs and TLRs allowed us to hypothesize about their possible functional similarities. We find agreement between predicted surface similarities and similar functional roles in NLRs and TLRs with known agonists, and suggest possible binding partners for uncharacterized NLRs.ConclusionDespite its low resolution, our approach permits comparison of molecular surface features in the absence of crystal structure data. Our results illustrate diversity of surface features of innate immunity receptors and provide hints for function of NLRs whose specific role in innate immunity is yet unknown

    AGMIAL: implementing an annotation strategy for prokaryote genomes as a distributed system

    Get PDF
    We have implemented a genome annotation system for prokaryotes called AGMIAL. Our approach embodies a number of key principles. First, expert manual annotators are seen as a critical component of the overall system; user interfaces were cyclically refined to satisfy their needs. Second, the overall process should be orchestrated in terms of a global annotation strategy; this facilitates coordination between a team of annotators and automatic data analysis. Third, the annotation strategy should allow progressive and incremental annotation from a time when only a few draft contigs are available, to when a final finished assembly is produced. The overall architecture employed is modular and extensible, being based on the W3 standard Web services framework. Specialized modules interact with two independent core modules that are used to annotate, respectively, genomic and protein sequences. AGMIAL is currently being used by several INRA laboratories to analyze genomes of bacteria relevant to the food-processing industry, and is distributed under an open source license

    ΦCrAss001 represents the most abundant bacteriophage family in the human gut and infects Bacteroides intestinalis

    Get PDF
    peer-reviewedCrAssphages are an extensive and ubiquitous family of tailed bacteriophages, predicted to infect bacteria of the order Bacteroidales. Despite being found in ~50% of individuals and representing up to 90% of human gut viromes, members of this viral family have never been isolated in culture and remain understudied. Here, we report the isolation of a CrAssphage (ΦCrAss001) from human faecal material. This bacteriophage infects the human gut symbiont Bacteroides intestinalis, confirming previous in silico predictions of the likely host. DNA sequencing demonstrates that the bacteriophage genome is circular, 102 kb in size, and has unusual structural traits. In addition, electron microscopy confirms that ΦcrAss001 has a podovirus-like morphology. Despite the absence of obvious lysogeny genes, ΦcrAss001 replicates in a way that does not disrupt proliferation of the host bacterium, and is able to maintain itself in continuous host culture during several weeks

    Replication enhancer elements within the open reading frame of tick-borne encephalitis virus and their evolution within the Flavivirus genus

    Get PDF
    We provide experimental evidence of a replication enhancer element (REE) within the capsid gene of tick-borne encephalitis virus (TBEV, genus Flavivirus). Thermodynamic and phylogenetic analyses predicted that the REE folds as a long stable stem–loop (designated SL6), conserved among all tick-borne flaviviruses (TBFV). Homologous sequences and potential base pairing were found in the corresponding regions of mosquito-borne flaviviruses, but not in more genetically distant flaviviruses. To investigate the role of SL6, nucleotide substitutions were introduced which changed a conserved hexanucleotide motif, the conformation of the terminal loop and the base-paired dsRNA stacking. Substitutions were made within a TBEV reverse genetic system and recovered mutants were compared for plaque morphology, single-step replication kinetics and cytopathic effect. The greatest phenotypic changes were observed in mutants with a destabilized stem. Point mutations in the conserved hexanucleotide motif of the terminal loop caused moderate virus attenuation. However, all mutants eventually reached the titre of wild-type virus late post-infection. Thus, although not essential for growth in tissue culture, the SL6 REE acts to up-regulate virus replication. We hypothesize that this modulatory role may be important for TBEV survival in nature, where the virus circulates by non-viraemic transmission between infected and non-infected ticks, during co-feeding on local rodents

    Diverged composition and regulation of the Trypanosoma brucei origin recognition complex that mediates DNA replication initiation

    Get PDF
    Initiation of DNA replication depends upon recognition of genomic sites, termed origins, by AAA+ ATPases. In prokaryotes a single factor binds each origin, whereas in eukaryotes this role is played by a six-protein origin recognition complex (ORC). Why eukaryotes evolved a multisubunit initiator, and the roles of each component, remains unclear. In Trypanosoma brucei, an ancient unicellular eukaryote, only one ORC-related initiator, TbORC1/CDC6, has been identified by sequence homology. Here we show that three TbORC1/CDC6-interacting factors also act in T. brucei nuclear DNA replication and demonstrate that TbORC1/CDC6 interacts in a high molecular complex in which a diverged Orc4 homologue and one replicative helicase subunit can also be found. Analysing the subcellular localization of four TbORC1/CDC6-interacting factors during the cell cycle reveals that one factor, TbORC1B, is not a static constituent of ORC but displays S-phase restricted nuclear localization and expression, suggesting it positively regulates replication. This work shows that ORC architecture and regulation are diverged features of DNA replication initiation in T. brucei, providing new insight into this key stage of eukaryotic genome copying
    corecore