96 research outputs found

    NmeCas9 is an intrinsically high-fidelity genome-editing platform

    Get PDF
    BACKGROUND: The development of CRISPR genome editing has transformed biomedical research. Most applications reported thus far rely upon the Cas9 protein from Streptococcus pyogenes SF370 (SpyCas9). With many RNA guides, wildtype SpyCas9 can induce significant levels of unintended mutations at near-cognate sites, necessitating substantial efforts toward the development of strategies to minimize off-target activity. Although the genome-editing potential of thousands of other Cas9 orthologs remains largely untapped, it is not known how many will require similarly extensive engineering to achieve single-site accuracy within large genomes. In addition to its off-targeting propensity, SpyCas9 is encoded by a relatively large open reading frame, limiting its utility in applications that require size-restricted delivery strategies such as adeno-associated virus vectors. In contrast, some genome-editing-validated Cas9 orthologs are considerably smaller and therefore better suited for viral delivery. RESULTS: Here we show that wildtype NmeCas9, when programmed with guide sequences of the natural length of 24 nucleotides, exhibits a nearly complete absence of unintended editing in human cells, even when targeting sites that are prone to off-target activity with wildtype SpyCas9. We also validate at least six variant protospacer adjacent motifs (PAMs), in addition to the preferred consensus PAM (5\u27-N4GATT-3\u27), for NmeCas9 genome editing in human cells. CONCLUSIONS: Our results show that NmeCas9 is a naturally high-fidelity genome-editing enzyme and suggest that additional Cas9 orthologs may prove to exhibit similarly high accuracy, even without extensive engineering

    NmeCas9 is an intrinsically high-fidelity genome editing platform [preprint]

    Get PDF
    Background: The development of CRISPR genome editing has transformed biomedical research. Most applications reported thus far rely upon the Cas9 protein from Streptococcus pyogenes SF370 (SpyCas9). With many RNA guides, wild-type SpyCas9 can induce significant levels of unintended mutations at near-cognate sites, necessitating substantial efforts toward the development of strategies to minimize off-target activity. Although the genome-editing potential of thousands of other Cas9 orthologs remains largely untapped, it is not known how many will require similarly extensive engineering to achieve single-site accuracy within large (e.g. mammalian) genomes. In addition to its off-targeting propensity, SpyCas9 is encoded by a relatively large (~4.2 kb) open reading frame, limiting its utility in applications that require size-restricted delivery strategies such as adeno-associated virus vectors. In contrast, some genome-editing-validated Cas9 orthologs (e.g. from Staphylococcus aureus, Campylobacter jejuni, Geobacillus stearothermophilus and Neisseria meningitidis) are considerably smaller and therefore better suited for viral delivery. Results: Here we show that wild-type NmeCas9, when programmed with guide sequences of natural length (24 nucleotides), exhibits a nearly complete absence of unintended editing in human cells, even when targeting sites that are prone to off-target activity with wildtype SpyCas9. We also validate at least six variant protospacer adjacent motifs (PAMs), in addition to the preferred consensus PAM (5β€²-N4GATT-3β€²), for NmeCas9 genome editing in human cells. Conclusions: Our results show that NmeCas9 is a naturally high-fidelity genome editing enzyme and suggest that additional Cas9 orthologs may prove to exhibit similarly high accuracy, even without extensive engineering

    Enhanced Cas12a editing in mammalian cells and zebrafish

    Get PDF
    Type V CRISPR-Cas12a systems provide an alternate nuclease platform to Cas9, with potential advantages for specific genome editing applications. Here we describe improvements to the Cas12a system that facilitate efficient targeted mutagenesis in mammalian cells and zebrafish embryos. We show that engineered variants of Cas12a with two different nuclear localization sequences (NLS) on the C terminus provide increased editing efficiency in mammalian cells. Additionally, we find that pre-crRNAs comprising a full-length direct repeat (full-DR-crRNA) sequence with specific stem-loop G-C base substitutions exhibit increased editing efficiencies compared with the standard mature crRNA framework. Finally, we demonstrate in zebrafish embryos that the improved LbCas12a and FnoCas12a nucleases in combination with these modified crRNAs display high mutagenesis efficiencies and low toxicity when delivered as ribonucleoprotein complexes at high concentration. Together, these results define a set of enhanced Cas12a components with broad utility in vertebrate systems

    A modified bacterial one-hybrid system yields improved quantitative models of transcription factor specificity

    Get PDF
    We examine the use of high-throughput sequencing on binding sites recovered using a bacterial one-hybrid (B1H) system and find that improved models of transcription factor (TF) binding specificity can be obtained compared to standard methods of sequencing a small subset of the selected clones. We can obtain even more accurate binding models using a modified version of B1H selection method with constrained variation (CV-B1H). However, achieving these improved models using CV-B1H data required the development of a new method of analysisβ€”GRaMS (Growth Rate Modeling of Specificity)β€”that estimates bacterial growth rates as a function of the quality of the recognition sequence. We benchmark these different methods of motif discovery using Zif268, a well-characterized C2H2 zinc-finger TF on both a 28 bp randomized library for the standard B1H method and on 6 bp randomized library for the CV-B1H method for which 45 different experimental conditions were tested: five time points and three different IPTG and 3-AT concentrations. We find that GRaMS analysis is robust to the different experimental parameters whereas other analysis methods give widely varying results depending on the conditions of the experiment. Finally, we demonstrate that the CV-B1H assay can be performed in liquid media, which produces recognition models that are similar in quality to sequences recovered from selection on solid media

    Zinc finger protein-dependent and -independent contributions to the in vivo off-target activity of zinc finger nucleases

    Get PDF
    Zinc finger nucleases (ZFNs) facilitate tailor-made genomic modifications in vivo through the creation of targeted double-stranded breaks. They have been employed to modify the genomes of plants and animals, and cell-based therapies utilizing ZFNs are undergoing clinical trials. However, many ZFNs display dose-dependent toxicity presumably due to the generation of undesired double-stranded breaks at off-target sites. To evaluate the parameters influencing the functional specificity of ZFNs, we compared the in vivo activity of ZFN variants targeting the zebrafish kdrl locus, which display both high on-target activity and dose-dependent toxicity. We evaluated their functional specificity by assessing lesion frequency at 141 potential off-target sites using Illumina sequencing. Only a minority of these off-target sites accumulated lesions, where the thermodynamics of zinc finger–DNA recognition appear to be a defining feature of active sites. Surprisingly, we observed that both the specificity of the incorporated zinc fingers and the choice of the engineered nuclease domain could independently influence the fidelity of these ZFNs. The results of this study have implications for the assessment of likely off-target sites within a genome and point to both zinc finger-dependent and -independent characteristics that can be tailored to create ZFNs with greater precision

    An improved predictive recognition model for Cys2-His2 zinc finger proteins

    Get PDF
    Cys2-His2 zinc finger proteins (ZFPs) are the largest family of transcription factors in higher metazoans. They also represent the most diverse family with regards to the composition of their recognition sequences. Although there are a number of ZFPs with characterized DNA-binding preferences, the specificity of the vast majority of ZFPs is unknown and cannot be directly inferred by homology due to the diversity of recognition residues present within individual fingers. Given the large number of unique zinc fingers and assemblies present across eukaryotes, a comprehensive predictive recognition model that could accurately estimate the DNA-binding specificity of any ZFP based on its amino acid sequence would have great utility. Toward this goal, we have used the DNA-binding specificities of 678 two-finger modules from both natural and artificial sources to construct a random forest-based predictive model for ZFP recognition. We find that our recognition model outperforms previously described determinant-based recognition models for ZFPs, and can successfully estimate the specificity of naturally occurring ZFPs with previously defined specificities

    Genome-Wide Polymorphism and Comparative Analyses in the White-Tailed Deer (Odocoileus virginianus): A Model for Conservation Genomics

    Get PDF
    The white-tailed deer (Odocoileus virginianus) represents one of the most successful and widely distributed large mammal species within North America, yet very little nucleotide sequence information is available. We utilized massively parallel pyrosequencing of a reduced representation library (RRL) and a random shotgun library (RSL) to generate a complete mitochondrial genome sequence and identify a large number of putative single nucleotide polymorphisms (SNPs) distributed throughout the white-tailed deer nuclear and mitochondrial genomes. A SNP validation study designed to test specific classes of putative SNPs provides evidence for as many as 10,476 genome-wide SNPs in the current dataset. Based on cytogenetic evidence for homology between cow (Bos taurus) and white-tailed deer chromosomes, we demonstrate that a divergent genome may be used for estimating the relative distribution and density of de novo sequence contigs as well as putative SNPs for species without draft genome assemblies. Our approach demonstrates that bioinformatic tools developed for model or agriculturally important species may be leveraged to support next-generation research programs for species of biological, ecological and evolutionary importance. We also provide a functional annotation analysis for the de novo sequence contigs assembled from white-tailed deer pyrosequencing reads, a mitochondrial phylogeny involving 13,722 nucleotide positions for 10 unique species of Cervidae, and a median joining haplotype network as a putative representation of mitochondrial evolution in O. virginianus. The results of this study are expected to provide a detailed template enabling genome-wide sequence-based studies of threatened, endangered or conservationally important non-model organisms

    Culture Enriched Molecular Profiling of the Cystic Fibrosis Airway Microbiome

    Get PDF
    The microbiome of the respiratory tract, including the nasopharyngeal and oropharyngeal microbiota, is a dynamic community of microorganisms that is highly diverse. The cystic fibrosis (CF) airway microbiome refers to the polymicrobial communities present in the lower airways of CF patients. It is comprised of chronic opportunistic pathogens (such as Pseudomonas aeruginosa) and a variety of organisms derived mostly from the normal microbiota of the upper respiratory tract. The complexity of these communities has been inferred primarily from culture independent molecular profiling. As with most microbial communities it is generally assumed that most of the organisms present are not readily cultured. Our culture collection generated using more extensive cultivation approaches, reveals a more complex microbial community than that obtained by conventional CF culture methods. To directly evaluate the cultivability of the airway microbiome, we examined six samples in depth using culture-enriched molecular profiling which combines culture-based methods with the molecular profiling methods of terminal restriction fragment length polymorphisms and 16S rRNA gene sequencing. We demonstrate that combining culture-dependent and culture-independent approaches enhances the sensitivity of either approach alone. Our techniques were able to cultivate 43 of the 48 families detected by deep sequencing; the five families recovered solely by culture-independent approaches were all present at very low abundance (<0.002% total reads). 46% of the molecular signatures detected by culture from the six patients were only identified in an anaerobic environment, suggesting that a large proportion of the cultured airway community is composed of obligate anaerobes. Most significantly, using 20 growth conditions per specimen, half of which included anaerobic cultivation and extended incubation times we demonstrate that the majority of bacteria present can be cultured
    • …
    corecore