9 research outputs found

    An Introduction to the Analysis of Single-Cell RNA-Sequencing Data

    No full text
    The recent development of single-cell RNA sequencing has deepened our understanding of the cell as a functional unit, providing new insights based on gene expression profiles of hundreds to hundreds of thousands of individual cells, and revealing new populations of cells with distinct gene expression profiles previously hidden within analyses of gene expression performed on bulk cell populations. However, appropriate analysis and utilization of the massive amounts of data generated from single-cell RNA sequencing experiments are challenging and require an understanding of the experimental and computational pathways taken between preparation of input cells and output of interpretable data. In this review, we will discuss the basic principles of these new technologies, focusing on concepts important in the analysis of single-cell RNA-sequencing data. Specifically, we summarize approaches to quality-control measures for determination of which single cells to include for further examination, methods of data normalization and scaling to overcome the relatively inefficient capture rate of mRNA from each cell, and clustering and visualization algorithms used for dimensional reduction of the data to a two-dimensional plot. Keywords: single-cell gene expression, RNA sequencing, computational pipeline, microfluidics, drop-seq, sci-seq, principle component analysis, t-distributed stochastic neighbor embeddin

    Prediction and validation of hematopoietic stem and progenitor cell off-target editing in transplanted rhesus macaques

    No full text
    The programmable nuclease technology CRISPR-Cas9 has revolutionized gene editing in the last decade. Due to the risk of off-target editing, accurate and sensitive methods for off-target characterization are crucial prior to applying CRISPR-Cas9 therapeutically. Here, we utilized a rhesus macaque model to compare the predictive values of CIRCLE-seq, an in vitro off-target prediction method, with in silico prediction (ISP) based solely on genomic sequence comparisons. We use AmpliSeq HD error-corrected sequencing to validate offtarget sites predicted by CIRCLE-seq and ISP for a CD33 guide RNA (gRNA) with thousands of off-target sites predicted by ISP and CIRCLE-seq. We found poor correlation between the sites predicted by the two methods. When almost 500 sites predicted by each method were analyzed by error-corrected sequencing of hematopoietic cells following transplantation, 19 off-target sites revealed insertion or deletion mutations. Of these sites, 8 were predicted by both methods, 8 by CIRCLE-seq only, and 3 by ISP only. The levels of cells with these off-target edits exhibited no expansion or abnormal behavior in vivo in animals followed for up to 2 years. In addition, we utilized an unbiased method termed CAST-seq to search for translocations between the on-target site and off-target sites present in animals following transplantation, detecting one specific translocation that persisted in blood cells for at least 1 year following transplantation. In conclusion, neither CIRCLE-seq or ISP predicted all sites, and a combination of careful gRNA design, followed by screening for predicted off-target sites in target cells by multiple methods, may be required for optimizing safety of clinical development.N

    Protein Ontology: a controlled structured network of protein entities.

    No full text
    The Protein Ontology (PRO; http://proconsortium.org) formally defines protein entities and explicitly represents their major forms and interrelations. Protein entities represented in PRO corresponding to single amino acid chains are categorized by level of specificity into family, gene, sequence and modification metaclasses, and there is a separate metaclass for protein complexes. All metaclasses also have organism-specific derivatives. PRO complements established sequence databases such as UniProtKB, and interoperates with other biomedical and biological ontologies such as the Gene Ontology (GO). PRO relates to UniProtKB in that PRO\u27s organism-specific classes of proteins encoded by a specific gene correspond to entities documented in UniProtKB entries. PRO relates to the GO in that PRO\u27s representations of organism-specific protein complexes are subclasses of the organism-agnostic protein complex terms in the GO Cellular Component Ontology. The past few years have seen growth and changes to the PRO, as well as new points of access to the data and new applications of PRO in immunology and proteomics. Here we describe some of these developments. Nucleic Acids Res 2014 Jan 1; 42(1):D415-21

    A macaque clonal hematopoiesis model demonstrates expansion of TET2-disrupted clones and utility for testing interventions

    No full text
    Individuals with age-related clonal hematopoiesis (CH) are at greater risk for hematologic malignancies and cardiovascular diseases. However, predictive preclinical animal models to recapitulate the spectrum of human CH are lacking. Through error-corrected sequencing of 56 human CH/myeloid malignancy genes, we identified natural CH driver mutations in aged rhesus macaques matching genes somatically mutated in human CH, with DNMT3A mutations being the most frequent. A CH model in young adult macaques was generated via autologous transplantation of clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated protein 9-mediated gene-edited hematopoietic stem and progenitor cells (HSPCs), targeting the top human CH genes with loss-of-function (LOF) mutations. Long-term follow-up revealed reproducible and significant expansion of multiple HSPC clones with heterozygous TET2 LOF mutations, compared with minimal expansion of clones bearing other mutations. Although the blood counts of these CH macaques were normal, their bone marrows were hypercellular and myeloid-predominant. TET2-disrupted myeloid colony-forming units isolated from these animals showed a distinct hyperinflammatory gene expression profile compared with wild type. In addition, mature macrophages purified from the CH macaques showed elevated NLRP3 inflammasome activity and increased interleukin-1 beta (IL-1 beta) and IL-6 production. The model was used to test the impact of IL-6 blockage by tocilizumab, documenting a slowing of TET2-mutated expansion, suggesting that interruption of the IL-6 axis may remove the selective advantage of mutant HSPCs. These findings provide a model for examining the pathophysiology of CH and give insights into potential therapeutic interventions.N
    corecore