12 research outputs found

    PepMLM: Target Sequence-Conditioned Generation of Peptide Binders via Masked Language Modeling

    Full text link
    Target proteins that lack accessible binding pockets and conformational stability have posed increasing challenges for drug development. Induced proximity strategies, such as PROTACs and molecular glues, have thus gained attention as pharmacological alternatives, but still require small molecule docking at binding pockets for targeted protein degradation (TPD). The computational design of protein-based binders presents unique opportunities to access undruggable targets, but have often relied on stable 3D structures or predictions for effective binder generation. Recently, we have leveraged the expressive latent spaces of protein language models (pLMs) for the prioritization of peptide binders from sequence alone, which we have then fused to E3 ubiquitin ligase domains, creating a CRISPR-analogous TPD system for target proteins. However, our methods rely on training discriminator models for ranking heuristically or unconditionally-derived guide peptides for their target binding capability. In this work, we introduce PepMLM, a purely target sequence-conditioned de novo generator of linear peptide binders. By employing a novel masking strategy that uniquely positions cognate peptide sequences at the terminus of target protein sequences, PepMLM tasks the state-of-the-art ESM-2 pLM to fully reconstruct the binder region, achieving low perplexities matching or improving upon previously-validated peptide-protein sequence pairs. After successful in silico benchmarking with AlphaFold-Multimer, we experimentally verify PepMLM's efficacy via fusion of model-derived peptides to E3 ubiquitin ligase domains, demonstrating endogenous degradation of target substrates in cellular models. In total, PepMLM enables the generative design of candidate binders to any target protein, without the requirement of target structure, empowering downstream programmable proteome editing applications

    Expanding homogeneous culture of human primordial germ cell-like cells maintaining germline features without serum or feeder layers.

    Get PDF
    In vitro expansion of human primordial germ cell-like cells (hPGCLCs), a pluripotent stem cell-derived PGC model, has proved challenging due to rapid loss of primordial germ cell (PGC)-like identity and limited cell survival/proliferation. Here, we describe long-term culture hPGCLCs (LTC-hPGCLCs), which actively proliferate in a serum-free, feeder-free condition without apparent limit as highly homogeneous diploid cell populations maintaining transcriptomic and epigenomic characteristics of hPGCLCs. Histone proteomics confirmed reduced H3K9me2 and increased H3K27me3 marks in LTC-hPGCLCs compared with induced pluripotent stem cells (iPSCs). LTC-hPGCLCs established from multiple human iPSC clones of both sexes were telomerase positive, senescence-free cells readily passaged with minimal cell death or deviation from the PGC-like identity. LTC-hPGCLCs are capable of differentiating to DAZL-positive M-spermatogonia-like cells in the xenogeneic reconstituted testis (xrTestis) organ culture milieu as well as efficiently producing fully pluripotent embryonic germ cell-like cells in the presence of stem cell factor and fibroblast growth factor 2. Thus, LTC-hPGCLCs provide convenient access to unlimited amounts of high-quality and homogeneous hPGCLCs

    PAM-flexible genome editing with an engineered chimeric Cas9

    Get PDF
    CRISPR enzymes require a defined protospacer adjacent motif (PAM) flanking a guide RNA-programmed target site, limiting their sequence accessibility for robust genome editing applications. In this study, we recombine the PAM-interacting domain of SpRY, a broad-targeting Cas9 possessing an NRN > NYN (R = A or G, Y = C or T) PAM preference, with the N-terminus of Sc + +, a Cas9 with simultaneously broad, efficient, and accurate NNG editing capabilities, to generate a chimeric enzyme with highly flexible PAM preference: SpRYc. We demonstrate that SpRYc leverages properties of both enzymes to specifically edit diverse PAMs and disease-related loci for potential therapeutic applications. In total, the approaches to generate SpRYc, coupled with its robust flexibility, highlight the power of integrative protein design for Cas9 engineering and motivate downstream editing applications that require precise genomic positioning

    Novel clustered regularly interspaced short palindromic repeats-associated 9 platform with divergent targeting capabilities

    No full text
    Thesis: S.M., Massachusetts Institute of Technology, School of Architecture and Planning, Program in Media Arts and Sciences, 2018.Cataloged from PDF version of thesis.Includes bibliographical references (pages 47-49).RNA-guided DNA endonucleases of the CRISPR-Cas system are widely used for genome engineering and thus have numerous applications in a wide variety of fields. The range of sequences that CRISPR endonucleases can recognize, however, is constrained by the need for a specific protospacer adjacent motif (PAM) flanking the target site. In this thesis, we demonstrate the natural PAM plasticity of a highly-similar, yet previously uncharacterized, Cas9 from Streptococcus canis (ScCas9) through rational manipulation of distinguishing motif insertions. To this end, we report a divergent affinity to 5'-NNGT-3' PAM sequences, as well as preferences for expanded 5'-NNG- 3' motifs, and demonstrate the editing capabilities of the ortholog in both bacterial and human cells. We subsequently build an automated bioinformatics pipeline, the Search for PAMs by ALignment Of Targets (SPAMALOT), which further explores the microbial PAM diversity of otherwise-overlooked Streptococcus Cas9 orthologs. Our results establish that ScCas9 can be utilized both as an alternative genome editing tool and as a functional platform to discover novel Streptococcus PAM specificities. Finally, we develop original machine learning-based tools to both predict the efficacy of single guide RNA (sgRNA) sequences targeting specific loci, as well as to classify and characterize the recently-discovered anti-CRISPR proteins.by Pranam Chatterjee.S.M

    Minimal PAM specificity of a highly similar SpCas9 ortholog

    No full text
    RNA-guided DNA endonucleases of the CRISPR-Cas system are widely used for genome engineering and thus have numerous applications in a wide variety of fields. CRISPR endonucleases, however, require a specific protospacer adjacent motif (PAM) flanking the target site, thus constraining their targetable sequence space. In this study, we demonstrate the natural PAM plasticity of a highly similar, yet previously uncharacterized, Cas9 from Streptococcus canis (ScCas9) through rational manipulation of distinguishing motif insertions. To this end, we report affinity to minimal 5′-NNG-3′ PAM sequences and demonstrate the accurate editing capabilities of the ortholog in both bacterial and human cells. Last, we build an automated bioinformatics pipeline, the Search for PAMs by ALignment Of Targets (SPAMALOT), which further explores the microbial PAM diversity of otherwise overlooked Streptococcus Cas9 orthologs. Our results establish that ScCas9 can be used both as an alternative genome editing tool and as a functional platform to discover novel Streptococcus PAM specificities. ©2018 The Authors, some rights reserved

    Femtomolar detection of SARS-CoV-2 via peptide beacons integrated on a miniaturized TIRF microscope

    No full text
    The novel coronavirus SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) continues to pose a substantial global health threat. Along with vaccines and targeted therapeutics, there is a critical need for rapid diagnostic solutions. In this work, we use computational protein modeling tools to suggest molecular beacon architectures that function as conformational switches for high-sensitivity detection of the SARS-CoV-2 spike protein receptor binding domain (S-RBD). Integrating these beacons on a miniaturized total internal reflection fluorescence (mini-TIRF) microscope, we detect the S-RBD and pseudotyped SARS-CoV-2 with limits of detection in the femtomolar range. We envision that our designed mini-TIRF platform will serve as a robust platform for point-of-care diagnostics for SARS-CoV-2 and future emergent viral threats.</jats:p

    Targeted intracellular degradation of SARS-CoV-2 via computationally optimized peptide fusions

    No full text
    The COVID-19 pandemic, caused by the novel coronavirus SARS-CoV-2, has elicited a global health crisis of catastrophic proportions. With only a few vaccines approved for early or limited use, there is a critical need for effective antiviral strategies. In this study, we report a unique antiviral platform, through computational design of ACE2-derived peptides which both target the viral spike protein receptor binding domain (RBD) and recruit E3 ubiquitin ligases for subsequent intracellular degradation of SARS-CoV-2 in the proteasome. Our engineered peptide fusions demonstrate robust RBD degradation capabilities in human cells and are capable of inhibiting infection-competent viral production, thus prompting their further experimental characterization and therapeutic development
    corecore