12 research outputs found
PepMLM: Target Sequence-Conditioned Generation of Peptide Binders via Masked Language Modeling
Target proteins that lack accessible binding pockets and conformational
stability have posed increasing challenges for drug development. Induced
proximity strategies, such as PROTACs and molecular glues, have thus gained
attention as pharmacological alternatives, but still require small molecule
docking at binding pockets for targeted protein degradation (TPD). The
computational design of protein-based binders presents unique opportunities to
access undruggable targets, but have often relied on stable 3D structures or
predictions for effective binder generation. Recently, we have leveraged the
expressive latent spaces of protein language models (pLMs) for the
prioritization of peptide binders from sequence alone, which we have then fused
to E3 ubiquitin ligase domains, creating a CRISPR-analogous TPD system for
target proteins. However, our methods rely on training discriminator models for
ranking heuristically or unconditionally-derived guide peptides for their
target binding capability. In this work, we introduce PepMLM, a purely target
sequence-conditioned de novo generator of linear peptide binders. By employing
a novel masking strategy that uniquely positions cognate peptide sequences at
the terminus of target protein sequences, PepMLM tasks the state-of-the-art
ESM-2 pLM to fully reconstruct the binder region, achieving low perplexities
matching or improving upon previously-validated peptide-protein sequence pairs.
After successful in silico benchmarking with AlphaFold-Multimer, we
experimentally verify PepMLM's efficacy via fusion of model-derived peptides to
E3 ubiquitin ligase domains, demonstrating endogenous degradation of target
substrates in cellular models. In total, PepMLM enables the generative design
of candidate binders to any target protein, without the requirement of target
structure, empowering downstream programmable proteome editing applications
Expanding homogeneous culture of human primordial germ cell-like cells maintaining germline features without serum or feeder layers.
In vitro expansion of human primordial germ cell-like cells (hPGCLCs), a pluripotent stem cell-derived PGC model, has proved challenging due to rapid loss of primordial germ cell (PGC)-like identity and limited cell survival/proliferation. Here, we describe long-term culture hPGCLCs (LTC-hPGCLCs), which actively proliferate in a serum-free, feeder-free condition without apparent limit as highly homogeneous diploid cell populations maintaining transcriptomic and epigenomic characteristics of hPGCLCs. Histone proteomics confirmed reduced H3K9me2 and increased H3K27me3 marks in LTC-hPGCLCs compared with induced pluripotent stem cells (iPSCs). LTC-hPGCLCs established from multiple human iPSC clones of both sexes were telomerase positive, senescence-free cells readily passaged with minimal cell death or deviation from the PGC-like identity. LTC-hPGCLCs are capable of differentiating to DAZL-positive M-spermatogonia-like cells in the xenogeneic reconstituted testis (xrTestis) organ culture milieu as well as efficiently producing fully pluripotent embryonic germ cell-like cells in the presence of stem cell factor and fibroblast growth factor 2. Thus, LTC-hPGCLCs provide convenient access to unlimited amounts of high-quality and homogeneous hPGCLCs
Recommended from our members
PD-1 alters T-cell metabolic reprogramming by inhibiting glycolysis and promoting lipolysis and fatty acid oxidation
During activation, T cells undergo metabolic reprogramming, which imprints distinct functional fates. We determined that on PD-1 ligation, activated T cells are unable to engage in glycolysis or amino acid metabolism but have an increased rate of fatty acid β-oxidation (FAO). PD-1 promotes FAO of endogenous lipids by increasing expression of CPT1A, and inducing lipolysis as indicated by elevation of the lipase ATGL, the lipolysis marker glycerol and release of fatty acids. Conversely, CTLA-4 inhibits glycolysis without augmenting FAO, suggesting that CTLA-4 sustains the metabolic profile of non-activated cells. Because T cells utilize glycolysis during differentiation to effectors, our findings reveal a metabolic mechanism responsible for PD-1-mediated blockade of T-effector cell differentiation. The enhancement of FAO provides a mechanistic explanation for the longevity of T cells receiving PD-1 signals in patients with chronic infections and cancer, and for their capacity to be reinvigorated by PD-1 blockade
PAM-flexible genome editing with an engineered chimeric Cas9
CRISPR enzymes require a defined protospacer adjacent motif (PAM) flanking a guide RNA-programmed target site, limiting their sequence accessibility for robust genome editing applications. In this study, we recombine the PAM-interacting domain of SpRY, a broad-targeting Cas9 possessing an NRN > NYN (R = A or G, Y = C or T) PAM preference, with the N-terminus of Sc + +, a Cas9 with simultaneously broad, efficient, and accurate NNG editing capabilities, to generate a chimeric enzyme with highly flexible PAM preference: SpRYc. We demonstrate that SpRYc leverages properties of both enzymes to specifically edit diverse PAMs and disease-related loci for potential therapeutic applications. In total, the approaches to generate SpRYc, coupled with its robust flexibility, highlight the power of integrative protein design for Cas9 engineering and motivate downstream editing applications that require precise genomic positioning
Novel clustered regularly interspaced short palindromic repeats-associated 9 platform with divergent targeting capabilities
Thesis: S.M., Massachusetts Institute of Technology, School of Architecture and Planning, Program in Media Arts and Sciences, 2018.Cataloged from PDF version of thesis.Includes bibliographical references (pages 47-49).RNA-guided DNA endonucleases of the CRISPR-Cas system are widely used for genome engineering and thus have numerous applications in a wide variety of fields. The range of sequences that CRISPR endonucleases can recognize, however, is constrained by the need for a specific protospacer adjacent motif (PAM) flanking the target site. In this thesis, we demonstrate the natural PAM plasticity of a highly-similar, yet previously uncharacterized, Cas9 from Streptococcus canis (ScCas9) through rational manipulation of distinguishing motif insertions. To this end, we report a divergent affinity to 5'-NNGT-3' PAM sequences, as well as preferences for expanded 5'-NNG- 3' motifs, and demonstrate the editing capabilities of the ortholog in both bacterial and human cells. We subsequently build an automated bioinformatics pipeline, the Search for PAMs by ALignment Of Targets (SPAMALOT), which further explores the microbial PAM diversity of otherwise-overlooked Streptococcus Cas9 orthologs. Our results establish that ScCas9 can be utilized both as an alternative genome editing tool and as a functional platform to discover novel Streptococcus PAM specificities. Finally, we develop original machine learning-based tools to both predict the efficacy of single guide RNA (sgRNA) sequences targeting specific loci, as well as to classify and characterize the recently-discovered anti-CRISPR proteins.by Pranam Chatterjee.S.M
MassIVE MSV000093088 - SaLT&PepPr is an Interface-Predicting Language Model for Designing Peptide-Guided Protein Degraders
Minimal PAM specificity of a highly similar SpCas9 ortholog
RNA-guided DNA endonucleases of the CRISPR-Cas system are widely used for genome engineering and thus have numerous applications in a wide variety of fields. CRISPR endonucleases, however, require a specific protospacer adjacent motif (PAM) flanking the target site, thus constraining their targetable sequence space. In this study, we demonstrate the natural PAM plasticity of a highly similar, yet previously uncharacterized, Cas9 from Streptococcus canis (ScCas9) through rational manipulation of distinguishing motif insertions. To this end, we report affinity to minimal 5′-NNG-3′ PAM sequences and demonstrate the accurate editing capabilities of the ortholog in both bacterial and human cells. Last, we build an automated bioinformatics pipeline, the Search for PAMs by ALignment Of Targets (SPAMALOT), which further explores the microbial PAM diversity of otherwise overlooked Streptococcus Cas9 orthologs. Our results establish that ScCas9 can be used both as an alternative genome editing tool and as a functional platform to discover novel Streptococcus PAM specificities. ©2018 The Authors, some rights reserved
Distinct Roles Of PD-1 Itsm and ITIM In Regulating Interactions With SHP-2, ZAP-70 and Lck, and PD-1-Mediated Inhibitory Function
Femtomolar detection of SARS-CoV-2 via peptide beacons integrated on a miniaturized TIRF microscope
The novel coronavirus SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) continues to pose a substantial global health threat. Along with vaccines and targeted therapeutics, there is a critical need for rapid diagnostic solutions. In this work, we use computational protein modeling tools to suggest molecular beacon architectures that function as conformational switches for high-sensitivity detection of the SARS-CoV-2 spike protein receptor binding domain (S-RBD). Integrating these beacons on a miniaturized total internal reflection fluorescence (mini-TIRF) microscope, we detect the S-RBD and pseudotyped SARS-CoV-2 with limits of detection in the femtomolar range. We envision that our designed mini-TIRF platform will serve as a robust platform for point-of-care diagnostics for SARS-CoV-2 and future emergent viral threats.</jats:p
Targeted intracellular degradation of SARS-CoV-2 via computationally optimized peptide fusions
The COVID-19 pandemic, caused by the novel coronavirus SARS-CoV-2, has elicited a global health crisis of catastrophic proportions. With only a few vaccines approved for early or limited use, there is a critical need for effective antiviral strategies. In this study, we report a unique antiviral platform, through computational design of ACE2-derived peptides which both target the viral spike protein receptor binding domain (RBD) and recruit E3 ubiquitin ligases for subsequent intracellular degradation of SARS-CoV-2 in the proteasome. Our engineered peptide fusions demonstrate robust RBD degradation capabilities in human cells and are capable of inhibiting infection-competent viral production, thus prompting their further experimental characterization and therapeutic development