Search CORE

5 research outputs found

Recommended from our members

Revealing the grammar of small RNA secretion using interpretable machine learning.

Author: Fish Lisa
Goodarzi Hani
Huh Doowon
Luo Lixi
Naghipourfar Mohsen
Navickas Albertas
Pouyabahar Delaram
Saberi Ali
Sharifi-Zarchi Ali
Zarezadeh Amirhossein
Zirak Bahar
Publication venue: eScholarship, University of California
Publication date: 10/04/2024
Field of study

Small non-coding RNAs can be secreted through a variety of mechanisms, including exosomal sorting, in small extracellular vesicles, and within lipoprotein complexes. However, the mechanisms that govern their sorting and secretion are not well understood. Here, we present ExoGRU, a machine learning model that predicts small RNA secretion probabilities from primary RNA sequences. We experimentally validated the performance of this model through ExoGRU-guided mutagenesis and synthetic RNA sequence analysis. Additionally, we used ExoGRU to reveal cis and trans factors that underlie small RNA secretion, including known and novel RNA-binding proteins (RBPs), e.g., YBX1, HNRNPA2B1, and RBM24. We also developed a novel technique called exoCLIP, which reveals the RNA interactome of RBPs within the cell-free space. Together, our results demonstrate the power of machine learning in revealing novel biological mechanisms. In addition to providing deeper insight into small RNA secretion, this knowledge can be leveraged in therapeutic and synthetic biology applications

eScholarship - University of California

Mapping single-cell data to reference atlases by transfer learning

Author: Avsec Žiga
Büttner Maren
Gayoso Adam
Interlandi Marta
Khajavi Matin
Lotfollahi Mohammad
Luecken Malte D
Misharin Alexander V
Naghipourfar Mohsen
Rybakov Sergei
Theis Fabian J
Wagenstetter Marco
Yosef Nir
Publication venue: eScholarship, University of California
Publication date: 01/01/2022
Field of study

Large single-cell atlases are now routinely generated to serve as references for analysis of smaller-scale studies. Yet learning from reference data is complicated by batch effects between datasets, limited availability of computational resources and sharing restrictions on raw data. Here we introduce a deep learning strategy for mapping query datasets on top of a reference called single-cell architectural surgery (scArches). scArches uses transfer learning and parameter optimization to enable efficient, decentralized, iterative reference building and contextualization of new datasets with existing references without sharing raw data. Using examples from mouse brain, pancreas, immune and whole-organism atlases, we show that scArches preserves biological state information while removing batch effects, despite using four orders of magnitude fewer parameters than de novo integration. scArches generalizes to multimodal reference mapping, allowing imputation of missing modalities. Finally, scArches retains coronavirus disease 2019 (COVID-19) disease variation when mapping to a healthy reference, enabling the discovery of disease-specific cell states. scArches will facilitate collaborative projects by enabling iterative construction, updating, sharing and efficient use of reference atlases

PubMed Central

eScholarship - University of California

Recommended from our members

Predicting cellular responses to complex perturbations in high‐throughput screens

Author: Boyeau Pierre
Daza Riza M
De Donno Carlo
Günnemann Stephan
Hetzel Leon
Ibarra Ignacio L
Ji Yuge
Lopez‐Paz David
Lotfollahi Mohammad
Martin Beth
McFaline‐Figueroa Jose L
Naghipourfar Mohsen
Shendure Jay
Srivatsan Sanjay R
Susmelj Anna Klimovskaia
Theis Fabian J
Trapnell Cole
Wolf F Alexander
Yakubova Nafissa
Publication venue: eScholarship, University of California
Publication date: 12/06/2023
Field of study

Recent advances in multiplexed single-cell transcriptomics experiments facilitate the high-throughput study of drug and genetic perturbations. However, an exhaustive exploration of the combinatorial perturbation space is experimentally unfeasible. Therefore, computational methods are needed to predict, interpret, and prioritize perturbations. Here, we present the compositional perturbation autoencoder (CPA), which combines the interpretability of linear models with the flexibility of deep-learning approaches for single-cell response modeling. CPA learns to in silico predict transcriptional perturbation response at the single-cell level for unseen dosages, cell types, time points, and species. Using newly generated single-cell drug combination data, we validate that CPA can predict unseen drug combinations while outperforming baseline models. Additionally, the architecture's modularity enables incorporating the chemical representation of the drugs, allowing the prediction of cellular response to completely unseen drugs. Furthermore, CPA is also applicable to genetic combinatorial screens. We demonstrate this by imputing in silico 5,329 missing combinations (97.6% of all possibilities) in a single-cell Perturb-seq experiment with diverse genetic interactions. We envision CPA will facilitate efficient experimental design and hypothesis generation by enabling in silico response prediction at the single-cell level and thus accelerate therapeutic applications using single-cell technologies

eScholarship - University of California

Predicting cellular responses to complex perturbations in high‐throughput screens

Author: Anna Klimovskaia Susmelj
Beth Martin
Carlo De Donno
Cole Trapnell
David Lopez‐Paz
F Alexander Wolf
Fabian J Theis
Ignacio L Ibarra
Jay Shendure
Jose L McFaline‐Figueroa
Leon Hetzel
Mohammad Lotfollahi
Mohsen Naghipourfar
Nafissa Yakubova
Pierre Boyeau
Riza M Daza
Sanjay R Srivatsan
Stephan Günnemann
Yuge Ji
Publication venue: Springer Nature
Publication date: 01/06/2023
Field of study

Abstract Recent advances in multiplexed single‐cell transcriptomics experiments facilitate the high‐throughput study of drug and genetic perturbations. However, an exhaustive exploration of the combinatorial perturbation space is experimentally unfeasible. Therefore, computational methods are needed to predict, interpret, and prioritize perturbations. Here, we present the compositional perturbation autoencoder (CPA), which combines the interpretability of linear models with the flexibility of deep‐learning approaches for single‐cell response modeling. CPA learns to in silico predict transcriptional perturbation response at the single‐cell level for unseen dosages, cell types, time points, and species. Using newly generated single‐cell drug combination data, we validate that CPA can predict unseen drug combinations while outperforming baseline models. Additionally, the architecture's modularity enables incorporating the chemical representation of the drugs, allowing the prediction of cellular response to completely unseen drugs. Furthermore, CPA is also applicable to genetic combinatorial screens. We demonstrate this by imputing in silico 5,329 missing combinations (97.6% of all possibilities) in a single‐cell Perturb‐seq experiment with diverse genetic interactions. We envision CPA will facilitate efficient experimental design and hypothesis generation by enabling in silico response prediction at the single‐cell level and thus accelerate therapeutic applications using single‐cell technologies

Directory of Open Access Journals