Search CORE

38 research outputs found

Recommended from our members

Transcriptomic evidence that von Economo neurons are regionally specialized extratelencephalic-projecting excitatory neurons.

Author: Aevermann Brian D
Bakken Trygve E
Barkan Eliza R
Berkowitz-Cerasano Madeline L
Bernard Amy
Cobbs Charles
Diez-Fuertes Francisco
Ding Song-Lin
Hodge Rebecca D
Kalmbach Brian E
Koch Christof
Lasken Roger S
Lein Ed S
McCorrison Jamison
Miller Jeremy A
Novotny Mark
Phillips John W
Scheuermann Richard H
Schork Nicholas J
Shehata Soraya I
Smith Kimberly A
Steemers Frank J
Sunkin Susan M
Ting Jonathan T
Tran Danny N
Venepally Pratap
Yanny Anna Marie
Publication venue: Providence St. Joseph Health Digital Commons
Publication date: 01/01/2020
Field of study

von Economo neurons (VENs) are bipolar, spindle-shaped neurons restricted to layer 5 of human frontoinsula and anterior cingulate cortex that appear to be selectively vulnerable to neuropsychiatric and neurodegenerative diseases, although little is known about other VEN cellular phenotypes. Single nucleus RNA-sequencing of frontoinsula layer 5 identifies a transcriptomically-defined cell cluster that contained VENs, but also fork cells and a subset of pyramidal neurons. Cross-species alignment of this cell cluster with a well-annotated mouse classification shows strong homology to extratelencephalic (ET) excitatory neurons that project to subcerebral targets. This cluster also shows strong homology to a putative ET cluster in human temporal cortex, but with a strikingly specific regional signature. Together these results suggest that VENs are a regionally distinctive type of ET neuron. Additionally, we describe the first patch clamp recordings of VENs from neurosurgically-resected tissue that show distinctive intrinsic membrane properties relative to neighboring pyramidal neurons

eScholarship - University of California

Providence St. Joseph Health Digital Commons

A comprehensive collection of systems biology data characterizing the host response to viral infection

The Systems Biology for Infectious Diseases Research program was established by the U.S. National Institute of Allergy and Infectious Diseases to investigate host-pathogen interactions at a systems level. This program generated 47 transcriptomic and proteomic datasets from 30 studies that investigate in vivo and in vitro host responses to viral infections. Human pathogens in the Orthomyxoviridae and Coronaviridae families, especially pandemic H1N1 and avian H5N1 influenza A viruses and severe acute respiratory syndrome coronavirus (SARS-CoV), were investigated. Study validation was demonstrated via experimental quality control measures and meta-analysis of independent experiments performed under similar conditions. Primary assay results are archived at the GEO and PeptideAtlas public repositories, while processed statistical results together with standardized metadata are publically available at the Influenza Research Database (www.fludb.org) and the Virus Pathogen Resource (www.viprbrc.org). By comparing data from mutant versus wild-type virus and host strains, RNA versus protein differential expression, and infection with genetically similar strains, these data can be used to further investigate genetic and physiological determinants of host responses to viral infection

Carolina Digital Repository

Comparative cellular analysis of motor cortex in human, marmoset and mouse

Author: Aevermann Brian D
Aldridge Andrew I
Ament Seth A
Bakken Trygve E
Bartlett Anna
Behrens M Margarita
Bertagnolli Darren
Bravo Hector Corrada
Casper Tamara
Castanon Rosa G
Chun Jerold
Crichton Kirsten
Crow Megan
Daigle Tanya L
Dalley Rachel
Dee Nick
Dembrow Nikolai
Diep Dinh
Ding Song-Lin
Dobin Alexander
Dong Weixiu
Ecker Joseph R
Eggermont Jeroen
Fang Rongxin
Feng Guoping
Fischer Stephan
Gillis Jesse
Goldman Melissa
Goldy Jeff
Graybuck Lucas T
Hawrylycz Michael
Herb Brian R
Hertzano Ronna
Hodge Rebecca D
Hof Patrick R
Horwitz Gregory D
Hou Xiaomeng
Hu Qiwen
Höllt Thomas
Jorstad Nikolas L
Kalmbach Brian E
Kancherla Jayaram
Keene C Dirk
Kharchenko Peter V
Ko Andrew L
Koch Christof
Krienen Fenna M
Kroll Matthew
Lake Blue B
Lathia Kanan
Lein Ed S
Lelieveldt Boudewijn P
Li Yang Eric
Linnarsson Sten
Liu Christine S
Liu Hanqing
Lucero Jacinta D
Luo Chongyuan
Macosko Evan Z
Mahurkar Anup
McCarroll Steven A
McMillen Delissa
Miller Jeremy A
Moussa Marmar
Mukamel Eran A
Nery Joseph R
Nicovich Philip R
Niu Sheng-Yong
Orvis Joshua
Osteen Julia K
Owen Scott
Palmer Carter R
Pham Thanh
Pinto-Duarte António
Plongthongkum Nongluk
Poirion Olivier
Preissl Sebastian
Reed Nora M
Regev Aviv
Ren Bing
Rimorin Christine
Rivkin Angeline
Romanow William J
Scheuermann Richard H
Sedeño-Cortés Adriana E
Siletti Kimberly
Smith Kimberly
Somasundaram Saroja
Sorensen Staci A
Spain William J
Sulc Josef
Tasic Bosiljka
Tian Wei
Tieu Michael
Ting Jonathan T
Torkelson Amy
Tung Herman
van Lew Baldur
Wang Xinxin
White Owen R
Xie Fangming
Yanny Anna Marie
Yao Zizhen
Zeng Hongkui
Zhang Kun
Zhang Renee
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/10/2021
Field of study

The primary motor cortex (M1) is essential for voluntary fine-motor control and is functionally conserved across mammals1. Here, using high-throughput transcriptomic and epigenomic profiling of more than 450,000 single nuclei in humans, marmoset monkeys and mice, we demonstrate a broadly conserved cellular makeup of this region, with similarities that mirror evolutionary distance and are consistent between the transcriptome and epigenome. The core conserved molecular identities of neuronal and non-neuronal cell types allow us to generate a cross-species consensus classification of cell types, and to infer conserved properties of cell types across species. Despite the overall conservation, however, many species-dependent specializations are apparent, including differences in cell-type proportions, gene expression, DNA methylation and chromatin state. Few cell-type marker genes are conserved across species, revealing a short list of candidate genes and regulatory mechanisms that are responsible for conserved features of homologous cell types, such as the GABAergic chandelier cells. This consensus transcriptomic classification allows us to use patch-seq (a combination of whole-cell patch-clamp recordings, RNA sequencing and morphological characterization) to identify corticospinal Betz cells from layer 5 in non-human primates and humans, and to characterize their highly specialized physiology and anatomy. These findings highlight the robust molecular underpinnings of cell-type diversity in M1 across mammals, and point to the genes and regulatory pathways responsible for the functional identity of cell types and their species-specific adaptations

DSpace@MIT

Cold Spring Harbor Laboratory Institutional Repository

eScholarship - University of California

Machine Learning-Based Single Cell and Integrative Analysis Reveals That Baseline mDC Predisposition Correlates With Hepatitis B Vaccine Antibody Response.

Author: Aevermann Brian D,
Publication venue
Publication date: 26/06/2023
Field of study

Ezid

Cell type discovery using single-cell transcriptomics: implications for ontological representation

Author: Aevermann Brian D,
Publication venue
Publication date: 23/05/2023
Field of study

Ezid

Machine learning for cell type classification from single nucleus RNA sequencing data

Author: Aevermann Brian D
Carrillo Daniel
Le Huy
Peng Beverly
Scheuermann Richard H
Uy Janelle
Zhang Yun
Publication venue: eScholarship, University of California
Publication date: 01/01/2022
Field of study

With the advent of single cell/nucleus RNA sequencing (sc/snRNA-seq), the field of cell phenotyping is now a data-driven exercise providing statistical evidence to support cell type/state categorization. However, the task of classifying cells into specific, well-defined categories with the empirical data provided by sc/snRNA-seq remains nontrivial due to the difficulty in determining specific differences between related cell types with close transcriptional similarities, resulting in challenges with matching cell types identified in separate experiments. To investigate possible approaches to overcome these obstacles, we explored the use of supervised machine learning methods-logistic regression, support vector machines, random forests, neural networks, and light gradient boosting machine (LightGBM)-as approaches to classify cell types using snRNA-seq datasets from human brain middle temporal gyrus (MTG) and human kidney. Classification accuracy was evaluated using an F-beta score weighted in favor of precision to account for technical artifacts of gene expression dropout. We examined the impact of hyperparameter optimization and feature selection methods on F-beta score performance. We found that the best performing model for granular cell type classification in both datasets is a multinomial logistic regression classifier and that an effective feature selection step was the most influential factor in optimizing the performance of the machine learning pipelines

PubMed Central

eScholarship - University of California

FR-Match: robust matching of cell type clusters from single cell RNA sequencing data using the Friedman–Rafsky non-parametric test

Author: Aevermann Brian D
Bakken Trygve E
Hodge Rebecca D
Lein Ed S
Miller Jeremy A
Scheuermann Richard H
Zhang Yun
Publication venue: eScholarship, University of California
Publication date: 30/11/2020
Field of study

Single cell/nucleus RNA sequencing (scRNAseq) is emerging as an essential tool to unravel the phenotypic heterogeneity of cells in complex biological systems. While computational methods for scRNAseq cell type clustering have advanced, the ability to integrate datasets to identify common and novel cell types across experiments remains a challenge. Here, we introduce a cluster-to-cluster cell type matching method-FR-Match-that utilizes supervised feature selection for dimensionality reduction and incorporates shared information among cells to determine whether two cell type clusters share the same underlying multivariate gene expression distribution. FR-Match is benchmarked with existing cell-to-cell and cell-to-cluster cell type matching methods using both simulated and real scRNAseq data. FR-Match proved to be a stringent method that produced fewer erroneous matches of distinct cell subtypes and had the unique ability to identify novel cell phenotypes in new datasets. In silico validation demonstrated that the proposed workflow is the only self-contained algorithm that was robust to increasing numbers of true negatives (i.e. non-represented cell types). FR-Match was applied to two human brain scRNAseq datasets sampled from cortical layer 1 and full thickness middle temporal gyrus. When mapping cell types identified in specimens isolated from these overlapping human brain regions, FR-Match precisely recapitulated the laminar characteristics of matched cell type clusters, reflecting their distinct neuroanatomical distributions. An R package and Shiny application are provided at https://github.com/JCVenterInstitute/FRmatch for users to interactively explore and match scRNAseq cell type clusters with complementary visualization tools

PubMed Central

eScholarship - University of California

Recommended from our members

FastMix: a versatile data integration pipeline for cell type-specific biomarker inference.

Author: Aevermann Brian D
Kollmann Tobias R
Mandava Aishwarya
Qian Yu
Qiu Xing
Scheuermann Richard H
Sun Hao
Zhang Yun
Publication venue: eScholarship, University of California
Publication date: 14/10/2022
Field of study

MotivationFlow cytometry (FCM) and transcription profiling are the two widely used assays in translational immunology research. However, there is no data integration pipeline for analyzing these two types of assays together with experiment variables for biomarker inference. Current FCM data analysis mainly relies on subjective manual gating analysis, which is difficult to be directly integrated with other automated computational methods. Existing deconvolutional analysis of bulk transcriptomics relies on predefined marker genes in the transcriptomics data, which are unavailable for novel cell types and does not utilize the FCM data that provide canonical phenotypic definitions of the cell types.ResultsWe developed a novel analytics pipeline-FastMix-for computational immunology, which integrates flow cytometry, bulk transcriptomics and clinical covariates for identifying cell type-specific gene expression signatures and biomarker genes. FastMix addresses the 'large p, small n' problem in the gene expression and flow cytometry integration analysis via a linear mixed effects model (LMER) for both cross-sectional and longitudinal studies. Its novel moment-based estimator not only reduces bias in parameter estimation but also is more efficient than iterative optimization. The FastMix pipeline also includes a cutting-edge flow cytometry data analysis method-DAFi-for identifying cell populations of interest and their characteristics. Simulation studies showed that FastMix produced smaller type I/II errors than competing methods. Validation using real data of two vaccine studies showed that FastMix identified a consistent set of signature genes as in independent single-cell RNA-seq analysis, producing additional interesting findings.Availability and implementationSource code of FastMix is publicly available at https://github.com/terrysun0302/FastMix.Supplementary informationSupplementary data are available at Bioinformatics online

eScholarship - University of California

The genome and preliminary single-nuclei transcriptome of Lemna minuta reveals mechanisms of invasiveness

Author: Abramson Bradley W
Aevermann Brian D
Colt Kelly
Hartwick Nolan T
Michael Todd P
Novotny Mark
Scheuermann Richard H
Publication venue: eScholarship, University of California
Publication date: 06/12/2021
Field of study

The ability to trace every cell in some model organisms has led to the fundamental understanding of development and cellular function. However, in plants the complexity of cell number, organ size, and developmental time makes this a challenge even in the diminutive model plant Arabidopsis (Arabidopsis thaliana). Duckweed, basal nongrass aquatic monocots, provide an opportunity to follow every cell of an entire plant due to their small size, reduced body plan, and fast clonal growth habit. Here we present a chromosome-resolved genome for the highly invasive Lesser Duckweed (Lemna minuta) and generate a preliminary cell atlas leveraging low cell coverage single-nuclei sequencing. We resolved the 360 megabase genome into 21 chromosomes, revealing a core nonredundant gene set with only the ancient tau whole-genome duplication shared with all monocots, and paralog expansion as a result of tandem duplications related to phytoremediation. Leveraging SMARTseq2 single-nuclei sequencing, which provided higher gene coverage yet lower cell count, we profiled 269 nuclei covering 36.9% (8,457) of the L. minuta transcriptome. Since molecular validation was not possible in this nonmodel plant, we leveraged gene orthology with model organism single-cell expression datasets, gene ontology, and cell trajectory analysis to define putative cell types. We found that the tissue that we computationally defined as mesophyll expressed high levels of elemental transport genes consistent with this tissue playing a role in L. minuta wastewater detoxification. The L. minuta genome and preliminary cell map provide a paradigm to decipher developmental genes and pathways for an entire plant

PubMed Central

eScholarship - University of California

Overview of the machine learning pipeline.

Author: Beverly Peng (13851096)
Brian D. Aevermann (11624230)
Daniel Carrillo (4107643)
Huy Le (9856877)
Janelle Uy (13851099)
Richard H. Scheuermann (9704042)
Yun Zhang (131894)
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 23/09/2022
Field of study

A count matrix undergoes pre-processing, including normalization and filtering. The data is randomly split into training (60%), validation (20%), and test (20%) sets independently for each cell type. The training sets are used to train the models. The validation set provides an initial test for accuracy of the trained models and is used to adjust the model’s hyperparameters. Once the hyperparameters are optimized, the test set is run through each model and the F-beta score distribution across all clusters is used for model comparison.</p

FigShare