17 research outputs found
Comparative cellular analysis of motor cortex in human, marmoset and mouse
The primary motor cortex (M1) is essential for voluntary fine-motor control and is functionally conserved across mammals1. Here, using high-throughput transcriptomic and epigenomic profiling of more than 450,000 single nuclei in humans, marmoset monkeys and mice, we demonstrate a broadly conserved cellular makeup of this region, with similarities that mirror evolutionary distance and are consistent between the transcriptome and epigenome. The core conserved molecular identities of neuronal and non-neuronal cell types allow us to generate a cross-species consensus classification of cell types, and to infer conserved properties of cell types across species. Despite the overall conservation, however, many species-dependent specializations are apparent, including differences in cell-type proportions, gene expression, DNA methylation and chromatin state. Few cell-type marker genes are conserved across species, revealing a short list of candidate genes and regulatory mechanisms that are responsible for conserved features of homologous cell types, such as the GABAergic chandelier cells. This consensus transcriptomic classification allows us to use patch-seq (a combination of whole-cell patch-clamp recordings, RNA sequencing and morphological characterization) to identify corticospinal Betz cells from layer 5 in non-human primates and humans, and to characterize their highly specialized physiology and anatomy. These findings highlight the robust molecular underpinnings of cell-type diversity in M1 across mammals, and point to the genes and regulatory pathways responsible for the functional identity of cell types and their species-specific adaptations
A multimodal cell census and atlas of the mammalian primary motor cortex
ABSTRACT We report the generation of a multimodal cell census and atlas of the mammalian primary motor cortex (MOp or M1) as the initial product of the BRAIN Initiative Cell Census Network (BICCN). This was achieved by coordinated large-scale analyses of single-cell transcriptomes, chromatin accessibility, DNA methylomes, spatially resolved single-cell transcriptomes, morphological and electrophysiological properties, and cellular resolution input-output mapping, integrated through cross-modal computational analysis. Together, our results advance the collective knowledge and understanding of brain cell type organization: First, our study reveals a unified molecular genetic landscape of cortical cell types that congruently integrates their transcriptome, open chromatin and DNA methylation maps. Second, cross-species analysis achieves a unified taxonomy of transcriptomic types and their hierarchical organization that are conserved from mouse to marmoset and human. Third, cross-modal analysis provides compelling evidence for the epigenomic, transcriptomic, and gene regulatory basis of neuronal phenotypes such as their physiological and anatomical properties, demonstrating the biological validity and genomic underpinning of neuron types and subtypes. Fourth, in situ single-cell transcriptomics provides a spatially-resolved cell type atlas of the motor cortex. Fifth, integrated transcriptomic, epigenomic and anatomical analyses reveal the correspondence between neural circuits and transcriptomic cell types. We further present an extensive genetic toolset for targeting and fate mapping glutamatergic projection neuron types toward linking their developmental trajectory to their circuit function. Together, our results establish a unified and mechanistic framework of neuronal cell type organization that integrates multi-layered molecular genetic and spatial information with multi-faceted phenotypic properties
Computer vision segmentation model—deep learning for categorizing microplastic debris
The characterization of beached and marine microplastic debris is critical to understanding how plastic litter accumulates across the world’s oceans and identifying hotspots that should be targeted for early cleanup efforts. Currently, the most common monitoring method to quantify microplastics at sea requires physical sampling using surface trawling and sifting for beached microplastics, which are then followed by manual counting and laboratory analysis. The need for manual counting is time-consuming, operator-dependent, and incurs high costs, thereby preventing scalable deployment of consistent marine plastic monitoring worldwide. Here, we describe a workflow combining a simple experimental setup with advanced image processing techniques to conduct both quantitative and qualitative assessments of microplastic (0.05 cm < particle size <0.5 cm). The image processing relies on deep learning models designed for image segmentation and classification. The results demonstrated comparable or superior performance in comparison to manual identification for microplastic particles with a 96% accuracy. Thus, the use of the model offers an efficient, more robust, standardized, highly replicable, and less labor-intensive alternative to particle counting. In addition to the relative simplicity of the network architecture used that made it easy to train, the model presents promising prospects for better-standardized reporting of plastic particles surveyed in the environment. We also made the models and datasets open-source and created a user-friendly web interface for directly annotating new images
DeepProg: an ensemble of deep-learning and machine-learning models for prognosis prediction using multi-omics data
Abstract
Multi-omics data are good resources for prognosis and survival prediction; however, these are difficult to integrate computationally. We introduce DeepProg, a novel ensemble framework of deep-learning and machine-learning approaches that robustly predicts patient survival subtypes using multi-omics data. It identifies two optimal survival subtypes in most cancers and yields significantly better risk-stratification than other multi-omics integration methods. DeepProg is highly predictive, exemplified by two liver cancer (C-index 0.73–0.80) and five breast cancer datasets (C-index 0.68–0.73). Pan-cancer analysis associates common genomic signatures in poor survival subtypes with extracellular matrix modeling, immune deregulation, and mitosis processes. DeepProg is freely available at
https://github.com/lanagarmire/DeepProghttp://deepblue.lib.umich.edu/bitstream/2027.42/173896/1/13073_2021_Article_930.pd
Recommended from our members
A single-cell atlas of chromatin accessibility in the human genome
Current catalogs of regulatory sequences in the human genome are still incomplete and lack cell type resolution. To profile the activity of gene regulatory elements in diverse cell types and tissues in the human body, we applied single-cell chromatin accessibility assays to 30 adult human tissue types from multiple donors. We integrated these datasets with previous single-cell chromatin accessibility data from 15 fetal tissue types to reveal the status of open chromatin for ∼1.2 million candidate cis-regulatory elements (cCREs) in 222 distinct cell types comprised of >1.3 million nuclei. We used these chromatin accessibility maps to delineate cell-type-specificity of fetal and adult human cCREs and to systematically interpret the noncoding variants associated with complex human traits and diseases. This rich resource provides a foundation for the analysis of gene regulatory programs in human cell types across tissues, life stages, and organ systems
Recommended from our members
Human-gained heart enhancers are associated with species-specific cardiac attributes
The heart, a vital organ which is first to develop, has adapted its size, structure and function in order to accommodate the circulatory demands for a broad range of animals. Although heart development is controlled by a relatively conserved network of transcriptional/chromatin regulators, how the human heart has evolved species-specific features to maintain adequate cardiac output and function remains to be defined. Here, we show through comparative epigenomic analysis the identification of enhancers and promoters that have gained activity in humans during cardiogenesis. These cis-regulatory elements (CREs) are associated with genes involved in heart development and function, and may account for species-specific differences between human and mouse hearts. Supporting these findings, genetic variants that are associated with human cardiac phenotypic/disease traits, particularly those differing between human and mouse, are enriched in human-gained CREs. During early stages of human cardiogenesis, these CREs are also gained within genomic loci of transcriptional regulators, potentially expanding their role in human heart development. In particular, we discovered that gained enhancers in the locus of the early human developmental regulator ZIC3 are selectively accessible within a subpopulation of mesoderm cells which exhibits cardiogenic potential, thus possibly extending the function of ZIC3 beyond its conserved left-right asymmetry role. Genetic deletion of these enhancers identified a human gained enhancer that was required for not only ZIC3 and early cardiac gene expression at the mesoderm stage but also cardiomyocyte differentiation. Overall, our results illuminate how human gained CREs may contribute to human-specific cardiac attributes, and provide insight into how transcriptional regulators may gain cardiac developmental roles through the evolutionary acquisition of enhancers
Recommended from our members
Cardiac cell type-specific gene regulatory programs and disease risk association.
Misregulated gene expression in human hearts can result in cardiovascular diseases that are leading causes of mortality worldwide. However, the limited information on the genomic location of candidate cis-regulatory elements (cCREs) such as enhancers and promoters in distinct cardiac cell types has restricted the understanding of these diseases. Here, we defined >287,000 cCREs in the four chambers of the human heart at single-cell resolution, which revealed cCREs and candidate transcription factors associated with cardiac cell types in a region-dependent manner and during heart failure. We further found cardiovascular disease-associated genetic variants enriched within these cCREs including 38 candidate causal atrial fibrillation variants localized to cardiomyocyte cCREs. Additional functional studies revealed that two of these variants affect a cCRE controlling KCNH2/HERG expression and action potential repolarization. Overall, this atlas of human cardiac cCREs provides the foundation for illuminating cell type-specific gene regulation in human hearts during health and disease
Recommended from our members
Cardiac cell type-specific gene regulatory programs and disease risk association.
Misregulated gene expression in human hearts can result in cardiovascular diseases that are leading causes of mortality worldwide. However, the limited information on the genomic location of candidate cis-regulatory elements (cCREs) such as enhancers and promoters in distinct cardiac cell types has restricted the understanding of these diseases. Here, we defined >287,000 cCREs in the four chambers of the human heart at single-cell resolution, which revealed cCREs and candidate transcription factors associated with cardiac cell types in a region-dependent manner and during heart failure. We further found cardiovascular disease-associated genetic variants enriched within these cCREs including 38 candidate causal atrial fibrillation variants localized to cardiomyocyte cCREs. Additional functional studies revealed that two of these variants affect a cCRE controlling KCNH2/HERG expression and action potential repolarization. Overall, this atlas of human cardiac cCREs provides the foundation for illuminating cell type-specific gene regulation in human hearts during health and disease
The MUC5B-associated variant rs35705950 resides within an enhancer subject to lineage- and disease-dependent epigenetic remodeling
The G/T transversion rs35705950, located approximately 3 kb upstream of the MUC5B start site, is the cardinal risk factor for idiopathic pulmonary fibrosis (IPF). Here, we investigate the function and chromatin structure of this –3 kb region and provide evidence that it functions as a classically defined enhancer subject to epigenetic programming. We use nascent transcript analysis to show that RNA polymerase II loads within 10 bp of the G/T transversion site, definitively establishing enhancer function for the region. By integrating Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) analysis of fresh and cultured human airway epithelial cells with nuclease sensitivity data, we demonstrate that this region is in accessible chromatin that affects the expression of MUC5B. Through applying paired single-nucleus RNA- and ATAC-seq to frozen tissue from IPF lungs, we extend these findings directly to disease, with results indicating that epigenetic programming of the –3 kb enhancer in IPF occurs in both MUC5B-expressing and nonexpressing lineages. In aggregate, our results indicate that the MUC5B-associated variant rs35705950 resides within an enhancer that is subject to epigenetic remodeling and contributes to pathologic misexpression in IPF
Robust enhancer-gene regulation identified by single-cell transcriptomes and epigenomes.
Single-cell sequencing could help to solve the fundamental challenge of linking millions of cell-type-specific enhancers with their target genes. However, this task is confounded by patterns of gene co-expression in much the same way that genetic correlation due to linkage disequilibrium confounds fine-mapping in genome-wide association studies (GWAS). We developed a non-parametric permutation-based procedure to establish stringent statistical criteria to control the risk of false-positive associations in enhancer-gene association studies (EGAS). We applied our procedure to large-scale transcriptome and epigenome data from multiple tissues and species, including the mouse and human brain, to predict enhancer-gene associations genome wide. We tested the functional validity of our predictions by comparing them with chromatin conformation data and causal enhancer perturbation experiments. Our study shows how controlling for gene co-expression enables robust enhancer-gene linkage using single-cell sequencing data