A computational framework to emulate the human perspective in flow cytometric data analysis

Author: AP Dempster
B Ellis
B Lindsay
BG Lindsay
BW Silverman
C Jarque
Christopher V. Rao
D Novo
D Sarkar
DJ Marchette
DR Parks
E Choy
E Lugli
F Hahne
G Finak
G Luta
G McLachlan
H Zare
J Li
J Trotter
JA Hartigan
JM Irish
JP Baudry
K Lo
L Herzenberg
LM Maier
MC Minnotte
MY Cheng
N Aghaeepour
PM Hartigan
R Scheuermann
R Tibshirani
RR Brinkman
S Pyne
S Ray
Saumyadipta Pyne
Surajit Ray
T Lin
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2012

Background: In recent years, intense research efforts have focused on developing methods for automated flow cytometric data analysis. However, while designing such applications, little or no attention has been paid to the human perspective that is absolutely central to the manual gating process of identifying and characterizing cell populations. In particular, the assumption of many common techniques that cell populations could be modeled reliably with pre-specified distributions may not hold true in real-life samples, which can have populations of arbitrary shapes and considerable inter-sample variation. Results: To address this, we developed a new framework, flowScape, for emulating certain key aspects of the human perspective in analyzing flow data, which we implemented in multiple steps. First, flowScape begins with creating a mathematically rigorous map of the high-dimensional flow data landscape based on dense and sparse regions defined by relative concentrations of events around modes. In the second step, these modal clusters are connected with a global hierarchical structure. This representation allows flowScape to perform ridgeline analysis for both traversing the landscape and isolating cell populations at different levels of resolution. Finally, we extended manual gating with a new capacity for constructing templates that can identify target populations in terms of their relative parameters, as opposed to the more commonly used absolute or physical parameters. This allows flowScape to apply such templates in batch mode for detecting the corresponding populations in a flexible, sample-specific manner. We also demonstrated different applications of our framework to flow data analysis and show its superiority over other analytical methods. Conclusions: The human perspective, built on top of intuition and experience, is a very important component of flow cytometric data analysis. By emulating some of its approaches and extending these with automation and rigor, flowScape provides a flexible and robust framework for computational cytomics.

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

Enlighten

Data reduction for spectral clustering to analyze high throughput flow cytometry data

Author: Brinkman Ryan R.
Gupta Arvind
Shooshtari Parisa
Zare Habil
Publication venue: Scholarship@Western
Publication date: 28/07/2010

Background: Recent biological discoveries have shown that clustering large datasets is essential for better understanding biology in many areas. Spectral clustering in particular has proven to be a powerful tool amenable to many applications. However, it cannot be directly applied to large datasets due to time and memory limitations. To address this issue, we have modified spectral clustering by adding an information-preserving sampling procedure and applying a post-processing stage. We call this entire algorithm SamSPECTRAL. Results: We tested our algorithm on flow cytometry data as an example of large, multidimensional data containing potentially hundreds of thousands of data points (i.e., events in flow cytometry, typically corresponding to cells). Compared to two state-of-the-art model-based flow cytometry clustering methods, SamSPECTRAL demonstrates significant advantages in proper identification of populations with non-elliptical shapes, low-density populations close to dense ones, minor subpopulations of a major population, and rare populations. Conclusions: This work is the first successful attempt to apply spectral methodology to flow cytometry data. An implementation of our algorithm as an R package is freely available through Bioconductor. © 2010 Zare et al; licensee BioMed Central Ltd
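
The abstract above describes reducing a large dataset by sampling before clustering, without giving the procedure. The sketch below illustrates one generic way such a reduction could work: a radius-based pass in which each chosen representative absorbs its nearby points. It is an assumption made for illustration, not SamSPECTRAL's published procedure or its R API; the function name `sample_representatives`, the `radius` parameter, and the term "community" are hypothetical.

```python
import math
import random


def sample_representatives(points, radius, seed=0):
    """Pick representatives so every point lies within `radius` of one.

    Returns a dict mapping each representative's index to the indices of
    the points it absorbed (its "community", a hypothetical name here).
    This is a generic illustration, not the published algorithm.
    """
    rng = random.Random(seed)
    order = list(range(len(points)))
    rng.shuffle(order)  # visit points in random order
    remaining = set(order)
    communities = {}
    for idx in order:
        if idx not in remaining:
            continue  # already absorbed by an earlier representative
        # idx becomes a representative and absorbs nearby unassigned points,
        # including itself (distance 0).
        group = [j for j in remaining
                 if math.dist(points[idx], points[j]) <= radius]
        communities[idx] = group
        remaining.difference_update(group)
    return communities


# Two tight pairs of points collapse to two representatives.
events = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0)]
reduced = sample_representatives(events, radius=1.0)
print(len(reduced), "representatives for", len(events), "events")
```

Under this sketch, a clustering step would run on the representatives only, and the community sizes could serve as weights so that dense regions are not under-represented after reduction.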

Scholarship@Western

Understanding Health and Disease with Multidimensional Single-Cell Methods

Author: Banavar Jayanth R.
Candia Julián
Losert Wolfgang
Publication date: 01/12/2013

Current efforts in the biomedical sciences and related interdisciplinary fields are focused on gaining a molecular understanding of health and disease, which is a problem of daunting complexity that spans many orders of magnitude in characteristic length scales, from small molecules that regulate cell function to cell ensembles that form tissues and organs working together as an organism. In order to uncover the molecular nature of the emergent properties of a cell, it is essential to measure multiple cell components simultaneously in the same cell. In turn, cell heterogeneity requires multiple cells to be measured in order to understand health and disease in the organism. This review summarizes current efforts towards a data-driven framework that leverages single-cell technologies to build robust signatures of healthy and diseased phenotypes. While some approaches focus on multicolor flow cytometry data and other methods are designed to analyze high-content image-based screens, we emphasize the so-called Supercell/SVM paradigm (recently developed by the authors of this review and collaborators) as a unified framework that captures mesoscopic-scale emergence to build reliable phenotypes. Beyond their specific contributions to basic and translational biomedical research, these efforts illustrate, from a larger perspective, the powerful synergy that might be achieved from bringing together methods and ideas from statistical physics, data mining, and mathematics to solve the most pressing problems currently facing the life sciences.

Comment: 25 pages, 7 figures; revised version with minor changes. To appear in J. Phys.: Cond. Mat

arXiv.org e-Print Archive

CONICET Digital

PubMed Central

Composite lymphoma of concurrent T zone lymphoma and large cell B cell lymphoma in a dog.

Author: Bienzle Dorothee
Darzentas Nikos
Deravi Nariman
Hwang Mei-Hua
Keller Stefan M
Matsuyama Arata
Richardson Danielle
Publication venue: eScholarship, University of California
Publication date: 01/11/2019

Background: Evolution of indolent to aggressive lymphoma has been described in dogs but is difficult to distinguish from the de novo development of a second, clonally distinct lymphoma. Differentiation of these scenarios can be aided by next generation sequencing (NGS)-based assessment of clonality of lymphocyte antigen receptor genes. Case presentation: An 8-year-old male intact Mastiff that presented with generalized lymphadenomegaly was diagnosed with nodal T zone lymphoma (TZL) based on cytology, histopathology, immunohistochemistry and flow cytometry. Thirteen months later, the dog re-presented with progressive lymphadenomegaly, and based on cytology and flow cytometry, a large B cell lymphoma (LBCL) was diagnosed. Sequencing-based clonality testing confirmed the de novo development of a LBCL and the persistence of a TZL. Conclusions: The occurrence of two distinct lymphoid neoplasms should be considered if patient features and tumor cytomorphology or immunophenotype differ among sequential samples. Sequencing-based clonality testing may provide conclusive evidence of two concurrent and distinct clonal lymphocyte populations, termed most appropriately "composite lymphoma".

eScholarship - University of California

Joint Modeling and Registration of Cell Populations in Cohorts of High-Dimensional Flow Cytometric Data

Author: Duong Tarn
Hafler David
Irish Jonathan
Lee Sharon
Levy Ronald
McLachlan Geoffrey J.
Mesirov Jill
Nazaire Marc-Danie
Ng Shu-Kay
Nolan Garry
Pyne Saumyadipta
Tamayo Pablo
Wang Kui
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 31/05/2013

In systems biomedicine, an experimenter encounters different potential sources of variation in data such as individual samples, multiple experimental conditions, and multi-variable network-level responses. In multiparametric cytometry, which is often used for analyzing patient samples, such issues are critical. While computational methods can identify cell populations in individual samples, without the ability to automatically match them across samples, it is difficult to compare and characterize the populations in typical experiments, such as those responding to various stimulations or distinctive of particular patients or time-points, especially when there are many samples. Joint Clustering and Matching (JCM) is a multi-level framework for simultaneous modeling and registration of populations across a cohort. JCM models every population with a robust multivariate probability distribution. Simultaneously, JCM fits a random-effects model to construct an overall batch template, used for registering populations across samples and classifying new samples. By tackling systems-level variation, JCM supports practical biomedical applications involving large cohorts.

arXiv.org e-Print Archive

Adelaide Research & Scholarship

Directory of Open Access Journals

PubMed Central

University of Queensland eSpace

FigShare

Long non-coding RNA profiling of human lymphoid progenitor cells reveals transcriptional divergence of B cell and T cell lineages.

Author: Casero David
Crooks Gay M
Ha Vi Luan
Luong Annie
Parekh Chintan
Sandoval Salemiz
Scholes Jessica
Seet Christopher S
Zhu Yuhua
Publication venue: eScholarship, University of California
Publication date: 01/12/2015

To elucidate the transcriptional 'landscape' that regulates human lymphoid commitment during postnatal life, we used RNA sequencing to assemble the long non-coding transcriptome across human bone marrow and thymic progenitor cells spanning the earliest stages of B lymphoid and T lymphoid specification. Over 3,000 genes encoding previously unknown long non-coding RNAs (lncRNAs) were revealed through the analysis of these rare populations. Lymphoid commitment was characterized by lncRNA expression patterns that were highly stage specific and were more lineage specific than those of protein-coding genes. Protein-coding genes co-expressed with neighboring lncRNA genes showed enrichment for ontologies related to lymphoid differentiation. The exquisite cell-type specificity of global lncRNA expression patterns independently revealed new developmental relationships among the earliest progenitor cells in the human bone marrow and thymus.

eScholarship - University of California

From Cellular Characteristics to Disease Diagnosis: Uncovering Phenotypes with Supercells

Author: Banavar Jayanth R.
Biancotto Angélique
Candia Julian Marcelo
Cao Kan
Dagur Pradeep
Driscoll Meghan
Losert Wolfgang
Maritan Amos
Maunu Ryan
McCoy Jr. J Philip
Nida Sen H.
Nussenblatt Robert B
Wei Lai
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013

Cell heterogeneity and the inherent complexity due to the interplay of multiple molecular processes within the cell pose difficult challenges for current single-cell biology. We introduce an approach that identifies a disease phenotype from multiparameter single-cell measurements, which is based on the concept of "supercell statistics", a single-cell-based averaging procedure followed by a machine learning classification scheme. We are able to assess the optimal tradeoff between the number of single cells averaged and the number of measurements needed to capture phenotypic differences between healthy and diseased patients, as well as between different diseases that are difficult to diagnose otherwise. We apply our approach to two kinds of single-cell datasets, addressing the diagnosis of a premature aging disorder using images of cell nuclei, as well as the phenotypes of two non-infectious uveitides (the ocular manifestations of Behçet's disease and sarcoidosis) based on multicolor flow cytometry. In the former case, one nuclear shape measurement taken over a group of 30 cells is sufficient to classify samples as healthy or diseased, in agreement with usual laboratory practice. In the latter, our method is able to identify a minimal set of 5 markers that accurately predict Behçet's disease and sarcoidosis. This is the first time that a quantitative phenotypic distinction between these two diseases has been achieved. To obtain this clear phenotypic signature, about one hundred CD8+ T cells need to be measured. Although the molecular markers identified have been reported to be important players in autoimmune disorders, this is the first report pointing out that CD8+ T cells can be used to distinguish two systemic inflammatory diseases. Beyond these specific cases, the approach proposed here is applicable to datasets generated by other kinds of state-of-the-art and forthcoming single-cell technologies, such as multidimensional mass cytometry, single-cell gene expression, and single-cell full genome sequencing techniques.

Affiliation: Candia, Julian Marcelo. University of Maryland; United States. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata. Instituto de Física de Líquidos y Sistemas Biológicos. Universidad Nacional de La Plata. Facultad de Ciencias Exactas. Instituto de Física de Líquidos y Sistemas Biológicos; Argentina
Affiliation: Maunu, Ryan. University of Maryland; United States
Affiliation: Driscoll, Meghan. University of Maryland; United States
Affiliation: Biancotto, Angélique. National Institutes of Health; United States
Affiliation: Dagur, Pradeep. National Institutes of Health; United States
Affiliation: McCoy Jr., J Philip. National Institutes of Health; United States
Affiliation: Nida Sen, H. National Institutes of Health; United States
Affiliation: Wei, Lai. National Institutes of Health; United States
Affiliation: Maritan, Amos. Università di Padova; Italy
Affiliation: Cao, Kan. University of Maryland; United States
Affiliation: Nussenblatt, Robert B. National Institutes of Health; United States
Affiliation: Banavar, Jayanth R. University of Maryland; United States
Affiliation: Losert, Wolfgang. University of Maryland; United States
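
The abstract for this entry describes "supercell statistics" as averaging over groups of single cells before classification. A minimal sketch of the averaging step, under stated assumptions, might look like the following. The function name `make_supercells`, its parameters, and the choice to draw each group at random without replacement are illustrative assumptions rather than the authors' implementation; the classification step is not shown.

```python
import random
from statistics import mean


def make_supercells(cells, group_size, n_supercells, seed=0):
    """Average randomly drawn groups of single-cell measurement vectors.

    `cells` is a list of equal-length lists, one per cell, holding that
    cell's measurements. Each returned "supercell" is the per-measurement
    mean over `group_size` cells sampled without replacement (an assumed
    sampling scheme, chosen here for illustration).
    """
    if group_size > len(cells):
        raise ValueError("group_size cannot exceed the number of cells")
    rng = random.Random(seed)
    supercells = []
    for _ in range(n_supercells):
        group = rng.sample(cells, group_size)
        # zip(*group) iterates measurement by measurement across the group.
        supercells.append([mean(column) for column in zip(*group)])
    return supercells


# Hypothetical two-marker measurements for four cells.
cells = [[1.0, 10.0], [2.0, 12.0], [3.0, 11.0], [4.0, 9.0]]
for supercell in make_supercells(cells, group_size=3, n_supercells=2):
    print(supercell)
```

Averaging over more cells reduces cell-to-cell noise in each supercell at the cost of needing more cells per sample, which is the tradeoff the abstract says the approach assesses. The resulting supercell vectors would then be passed to a classifier of the reader's choice.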

arXiv.org e-Print Archive

CONICET Digital

Directory of Open Access Journals

Dryad Digital Repository (Duke University)

PubMed Central

Electronic Archiving System

FigShare

Mammary molecular portraits reveal lineage-specific features and progenitor cell vulnerabilities.

Author: Abe
Akalin
Alison E. Casey
Ankit Sinha
Asselin-Labat
Buenrostro
Cardiff
Cerami
Cheryl Arrowsmith
Chlebowski
Cox
Cox
Dalia Barsyte-Lovejoy
Daniel De Carvalho
Deugnier
Dos Santos
Edgar
Eirew
Eirew
Eisen
Erik Drysdale
Gary Bader
Gascard
Genevieve Deblois
Gu
Hal Berman
Heinz
Hennighausen
Herschkowitz
Hu
Hui Fang
Huston
Hyeyeon Kim
Ignatchenko
Jackson
Jennifer Cruickshank
Joshi
Joshi
Joshi
Julie Livingstone
Kaltenborn
Kauff
Kelsey
Kendrick
Kiechl
Kislinger
Koboldt
Kotsopoulos
Krueger
Kucera
Labarge
Li
Lim
Lim
Lin
Loenen
Lucas
Lydon
Marotti
Maruyama
Mathieu Lupien
McLean
Meissner
Merico
Michailidou
Michalak
Mohammed
Molyneux
Mona Shehata
Nguyen
Pal
Pathania
Paul C. Boutros
Paul Waterhouse
Pei
Pellacani
Pirashaanthy Tharmapalan
Rajat Singhania
Rama Khokha
Reimand
Rios
Rios
Rugg-Gunn
Ruth Isserlin
Schimanski
Shackleton
Shannon
Shehata
Shiah
Shu
Sigl
Smith
Smyth
Stefan Hofer
Stefan Knapp
Stingl
Storey
Stunnenberg
Subramanian
Swneke Bailey
Thomas Kislinger
Tiago Medina
Tomasetti
van Amerongen
Van Keymeulen
Van Keymeulen
Visvader
Wang
Wojtowicz
Wuidart
Yu-Jia Shiah
Zhang
Publication venue: eScholarship, University of California
Publication date: 01/08/2018

The mammary epithelium depends on specific lineages and their stem and progenitor function to accommodate hormone-triggered physiological demands in the adult female. Perturbations of these lineages underpin breast cancer risk, yet our understanding of normal mammary cell composition is incomplete. Here, we build a multimodal resource for the adult gland through comprehensive profiling of primary cell epigenomes, transcriptomes, and proteomes. We define systems-level relationships between chromatin-DNA-RNA-protein states, identify lineage-specific DNA methylation of transcription factor binding sites, and pinpoint proteins underlying progesterone responsiveness. Comparative proteomics of estrogen and progesterone receptor-positive and -negative cell populations, extensive target validation, and drug testing lead to discovery of stem and progenitor cell vulnerabilities. Top epigenetic drugs exert cytostatic effects; prevent adult mammary cell expansion, clonogenicity, and mammopoiesis; and deplete stem cell frequency. Select drugs also abrogate human breast progenitor cell activity in normal and high-risk patient samples. This integrative computational and functional study provides fundamental insight into mammary lineage and stem cell biology.

Crossref

eScholarship - University of California