631 research outputs found

Integrated Genomic Analysis Identifies Clinically Relevant Subtypes of Glioblastoma Characterized by Abnormalities in PDGFRA, IDH1, EGFR, and NF1

Author: Mesirov Jill P.
Publication venue: 'Elsevier BV'
Publication date: 01/09/2009

The Cancer Genome Atlas Network recently cataloged recurrent genomic abnormalities in glioblastoma multiforme (GBM). We describe a robust gene expression-based molecular classification of GBM into Proneural, Neural, Classical, and Mesenchymal subtypes and integrate multidimensional genomic data to establish patterns of somatic mutations and DNA copy number. Aberrations and gene expression of EGFR, NF1, and PDGFRA/IDH1 each define the Classical, Mesenchymal, and Proneural subtypes, respectively. Gene signatures of normal brain cell types show a strong relationship between subtypes and different neural lineages. Additionally, response to aggressive therapy differs by subtype, with the greatest benefit in the Classical subtype and no benefit in the Proneural subtype. We provide a framework that unifies transcriptomic and genomic dimensions for GBM molecular stratification with important implications for future studies.

DSpace@MIT

Continued fraction expansions of rational expressions with irreducible denominators in characteristic 2

Author: Mesirov Jill P.
Sweet Melvin M.
Publication venue: Published by Elsevier Inc.
Publication date: 31/10/1987

Given any irreducible polynomial $q$ of degree $n$ over the field with two elements, there is a sequence of polynomials $p_n, p_{n-1}, \ldots, p_0$ with $p_n = q$, with $p_0 = 1$, with the degree of $p_i$ equal to $i$, and with $p_i \equiv p_{i-2} \pmod{p_{i-1}}$. In other words, given an irreducible $q$ there is a $p$, relatively prime to $q$, with degree one less and such that the degrees of the remainders in Euclid's Algorithm for the greatest common divisor of $p$ and $q$ go down by exactly 1 at each step.
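The statement above can be checked by brute force for small degrees. The following is a minimal sketch, not the authors' construction: it assumes polynomials over the two-element field are encoded as integer bitmasks (bit k is the coefficient of x^k), and the helper names `poly_mod`, `euclid_degrees`, and `find_partner` are hypothetical.

```python
def degree(p):
    """Degree of a GF(2) polynomial stored as an int bitmask (degree(0) is -1)."""
    return p.bit_length() - 1


def poly_mod(a, b):
    """Remainder of a divided by b over GF(2); b must be nonzero."""
    db = degree(b)
    while a and degree(a) >= db:
        # Subtracting (XOR) a shifted copy of b cancels the leading term of a.
        a ^= b << (degree(a) - db)
    return a


def euclid_degrees(q, p):
    """Degrees of q, p, and each nonzero remainder produced by Euclid's algorithm."""
    degs = [degree(q)]
    while p:
        degs.append(degree(p))
        q, p = p, poly_mod(q, p)
    return degs


def find_partner(q):
    """Exhaustively search for p of degree n-1 whose remainder degrees drop by exactly 1."""
    n = degree(q)
    target = list(range(n, -1, -1))
    for p in range(1 << (n - 1), 1 << n):
        if euclid_degrees(q, p) == target:
            return p
    return None


if __name__ == "__main__":
    q = 0b1011  # x^3 + x + 1, irreducible over GF(2)
    p = find_partner(q)
    print(bin(p), euclid_degrees(q, p))
```

The exhaustive search is only practical for small n; it illustrates the existence claim rather than how such a p is constructed.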

Elsevier - Publisher Connector

Cytoscape: the network visualization tool for GenomeSpace workflows.

Author: Demchak Barry
Hull Tim
Ideker Trey
Liefeld Ted
Mesirov Jill P
Reich Michael
Smoot Michael
Publication venue: eScholarship, University of California
Publication date: 01/01/2014

Modern genomic analysis often requires workflows incorporating multiple best-of-breed tools. GenomeSpace is a web-based visual workbench that combines a selection of these tools with mechanisms that create data flows between them. One such tool is Cytoscape 3, a popular application that enables analysis and visualization of graph-oriented genomic networks. As Cytoscape runs on the desktop, and not in a web browser, integrating it into GenomeSpace required special care in creating a seamless user experience and enabling appropriate data flows. In this paper, we present the design and operation of the Cytoscape GenomeSpace app, which accomplishes this integration, thereby providing critical analysis and visualization functionality for GenomeSpace users. It has been downloaded over 850 times since the release of its first version in September 2013.

PubMed Central

eScholarship - University of California

Sashimi plots: Quantitative visualization of RNA sequencing read alignments

Author: Airoldi Edoardo M.
Burge Christopher B.
Katz Yarden
Mesirov Jill P.
Schwartz Schraga
Silterra Jacob
Wang Eric T.
Wong Bang
Publication venue: 'Oxford University Press (OUP)'
Publication date: 14/06/2013

We introduce Sashimi plots, a quantitative multi-sample visualization of mRNA sequencing reads aligned to gene annotations. Sashimi plots are made using alignments (stored in the SAM/BAM format) and gene model annotations (in GFF format), which can be custom-made by the user or obtained from databases such as Ensembl or UCSC. We describe two implementations of Sashimi plots: (1) a stand-alone command line implementation aimed at making customizable publication quality figures, and (2) an implementation built into the Integrative Genomics Viewer (IGV) browser, which enables rapid and dynamic creation of Sashimi plots for any genomic region of interest, suitable for exploratory analysis of alternatively spliced regions of the transcriptome. Isoform expression estimates output by the MISO program can optionally be plotted along with Sashimi plots. Sashimi plots can be used to quickly screen differentially spliced exons along genomic regions of interest and can be used in publication quality figures. The Sashimi plot software and documentation are available from: http://genes.mit.edu/burgelab/miso/docs/sashimi.html

Comment: 2 figures

arXiv.org e-Print Archive

CiteSeerX

The Limitations of Simple Gene Set Enrichment Analysis Assuming Gene Independence

Author: Liberzon Arthur
Mesirov Jill P.
Steinhardt George
Tamayo Pablo
Publication venue: 'Elsevier BV'
Publication date: 30/04/2012

Since its first publication in 2003, the Gene Set Enrichment Analysis (GSEA) method, based on the Kolmogorov-Smirnov statistic, has been heavily used, modified, and also questioned. Recently a simplified approach, using a one-sample t-test score to assess enrichment and ignoring gene-gene correlations, was proposed by Irizarry et al. (2009) as a serious contender. The argument criticizes GSEA's nonparametric nature and its use of an empirical null distribution as unnecessary and hard to compute. We refute these claims by careful consideration of the assumptions of the simplified method and its results, including a comparison with GSEA's on a large benchmark set of 50 datasets. Our results provide strong empirical evidence that gene-gene correlations cannot be ignored, due to the significant variance inflation they produce on the enrichment scores, and should be taken into account when estimating gene set enrichment significance. In addition, we discuss the challenges that the complex correlation structure and multi-modality of gene sets pose more generally for gene set enrichment methods.

Comment: Submitted to Statistical Methods in Medical Research
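The variance inflation mentioned in this abstract can be illustrated with a small exact calculation. This is a sketch under stated assumptions, not the paper's analysis: it assumes a gene set of m unit-variance scores with a single common pairwise correlation rho (an equicorrelation model chosen purely for illustration), and the function names are hypothetical. Under that model the variance of the set mean is (1 + (m - 1) * rho) / m, compared with 1 / m under independence.

```python
def variance_of_mean(cov):
    """Variance of the average of m scores, given their m x m covariance matrix."""
    m = len(cov)
    return sum(sum(row) for row in cov) / (m * m)


def equicorrelated_cov(m, rho):
    """Covariance matrix with unit variances and one common correlation rho (assumed model)."""
    return [[1.0 if i == j else rho for j in range(m)] for i in range(m)]


def inflation_factor(m, rho):
    """Ratio of the mean-score variance under correlation to that under independence."""
    correlated = variance_of_mean(equicorrelated_cov(m, rho))
    independent = variance_of_mean(equicorrelated_cov(m, 0.0))
    return correlated / independent


if __name__ == "__main__":
    # A modest correlation in a 50-gene set already inflates the variance several-fold.
    print(inflation_factor(50, 0.1))
```

A test that assumes independence uses the smaller variance, so even weak positive correlation within a gene set can make an enrichment score look far more significant than it is.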

arXiv.org e-Print Archive

Elsevier - Publisher Connector

ISMB 2008 Toronto

Author: Linial Michal
Mesirov Jill P.
Morrison McKay B. J.
Rost Burkhard
Publication venue: Public Library of Science
Publication date: 01/06/2008

The International Society for Computational Biology (ISCB) presents the Sixteenth International Conference on Intelligent Systems for Molecular Biology (ISMB 2008), to be held in Toronto, Canada, July 19–23, 2008. Now in the final phases of scheduling selected presentations, demonstrations, and posters, the organizers are preparing what will likely be recognized as the premier conference on computational biology in 2008. ISMB 2008 (http://www.iscb.org/ismb2008/) will follow the road paved by the ISMB/ECCB 2007 (http://www.iscb.org/ismbeccb2007/) in Vienna in the attempt to specifically encourage increased participation from previously under-represented disciplines of computational biology. This conference will feature the best of the computer and life sciences through a variety of core sessions running in multiple parallel tracks, along with single-tracked Keynote Presentations, posters on display throughout the duration of the conference, and an extensive commercial exposition. The first day (July 18) of the meeting is reserved for two-day Special Interest Group (SIG) and Satellite meetings, the second day (July 19) runs SIGs for the first time in parallel with Tutorials and the Student Council Symposium, and for the first time two SIGs are running in parallel with the main ISMB meeting (July 20–23).

Crossref

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

Joint Modeling and Registration of Cell Populations in Cohorts of High-Dimensional Flow Cytometric Data

Author: Duong Tarn
Hafler David
Irish Jonathan
Lee Sharon
Levy Ronald
McLachlan Geoffrey J.
Mesirov Jill
Nazaire Marc-Danie
Ng Shu-Kay
Nolan Garry
Pyne Saumyadipta
Tamayo Pablo
Wang Kui
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 31/05/2013

In systems biomedicine, an experimenter encounters different potential sources of variation in data such as individual samples, multiple experimental conditions, and multi-variable network-level responses. In multiparametric cytometry, which is often used for analyzing patient samples, such issues are critical. While computational methods can identify cell populations in individual samples, without the ability to automatically match them across samples, it is difficult to compare and characterize the populations in typical experiments, such as those responding to various stimulations or distinctive of particular patients or time-points, especially when there are many samples. Joint Clustering and Matching (JCM) is a multi-level framework for simultaneous modeling and registration of populations across a cohort. JCM models every population with a robust multivariate probability distribution. Simultaneously, JCM fits a random-effects model to construct an overall batch template, which is used for registering populations across samples and for classifying new samples. By tackling systems-level variation, JCM supports practical biomedical applications involving large cohorts.

arXiv.org e-Print Archive

Adelaide Research & Scholarship

Directory of Open Access Journals

PubMed Central

University of Queensland eSpace

FigShare

Interpreting Patterns of Gene Expression with Self-Organizing Maps: Methods and Application to Hematopoietic Differentiation

Author: Dmitrovsky Ethan
Kitareewan Sutisak
Mesirov Jill
Slonim Donna
Tamayo Pablo
Zhu Qing
Publication venue: Dartmouth Digital Commons
Publication date: 01/03/1999

Array technologies have made it straightforward to monitor simultaneously the expression pattern of thousands of genes. The challenge now is to interpret such massive data sets. The first step is to extract the fundamental patterns of gene expression inherent in the data. This paper describes the application of self-organizing maps, a type of mathematical cluster analysis that is particularly well suited for recognizing and classifying features in complex, multidimensional data. The method has been implemented in a publicly available computer package, GENECLUSTER, that performs the analytical calculations and provides easy data visualization. To illustrate the value of such analysis, the approach is applied to hematopoietic differentiation in four well studied models (HL-60, U937, Jurkat, and NB4 cells). Expression patterns of some 6,000 human genes were assayed, and an online database was created. GENECLUSTER was used to organize the genes into biologically relevant clusters that suggest novel hypotheses about hematopoietic differentiation, for example by highlighting certain genes and pathways involved in differentiation therapy used in the treatment of acute promyelocytic leukemia.
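For readers unfamiliar with self-organizing maps, a single training step can be sketched as follows. This is the generic Kohonen-style update on a 1-D chain of nodes, not GENECLUSTER's implementation, whose details the abstract does not give; the function names, the Gaussian neighborhood, and the default `learning_rate` and `radius` values are assumptions made for illustration.

```python
import math


def best_matching_unit(nodes, x):
    """Index of the node whose weight vector is closest to x (squared Euclidean distance)."""
    return min(
        range(len(nodes)),
        key=lambda k: sum((w - v) ** 2 for w, v in zip(nodes[k], x)),
    )


def som_update(nodes, x, learning_rate=0.5, radius=1.0):
    """One SOM step: pull every node toward x, weighted by its grid distance to the BMU."""
    bmu = best_matching_unit(nodes, x)
    updated = []
    for k, weights in enumerate(nodes):
        # Gaussian neighborhood: 1 at the BMU, decaying with distance along the chain.
        h = math.exp(-((k - bmu) ** 2) / (2 * radius ** 2))
        updated.append([w + learning_rate * h * (v - w) for w, v in zip(weights, x)])
    return bmu, updated


if __name__ == "__main__":
    # Three nodes on a chain, each holding a 2-D weight vector (e.g. a two-timepoint profile).
    nodes = [[0.0, 0.0], [1.0, 1.0], [2.0, 2.0]]
    bmu, nodes = som_update(nodes, [0.9, 1.2])
    print(bmu, nodes)
```

In a full run this step would be repeated over many expression profiles while the learning rate and radius shrink, so that neighboring nodes end up representing similar expression patterns.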

Dartmouth Digital Commons (Dartmouth College)

Subclass Mapping: Identifying Common Subtypes in Independent Disease Data Sets

Author: Brunet Jean-Philippe
Golub Todd R.
Hoshida Yujin
Mesirov Jill P.
Tamayo Pablo
Publication venue: Public Library of Science
Publication date: 01/01/2007

Whole genome expression profiles are widely used to discover molecular subtypes of diseases. A remaining challenge is to identify the correspondence or commonality of subtypes found in multiple, independent data sets generated on various platforms. While model-based supervised learning is often used to make these connections, the models can be biased to the training data set and thus miss inherent, relevant substructure in the test data. Here we describe an unsupervised subclass mapping method (SubMap), which reveals common subtypes between independent data sets. The subtypes within a data set can be determined by unsupervised clustering or given by predetermined phenotypes before applying SubMap. We define a measure of correspondence for subtypes and evaluate its significance building on our previous work on gene set enrichment analysis. The strength of the SubMap method is that it does not impose the structure of one data set upon another, but rather uses a bi-directional approach to highlight the common substructures in both. We show how this method can reveal the correspondence between several cancer-related data sets. Notably, it identifies common subtypes of breast cancer associated with estrogen receptor status, and a subgroup of lymphoma patients who share similar survival patterns, thus improving the accuracy of a clinical outcome predictor.

CiteSeerX

Public Library of Science (PLOS)

Directory of Open Access Journals

PubMed Central

eScholarship - University of California