Search CORE

31,108 research outputs found

Recommended from our members

Integrating Data Clustering and Visualization for the Analysis of 3D Gene Expression Data

Author: All authors are with the Berkeley Drosophila Transcription Network Project Lawrence Berkeley National Laboratory,
Bethel E. Wes
Biggin Mark D.
Computational Research Division Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA
Computer Science Department University of California, Irvine, CA, USA,
Computer Science Division University of California, Berkeley, CA, USA,
Data Analysis and Visualization (IDAV) and the Department of Computer Science University of California, Davis, One Shields Avenue, Davis CA 95616, USA,
Eisen Michael B.
Fowlkes Charless C.
Genomics Division Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley CA 94720, USA
Hagen Hans
Hamann Bernd
Hendriks Cris L. Luengo
Huang Min-Yu
Keranen Soile V. E.
Knowles David W.
Life Sciences Division Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley CA 94720, USA,
Malik Jitendra
nternational Research Training Group ``Visualization of Large and Unstructured Data Sets '' University of Kaiserslautern, Germany
Rubel Oliver
Weber Gunther H.
Publication venue: Lawrence Berkeley National Laboratory
Publication date: 12/05/2008
Field of study

The recent development of methods for extracting precise measurements of spatial gene expression patterns from three-dimensional (3D) image data opens the way for new analyses of the complex gene regulatory networks controlling animal development. We present an integrated visualization and analysis framework that supports user-guided data clustering to aid exploration of these new complex datasets. The interplay of data visualization and clustering-based data classification leads to improved visualization and enables a more detailed analysis than previously possible. We discuss (i) integration of data clustering and visualization into one framework; (ii) application of data clustering to 3D gene expression data; (iii) evaluation of the number of clusters k in the context of 3D gene expression clustering; and (iv) improvement of overall analysis quality via dedicated post-processing of clustering results based on visualization. We discuss the use of this framework to objectively define spatial pattern boundaries and temporal profiles of genes and to analyze how mRNA patterns are controlled by their regulatory transcription factors

UNT Digital Library

A visual analytics framework for cluster analysis of DNA microarray data

Author: Castellanos-Garzón José A.
Díaz Fernando
García Carlos Armando
Novais Paulo
Publication venue: 'Elsevier BV'
Publication date: 01/01/2013
Field of study

Prova tipográficaCluster analysis of DNA microarray data is an important but difficult task in knowledge discovery processes. Many clustering methods are applied to analysis of data for gene expression, but none of them is able to deal with an absolute way with the challenges that this technology raises. Due to this, many applications have been developed for visually representing clustering algorithm results on DNA microarray data, usually providing dendrogram and heat map visualizations. Most of these applications focus only on the above visualizations, and do not offer further visualization components to the validate the clustering methods or to validate one another. This paper proposes using a visual analytics framework in cluster analysis of gene expression data. Additionally, it presents a new method for finding cluster boundaries based on properties of metric spaces. Our approach presents a set of visualization components able to interact with each other; namely, parallel coordinates, cluster boundary genes, 3D cluster surfaces and DNA microarray visualizations as heat maps. Experimental results have shown that our framework can be very useful in the process of more fully understanding DNA microarray data. The software has been implemented in Java, and the framework is publicly available at http://www. analiticavisual.com/jcastellanos/3DVisualCluster/3D-VisualCluster.This work has been partially funded by the Spanish Ministry of Science and Innovation, the Plan E from the Spanish Government, the European Union from the ERDF (TIN2009-14057-C03-02)

Universidade do Minho: RepositoriUM

Crossref

Proteran : animated terrain evolution for visual analysis of protein folding trajectory

Author: Kapila Kush
Publication venue
Publication date: 01/01/2004
Field of study

In the field of bio-informatics, the analysis of voluminous data is becoming increasingly crucial to understanding the underlying biology and answering important questions. Various clustering techniques such as Hierarchical, SOMs, K-means and PCA are being used to cluster gene expression data to find the functions of unknown genes. Even these sophisticated algorithms are futile if the results are not appropriately interpreted, thus visualization techniques play an important role in analyzing data. Similar to clustering of gene expression data is that of clustering the characteristics of protein folding trajectory and a good visualization tool can help visual analysis and can provide faster and deeper insights into the manner in which a protein folds. With protein characteristics data and specific visualization requirements provided by Dr. Laxmi Parida and Dr. Ruhong Zhou of the Computation Biology Group at the IBM T. J. Watson Research Center, a new 3D visualization technique was designed and developed. This customized technique helps identify the major states a protein folds into through the use of an animated terrain. This technique was implemented as part of the interactive visualization program PROTERAN and tested with the β-Hairpin clustered data provided

Concordia University Research Repository

Integrating Data Clustering and Visualization for the Analysis of 3D Gene Expression Data

Author: B. Hamann
C.C. Fowlkes
C.L. Luengo Hendriks
D.W. Knowles
E.W. Bethel
G.H. Weber
H. Hagen
J. Malik
M.B. Eisen
M.D. Biggin
Min-Yu Huang
O. Rubel
S.V.E. Keranen
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

chroGPS, a global chromatin positioning system for the functional analysis and visualization of the epigenome

Author: Boulesteix
Celniker
David Rossell
Dunham
Ernst
Ernst
Ernst
Fernando Azorín
Filion
Gan
Gentleman
Gerstein
Joan Font-Burgada
Karnik
Kharchenko
Lange
Lee
Lin
Liu
Lloret-Llinares
Lloret-Llinares
Miclaus
Motsinger
Musselman
Oscar Reina
Pedersen
Pérez-Lluch
Roudier
Roy
Smoot
Steiner
Torgerson
van Bemmel
van de Wiel
Publication venue: 'Oxford University Press (OUP)'
Publication date: 23/11/2013
Field of study

Development of tools to jointly visualize the genome and the epigenome remains a challenge. chroGPS is a computational approach that addresses this question. chroGPS uses multidimensional scaling techniques to represent similarity between epigenetic factors, or between genetic elements on the basis of their epigenetic state, in 2D/3D reference maps. We emphasize biological interpretability, statistical robustness, integration of genetic and epigenetic data from heterogeneous sources, and computational feasibility. Although chroGPS is a general methodology to create reference maps and study the epigenetic state of any class of genetic element or genomic region, we focus on two specific kinds of maps: chroGPSfactors, which visualizes functional similarities between epigenetic factors, and chroGPSgenes, which describes the epigenetic state of genes and integrates gene expression and other functional data. We use data from the modENCODE project on the genomic distribution of a large collection of epigenetic factors in Drosophila, a model system extensively used to study genome organization and function. Our results show that the maps allow straightforward visualization of relationships between factors and elements, capturing relevant information about their functional properties that helps to interpret epigenetic information in a functional context and derive testable hypotheses

Crossref

PubMed Central

Warwick Research Archives Portal Repository

Digital.CSIC

UPF Digital Repository

Expression cartography of human tissues using self organizing maps

Author: Hans Binder
Henry Wirth
Publication venue
Publication date: 21/03/2011
Field of study

Background: The availability of parallel, high-throughput microarray and sequencing experiments poses a challenge how to best arrange and to analyze the obtained heap of multidimensional data in a concerted way. Self organizing maps (SOM), a machine learning method, enables the parallel sample- and gene-centered view on the data combined with strong visualization and second-level analysis capabilities. The paper addresses aspects of the method with practical impact in the context of expression analysis of complex data sets.
Results: The method was applied to generate a SOM characterizing the whole genome expression profiles of 67 healthy human tissues selected from ten tissue categories (adipose, endocrine, homeostasis, digestion, exocrine, epithelium, sexual reproduction, muscle, immune system and nervous tissues). SOM mapping reduces the dimension of expression data from ten thousands of genes to a few thousands of metagenes where each metagene acts as representative of a minicluster of co-regulated single genes. Tissue-specific and common properties shared between groups of tissues emerge as a handful of localized spots in the tissue maps collecting groups of co-regulated and co-expressed metagenes. The functional context of the spots was discovered using overrepresentation analysis with respect to pre-defined gene sets of known functional impact. We found that tissue related spots typically contain enriched populations of gene sets well corresponding to molecular processes in the respective tissues. Analysis techniques normally used at the gene-level such as two-way hierarchical clustering provide a better signal-to-noise ratio and a better representativeness of the method if applied to the metagenes. Metagene-based clustering analyses aggregate the tissues into essentially three clusters containing nervous, immune system and the remaining tissues. 
Conclusions: The global view on the behavior of a few well-defined modules of correlated and differentially expressed genes is more intuitive and more informative than the separate discovery of the expression levels of hundreds or thousands of individual genes. The metagene approach is less sensitive to a priori selection of genes. It can detect a coordinated expression pattern whose components would not pass single-gene significance thresholds and it is able to extract context-dependent patterns of gene expression in complex data sets.&#xa

Nature Precedings