Genome Expression Pathway Analysis Tool – Analysis and visualization of microarray gene expression data under genomic, proteomic and metabolic context

Author: A Rosenwald
AA Alizadeh
AI Saeed
B Mlecnik
B Zhang
BM Bolstad
C von Mering
F Al-Shahrour
Gene Ontology Consortium
GJ Dennis
GK Smyth
J Rainer
JM Vaquerizas
Julia C Engelmann
Jörg Schultz
M Kanehisa
M Kapushesky
M Kotera
M Masseroli
M Pelizzola
Markus Weniger
O Troyanskaya
P Khatri
P Lichter
P Shannon
R Gentleman
R Shamir
S Bea
SW Doniger
TJP Hubbard
W Huber
YH Yang
Publication venue: BioMed Central
Publication date: 01/06/2007

Background: Regulation of gene expression is relevant to many areas of biology and medicine, including the study of treatments, diseases, and developmental stages. Microarrays can measure the expression levels of thousands of mRNAs at the same time, allowing insight into, or comparison of, different cellular conditions. The data derived from microarray experiments are high-dimensional and often noisy, and interpreting the results can be intricate. Although programs for the statistical analysis of microarray data exist, most of them do not integrate analysis results with biological interpretation. Results: We have developed GEPAT, the Genome Expression Pathway Analysis Tool, which offers analysis of gene expression data in a genomic, proteomic and metabolic context. It integrates statistical methods for data import and data analysis with biological interpretation for subsets of probes or single probes on the chip. GEPAT imports various oligonucleotide and cDNA array data formats. Different normalization methods can be applied to the data, after which data annotation is performed. After import, GEPAT offers various statistical data analysis methods, such as hierarchical, k-means and PCA clustering, a linear-model-based t-test, and chromosomal profile comparison. The results of the analysis can be interpreted by enrichment of biological terms, pathway analysis or interaction networks. Different biological databases are included to provide information for each probe on the chip. GEPAT does not impose a linear workflow; any subset of probes and samples can serve as the start of a new data analysis. GEPAT relies on established data analysis packages, offers a modular design for easy extension, and can be run on a computer grid to serve a large number of users. It is freely available under the LGPL open source license for academic and commercial users at http://gepat.sourceforge.net.
Conclusion: GEPAT is a modular, scalable and professional-grade software package integrating analysis and interpretation of microarray gene expression data. An installation available for academic users can be found at http://gepat.bioapps.biozentrum.uni-wuerzburg.de.
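The abstract mentions interpreting results by enrichment of biological terms but does not say which test GEPAT uses. As a minimal sketch, assuming the common hypergeometric over-representation test, the p-value for a term in a selected probe subset could be computed as follows. The function name and its parameters are hypothetical and are not taken from GEPAT.

```python
from math import comb


def enrichment_p_value(n_universe, n_annotated, n_selected, n_hits):
    """Upper-tail hypergeometric probability P(X >= n_hits).

    Assumed inputs (illustrative, not GEPAT's API):
      n_universe  -- probes on the chip (background set)
      n_annotated -- probes in the background carrying the term
      n_selected  -- probes in the subset under study
      n_hits      -- selected probes carrying the term
    """
    max_hits = min(n_annotated, n_selected)
    total = comb(n_universe, n_selected)
    # Sum the probability of observing n_hits or more annotated probes
    # when drawing n_selected probes without replacement.
    tail = sum(
        comb(n_annotated, k) * comb(n_universe - n_annotated, n_selected - k)
        for k in range(n_hits, max_hits + 1)
    )
    return tail / total


if __name__ == "__main__":
    # 10 probes, 4 carry the term, 3 selected, 2 of those carry the term.
    print(enrichment_p_value(10, 4, 3, 2))
```

In practice such p-values are computed for many terms at once, so a multiple-testing correction would normally follow; that step is omitted here.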

Crossref

University of Regensburg Publication Server

Directory of Open Access Journals

PubMed Central

TREEOME: A framework for epigenetic and transcriptomic data integration to explore regulatory interactions controlling transcription

Author: Budden David M
Crampin Edmund J
Hurley Daniel G
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015

Motivation: Predictive modelling of gene expression is a powerful framework for the in silico exploration of transcriptional regulatory interactions through the integration of high-throughput -omics data. A major limitation of previous approaches is their inability to handle conditional and synergistic interactions that emerge when collectively analysing genes subject to different regulatory mechanisms. This limitation reduces overall predictive power and thus the reliability of downstream biological inference. Results: We introduce an analytical modelling framework (TREEOME: tree of models of expression) that integrates epigenetic and transcriptomic data by separating genes into putative regulatory classes. Current predictive modelling approaches have found both DNA methylation and histone modification epigenetic data to provide little or no improvement in accuracy of prediction of transcript abundance despite, for example, distinct anti-correlation between mRNA levels and promoter-localised DNA methylation. To improve on this, in TREEOME we evaluate four possible methods of formulating gene-level DNA methylation metrics, which provide a foundation for identifying gene-level methylation events and subsequent differential analysis, whereas most previous techniques operate at the level of individual CpG dinucleotides. We demonstrate TREEOME by integrating gene-level DNA methylation (bisulfite-seq) and histone modification (ChIP-seq) data to accurately predict genome-wide mRNA transcript abundance (RNA-seq) for H1-hESC and GM12878 cell lines. Availability: TREEOME is implemented using open-source software and made available as a pre-configured bootable reference environment. All scripts and data presented in this study are available online at http://sourceforge.net/projects/budden2015treeome/. Comment: 14 pages, 6 figures
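The abstract refers to four gene-level DNA methylation metrics without describing them here. Purely as an illustration of what moving from individual CpGs to a gene-level value can look like, the sketch below averages per-CpG methylation fractions inside a promoter window. The window sizes, the input layout and the function name are all assumptions, and this is not claimed to be one of the four TREEOME formulations.

```python
def promoter_methylation(cpg_sites, tss, upstream=1000, downstream=500):
    """Mean methylation fraction of covered CpGs near a transcription start site.

    cpg_sites  -- iterable of (position, methylated_reads, total_reads)
    tss        -- transcription start site coordinate
    upstream / downstream -- hypothetical window sizes in base pairs
    Strand is ignored for simplicity. Returns None when no covered CpG
    falls inside [tss - upstream, tss + downstream].
    """
    lo, hi = tss - upstream, tss + downstream
    fractions = [
        methylated / total
        for position, methylated, total in cpg_sites
        if lo <= position <= hi and total > 0  # skip uncovered CpGs
    ]
    if not fractions:
        return None
    return sum(fractions) / len(fractions)


if __name__ == "__main__":
    sites = [(900, 8, 10), (1100, 2, 10), (5000, 10, 10), (1000, 0, 0)]
    print(promoter_methylation(sites, tss=1000))
```

Averaging per-site fractions weights every CpG equally regardless of read depth; pooling reads across the window instead would be an equally defensible choice.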

arXiv.org e-Print Archive

Springer - Publisher Connector

How to understand the cell by breaking it: network analysis of gene perturbation screens

Author: Markowetz Florian
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 26/11/2009

Modern high-throughput gene perturbation screens are key technologies at the forefront of genetic research. Combined with rich phenotypic descriptors they enable researchers to observe detailed cellular reactions to experimental perturbations on a genome-wide scale. This review surveys the current state-of-the-art in analyzing perturbation screens from a network point of view. We describe approaches to make the step from the parts list to the wiring diagram by using phenotypes for network inference and integrating them with complementary data sources. The first part of the review describes methods to analyze one- or low-dimensional phenotypes like viability or reporter activity; the second part concentrates on high-dimensional phenotypes showing global changes in cell morphology, transcriptome or proteome. Comment: Review based on ISMB 2009 tutorial; after two rounds of revision

arXiv.org e-Print Archive

Directory of Open Access Journals

PubMed Central

chroGPS, a global chromatin positioning system for the functional analysis and visualization of the epigenome

Author: Boulesteix
Celniker
David Rossell
Dunham
Ernst
Fernando Azorín
Filion
Gan
Gentleman
Gerstein
Joan Font-Burgada
Karnik
Kharchenko
Lange
Lee
Lin
Liu
Lloret-Llinares
Miclaus
Motsinger
Musselman
Oscar Reina
Pedersen
Pérez-Lluch
Roudier
Roy
Smoot
Steiner
Torgerson
van Bemmel
van de Wiel
Publication venue: 'Oxford University Press (OUP)'
Publication date: 23/11/2013

Development of tools to jointly visualize the genome and the epigenome remains a challenge. chroGPS is a computational approach that addresses this challenge. chroGPS uses multidimensional scaling techniques to represent similarity between epigenetic factors, or between genetic elements on the basis of their epigenetic state, in 2D/3D reference maps. We emphasize biological interpretability, statistical robustness, integration of genetic and epigenetic data from heterogeneous sources, and computational feasibility. Although chroGPS is a general methodology to create reference maps and study the epigenetic state of any class of genetic element or genomic region, we focus on two specific kinds of maps: chroGPSfactors, which visualizes functional similarities between epigenetic factors, and chroGPSgenes, which describes the epigenetic state of genes and integrates gene expression and other functional data. We use data from the modENCODE project on the genomic distribution of a large collection of epigenetic factors in Drosophila, a model system extensively used to study genome organization and function. Our results show that the maps allow straightforward visualization of relationships between factors and elements, capturing relevant information about their functional properties that helps to interpret epigenetic information in a functional context and derive testable hypotheses.
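The abstract says chroGPS relies on multidimensional scaling but does not specify the variant. As a rough sketch, assuming classical (Torgerson) MDS on a symmetric, approximately Euclidean distance matrix, a low-dimensional map can be obtained from the leading eigenvectors of the double-centred squared distances. The code below uses plain power iteration to stay dependency-free; it is not the chroGPS implementation, and the function name is hypothetical.

```python
import math


def classical_mds(dist, dims=2, iters=500):
    """Embed n items in `dims` dimensions from a symmetric distance matrix.

    Assumes roughly Euclidean distances, so the leading eigenvalues of the
    double-centred matrix are positive.
    """
    n = len(dist)
    # Double-centre the squared distances: B = -1/2 * J * D^2 * J.
    sq = [[d * d for d in row] for row in dist]
    row_mean = [sum(row) / n for row in sq]
    grand = sum(row_mean) / n
    b = [[-0.5 * (sq[i][j] - row_mean[i] - row_mean[j] + grand)
          for j in range(n)] for i in range(n)]

    coords = [[0.0] * dims for _ in range(n)]
    for axis in range(dims):
        # Power iteration for the leading eigenvector of the deflated matrix.
        start = [float(i + 1) for i in range(n)]
        start_norm = math.sqrt(sum(x * x for x in start))
        v = [x / start_norm for x in start]
        for _ in range(iters):
            w = [sum(b[i][j] * v[j] for j in range(n)) for i in range(n)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm < 1e-12:
                break  # nothing left to extract
            v = [x / norm for x in w]
        # Rayleigh quotient gives the eigenvalue for the unit vector v.
        bv = [sum(b[i][j] * v[j] for j in range(n)) for i in range(n)]
        eigval = sum(x * y for x, y in zip(v, bv))
        if eigval <= 1e-12:
            break  # remaining axes carry no positive variance
        scale = math.sqrt(eigval)
        for i in range(n):
            coords[i][axis] = scale * v[i]
        # Deflate so the next pass finds the next eigenpair.
        for i in range(n):
            for j in range(n):
                b[i][j] -= eigval * v[i] * v[j]
    return coords


if __name__ == "__main__":
    # Distances between the corners of a 3-4-5 right triangle.
    triangle = [[0, 3, 4], [3, 0, 5], [4, 5, 0]]
    for point in classical_mds(triangle):
        print(point)
```

The resulting coordinates are determined only up to rotation and reflection, so maps should be compared through their pairwise distances, not their raw coordinates.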

Crossref

PubMed Central

Warwick Research Archives Portal Repository

Digital.CSIC

UPF Digital Repository

Integrating gene and protein expression data with genome-scale metabolic networks to infer functional pathways

Author: Beasley JE
Pey J
Planes FJ
Rubio A
Valgepea K
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013

This article has been made available through the Brunel Open Access Publishing Fund. Copyright © 2013 Pey et al.; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Background: The study of cellular metabolism in the context of high-throughput -omics data has allowed us to decipher novel mechanisms of importance in biotechnology and health. To continue this progress, it is essential to efficiently integrate experimental data into metabolic modeling. Results: We present here an in silico framework to infer relevant metabolic pathways for a particular phenotype under study based on its gene/protein expression data. This framework is based on the Carbon Flux Path (CFP) approach, a mixed-integer linear program that expands classical path finding techniques by considering additional biophysical constraints. In particular, the objective function of the CFP approach is amended to account for gene/protein expression data and influence the obtained paths. This approach is termed integrative Carbon Flux Path (iCFP). We show that gene/protein expression data also influence the stoichiometric balancing of CFPs, which provides a more accurate picture of active metabolic pathways. This is illustrated in both a theoretical and a real scenario. Finally, we apply this approach to find novel pathways relevant in the regulation of acetate overflow metabolism in Escherichia coli. As a result, several targets that could be relevant for a better understanding of the phenomenon leading to impaired acetate overflow are proposed. Conclusions: A novel mathematical framework that determines functional pathways based on gene/protein expression data is presented and validated. We show that our approach is able to provide new insights into complex biological scenarios such as acetate overflow in Escherichia coli. Basque Government
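The iCFP method itself is a mixed-integer linear program with stoichiometric balancing, which is beyond a short example. The sketch below only illustrates the underlying idea the abstract describes, namely letting expression data influence which path is found, by swapping in a plain Dijkstra shortest-path search whose edge costs fall as reaction expression rises. The cost formula, the `penalty` parameter, the normalised expression values and all names are assumptions for illustration.

```python
import heapq


def expression_weighted_path(reactions, source, target, expression, penalty=1.0):
    """Cheapest path between two metabolites with expression-aware edge costs.

    reactions  -- list of (reaction_id, substrate, product) directed edges
    expression -- dict reaction_id -> value in [0, 1] (assumed normalised)
    Each step costs 1 + penalty * (1 - expression), so highly expressed
    reactions are preferred. Returns (cost, [reaction ids]) or None.
    Not the iCFP MILP: no stoichiometric or biophysical constraints.
    """
    graph = {}
    for rid, substrate, product in reactions:
        cost = 1.0 + penalty * (1.0 - expression.get(rid, 0.0))
        graph.setdefault(substrate, []).append((product, rid, cost))

    queue = [(0.0, source, [])]
    settled = set()
    while queue:
        cost, node, path = heapq.heappop(queue)
        if node == target:
            return cost, path
        if node in settled:
            continue
        settled.add(node)
        for nxt, rid, step in graph.get(node, []):
            if nxt not in settled:
                heapq.heappush(queue, (cost + step, nxt, path + [rid]))
    return None


if __name__ == "__main__":
    reactions = [("r_direct", "A", "D"), ("r1", "A", "B"), ("r2", "B", "D")]
    expression = {"r_direct": 0.0, "r1": 1.0, "r2": 1.0}
    # Ignoring expression, the single-step route wins.
    print(expression_weighted_path(reactions, "A", "D", expression, penalty=0.0))
    # Penalising unexpressed reactions favours the two-step route.
    print(expression_weighted_path(reactions, "A", "D", expression, penalty=3.0))
```

Raising `penalty` shifts the search from the shortest route toward the best-supported one, which mirrors, in a much simplified form, amending an objective function with expression data.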

Crossref

Springer - Publisher Connector

PubMed Central

Brunel University Research Archive