Search CORE

6 research outputs found

Cyclebase.org—a comprehensive multi-organism online database of cell-cycle experiments

Author: Ahdesm ki
Andersson
Chen
Cho
de Lichtenberg
Hertz-Fowler
Jensen
Johansson
L. J. Jensen
Langmead
Liew
Lu
Lu
Lu
Luan
M. E. Larsen
Marguerat
Menges
N. P. Gauthier
Oliva
Pramila
R. Wernersson
Rhee
Rustici
S. Brunak
Spellman
T. S. Jensen
U. de Lichtenberg
Wichert
Zhao
Publication venue: Oxford University Press
Publication date: 01/01/2007
Field of study

The past decade has seen the publication of a large number of cell-cycle microarray studies and many more are in the pipeline. However, data from these experiments are not easy to access, combine and evaluate. We have developed a centralized database with an easy-to-use interface, Cyclebase.org, for viewing and downloading these data. The user interface facilitates searches for genes of interest as well as downloads of genome-wide results. Individual genes are displayed with graphs of expression profiles throughout the cell cycle from all available experiments. These expression profiles are normalized to a common timescale to enable inspection of the combined experimental evidence. Furthermore, state-of-the-art computational analyses provide key information on both individual experiments and combined datasets such as whether or not a gene is periodically expressed and, if so, the time of peak expression. Cyclebase is available at http://www.cyclebase.org

CiteSeerX

Crossref

PubMed Central

Online Research Database In Technology

Large scale comparison of global gene expression patterns in human and mouse

Author: Brazma Alvis
Parkinson Helen
Rung Johan
Zheng-Bradley Xiangqun
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Springer - Publisher Connector

PubMed Central

A novel method for cross-species gene expression analysis

Author: A Alexa
A Arukwe
A Campain
A Ramasamy
A Rasche
A Sjögren
A Studer R
A Sweet-Cordero
AC Berglund
AP Davis
B Vallant
BM Bolstad
CE Purdom
CJ Martyniuka
CY Ung
D Chen
D G Joakim Larsson
DB Allison
DB Berry
DGJ Larsson
DM Kristensen
DS Bhoj
E Kristiansson
E Kristiansson
E Segal
E Thomas-Jones
EJ Routledge
Erik Kristiansson
EV Forbes
EW Sayers
F Chen
F Chen
F Geoghegan
F Pan
FZ Marques
Gabriella Arne
GC Tseng
GK Smythe
H Parkinson
HS Le
I Ginis
IJ Good
J Kilian
JA Miller
JC Kwekel
JC Newman
JG Sorensen
JM Laramie
JM Raser
JM Stuart
JP de Magalhaes
JP Sumpter
K Richter
L Gunnarsson
L Gunnarsson
L Huminiecki
L Klebanov
L Li
LA Henríquez-Hernández
Lina Gunnarsson
M Consortium
M de Wit
M Lynch
M Sárvári
ME Feder
NJB Isaac
O Carnevali
Olle Nerman
P Cahan
P Hu
R Grützmann
R Xu
RA Fisher
S Jobling
S Ohno
SC Tilton
T Barrett
TD Williams
Tobias Österlund
U Ala
W Hu
WP Kuo
X Chen
Y Lu
Y Lu
Y Lu
Y Taniguchi
YH Yang
Z Gu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Identifying Cycling Genes by Combining Sequence Homology and Expression Data

Author: Roni Rosenfeld (5360024)
Yong Lu (5411423)
Ziv Bar-Joseph (3896062)
Publication venue
Publication date: 01/01/2006
Field of study

Motivation: The expression of genes during the cell division process has now been studied in many different species. An important goal of these studies is to identify the set of cycling genes. To date, this was done independently for each of the species studied. Due to noise and other data analysis problems, accurately deriving a set of cycling genes from expression data is a hard problem. This is especially true for some of the multicellular organisms, including humans. Results: Here we present the first algorithm that combines microarray expression data from multiple species for identifying cycling genes. Our algorithm represents genes from multiple species as nodes in a graph. Edges between genes represent sequence similarity. Starting with the measured expression values for each species we use Belief Propagation to determine a posterior score for genes. This posterior isusedtodetermineanewset ofcyclinggenes foreachspecies. We applied our algorithm to improve the identification of the set of cell cycle genes in budding yeast and humans. As we show, by incorporating sequence similarity information we were able to obtain a more accurate set of genes compared to methods that rely on expression data alone. Our method was especially successful for the human dataset indicating that it can use a high quality dataset from one species to overcome noise problems in another. Availability: C implementation is available from the supportin

Text Mining and Gene Expression Analysis Towards Combined Interpretation of High Throughput Data

Author: Fundel Katrin
Publication venue: Ludwig-Maximilians-Universität München
Publication date: 13/09/2007
Field of study

Microarrays can capture gene expression activity for thousands of genes simultaneously and thus make it possible to analyze cell physiology and disease processes on molecular level. The interpretation of microarray gene expression experiments profits from knowledge on the analyzed genes and proteins and the biochemical networks in which they play a role. The trend is towards the development of data analysis methods that integrate diverse data types. Currently, the most comprehensive biomedical knowledge source is a large repository of free text articles. Text mining makes it possible to automatically extract and use information from texts. This thesis addresses two key aspects, biomedical text mining and gene expression data analysis, with the focus on providing high-quality methods and data that contribute to the development of integrated analysis approaches. The work is structured in three parts. Each part begins by providing the relevant background, and each chapter describes the developed methods as well as applications and results. Part I deals with biomedical text mining: Chapter 2 summarizes the relevant background of text mining; it describes text mining fundamentals, important text mining tasks, applications and particularities of text mining in the biomedical domain, and evaluation issues. In Chapter 3, a method for generating high-quality gene and protein name dictionaries is described. The analysis of the generated dictionaries revealed important properties of individual nomenclatures and the used databases (Fundel and Zimmer, 2006). The dictionaries are publicly available via a Wiki, a web service, and several client applications (Szugat et al., 2005). In Chapter 4, methods for the dictionary-based recognition of gene and protein names in texts and their mapping onto unique database identifiers are described. These methods make it possible to extract information from texts and to integrate text-derived information with data from other sources. Three named entity identification systems have been set up, two of them building upon the previously existing tool ProMiner (Hanisch et al., 2003). All of them have shown very good performance in the BioCreAtIvE challenges (Fundel et al., 2005a; Hanisch et al., 2005; Fundel and Zimmer, 2007). In Chapter 5, a new method for relation extraction (Fundel et al., 2007) is presented. It was applied on the largest collection of biomedical literature abstracts, and thus a comprehensive network of human gene and protein relations has been generated. A classification approach (Küffner et al., 2006) can be used to specify relation types further; e. g., as activating, direct physical, or gene regulatory relation. Part II deals with gene expression data analysis: Gene expression data needs to be processed so that differentially expressed genes can be identified. Gene expression data processing consists of several sequential steps. Two important steps are normalization, which aims at removing systematic variances between measurements, and quantification of differential expression by p-value and fold change determination. Numerous methods exist for these tasks. Chapter 6 describes the relevant background of gene expression data analysis; it presents the biological and technical principles of microarrays and gives an overview of the most relevant data processing steps. Finally, it provides a short introduction to osteoarthritis, which is in the focus of the analyzed gene expression data sets. In Chapter 7, quality criteria for the selection of normalization methods are described, and a method for the identification of differentially expressed genes is proposed, which is appropriate for data with large intensity variances between spots representing the same gene (Fundel et al., 2005b). Furthermore, a system is described that selects an appropriate combination of feature selection method and classifier, and thus identifies genes which lead to good classification results and show consistent behavior in different sample subgroups (Davis et al., 2006). The analysis of several gene expression data sets dealing with osteoarthritis is described in Chapter 8. This chapter contains the biomedical analysis of relevant disease processes and distinct disease stages (Aigner et al., 2006a), and a comparison of various microarray platforms and osteoarthritis models. Part III deals with integrated approaches and thus provides the connection between parts I and II: Chapter 9 gives an overview of different types of integrated data analysis approaches, with a focus on approaches that integrate gene expression data with manually compiled data, large-scale networks, or text mining. In Chapter 10, a method for the identification of genes which are consistently regulated and have a coherent literature background (Küffner et al., 2005) is described. This method indicates how gene and protein name identification and gene expression data can be integrated to return clusters which contain genes that are relevant for the respective experiment together with literature information that supports interpretation. Finally, in Chapter 11 ideas on how the described methods can contribute to current research and possible future directions are presented

Digitale Hochschulschriften der LMU