Search CORE

GeneTrailExpress: a web-based pipeline for the statistical evaluation of microarray experiments

Author: A Keller
A Subramanian
Andreas Gerasch
Andreas Keller
B Zhang
BJ Breitkreutz
C Backes
C Liu
Christina Backes
D Nam
F Al-Shahrour
F Al-Shahrour
H Lee
Hans-Peter Lenhof
J Herrero
J Kuentzer
J Kuentzer
J Morris
Jan Küntzer
K Hokamp
M Ashburner
M Kanehisa
M Krull
M Pelizzola
M Sirava
Maher Al-Awadhi
Michael Kaufmann
Oliver Kohlbacher
P Shannon
R Edgar
R Vicentini
S Maere
T Beissbarth
X Wang
Z Hu
Publication venue: BioMed Central
Publication date: 22/12/2008
Field of study

Computation of significance scores of unweighted Gene Set Enrichment Analyses

Author: A Subramanian
A Zanzoni
Andreas Keller
C Backes
C Backes
Christina Backes
E Rubin
H Hermjakob
H Lee
Hans-Peter Lenhof
J Küntzer
J Lamb
L Salwinski
M Kanehisa
M Krull
S Kim
S Peri
S Wachi
T Barrett
TGO Consortium
V Matys
V Mootha
Y Benjamini
Y Hochberg
Z Jiang
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Gene Set Enrichment Analysis (GSEA) is a computational method for the statistical evaluation of sorted lists of genes or proteins. Originally GSEA was developed for interpreting microarray gene expression data, but it can be applied to any sorted list of genes. Given the gene list and an arbitrary biological category, GSEA evaluates whether the genes of the considered category are randomly distributed or accumulated on top or bottom of the list. Usually, significance scores (p-values) of GSEA are computed by nonparametric permutation tests, a time consuming procedure that yields only estimates of the p-values. Results We present a novel dynamic programming algorithm for calculating exact significance values of unweighted Gene Set Enrichment Analyses. Our algorithm avoids typical problems of nonparametric permutation tests, as varying findings in different runs caused by the random sampling procedure. Another advantage of the presented dynamic programming algorithm is its runtime and memory efficiency. To test our algorithm, we applied it not only to simulated data sets, but additionally evaluated expression profiles of squamous cell lung cancer tissue and autologous unaffected tissue.</p

GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists

Author: A Subramanian
B Zeeberg
B Zhang
C Backes
Doron Lipson
E Eden
E Gansner
EI Boyle
Eran Eden
F Al-Shahrour
F Al-Shahrour
GD Jr
Israel Steinfeld
JJJ Goeman
LJ van't Veer
M Ashburner
P Khatri
Q Xu
QWX Zheng
R Breitling
R Sealfon
Roy Navon
S Maere
TST Beissbarth
Zohar Yakhini
Publication venue: BioMed Central
Publication date: 01/02/2009
Field of study

Abstract Background Since the inception of the GO annotation project, a variety of tools have been developed that support exploring and searching the GO database. In particular, a variety of tools that perform GO enrichment analysis are currently available. Most of these tools require as input a target set of genes and a background set and seek enrichment in the target set compared to the background set. A few tools also exist that support analyzing ranked lists. The latter typically rely on simulations or on union-bound correction for assigning statistical significance to the results. Results <it>GOrilla </it>is a web-based application that identifies enriched GO terms in ranked lists of genes, without requiring the user to provide explicit target and background sets. This is particularly useful in many typical cases where genomic data may be naturally represented as a ranked list of genes (e.g. by level of expression or of differential expression). <it>GOrilla </it>employs a flexible threshold statistical approach to discover GO terms that are significantly enriched at the <it>top </it>of a ranked gene list. Building on a complete theoretical characterization of the underlying distribution, called mHG, <it>GOrilla </it>computes an exact p-value for the observed enrichment, taking threshold multiple testing into account without the need for simulations. This enables rigorous statistical analysis of thousand of genes and thousands of GO terms in order of seconds. The output of the enrichment analysis is visualized as a hierarchical structure, providing a clear view of the relations between enriched GO terms. Conclusion <it>GOrilla </it>is an efficient GO analysis tool with unique features that make a useful addition to the existing repertoire of GO enrichment tools. <it>GOrilla</it>'s unique features and advantages over other threshold free enrichment tools include rigorous statistics, fast running time and an effective graphical representation. <it>GOrilla </it>is publicly available at: <url>http://cbl-gorilla.cs.technion.ac.il</url></p

miRTargetLink—miRNAs, Genes and Interaction Networks

Author: Backes Christina
Fehlmann Tobias
Hamberg Maarten
Hart Martin
Keller Andreas
Meder Benjamin
Meese Eckart
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 09/11/2018
Field of study

Information on miRNA targeting genes is growing rapidly. For high-throughput experiments, but also for targeted analyses of few genes or miRNAs, easy analysis with concise representation of results facilitates the work of life scientists. We developed miRTargetLink, a tool for automating respective analysis procedures that are frequently applied. Input of the web-based solution is either a single gene or single miRNA, but also sets of genes or miRNAs, can be entered. Validated and predicted targets are extracted from databases and an interaction network is presented. Users can select whether predicted targets, experimentally validated targets with strong or weak evidence, or combinations of those are considered. Central genes or miRNAs are highlighted and users can navigate through the network interactively. To discover the most relevant biochemical processes influenced by the target network, gene set analysis and miRNA set analysis are integrated. As a showcase for miRTargetLink, we analyze targets of five cardiac miRNAs. miRTargetLink is freely available without restrictions at www.ccb.uni-saarland.de/mirtargetlink

Universaar

Acronym

BNDB – The Biochemical Network Database

Author: Backes Christina
Blum Torsten
Gerasch Andreas
Kaufmann Michael
Kohlbacher Oliver
Küntzer Jan
Lenhof Hans-Peter
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Technological advances in high-throughput techniques and efficient data acquisition methods have resulted in a massive amount of life science data. The data is stored in numerous databases that have been established over the last decades and are essential resources for scientists nowadays. However, the diversity of the databases and the underlying data models make it difficult to combine this information for solving complex problems in systems biology. Currently, researchers typically have to browse several, often highly focused, databases to obtain the required information. Hence, there is a pressing need for more efficient systems for integrating, analyzing, and interpreting these data. The standardization and virtual consolidation of the databases is a major challenge resulting in a unified access to a variety of data sources. Description We present the Biochemical Network Database (BNDB), a powerful relational database platform, allowing a complete semantic integration of an extensive collection of external databases. BNDB is built upon a comprehensive and extensible object model called BioCore, which is powerful enough to model most known biochemical processes and at the same time easily extensible to be adapted to new biological concepts. Besides a web interface for the search and curation of the data, a Java-based viewer (BiNA) provides a powerful platform-independent visualization and navigation of the data. BiNA uses sophisticated graph layout algorithms for an interactive visualization and navigation of BNDB. Conclusion BNDB allows a simple, unified access to a variety of external data sources. Its tight integration with the biochemical network library BN++ offers the possibility for import, integration, analysis, and visualization of the data. BNDB is freely accessible at <url>http://www.bndb.org</url>.</p

GeneTrail 3: advanced high-throughput enrichment analysis

Author: Diener Caroline
Eckhart Lea
Gerstner Nico
Grammes Nadja Liddy
Hahn Oliver
Hart Martin
Kehl Tim
Keller Andreas
Lenhof Hans-Peter
Lenhof Kerstin
Mayer Carolin
Meese Eckart
Müller Anne
Walter Jörn
Wyss-Coray Tony
Publication venue: Saarländische Universitäts- und Landesbibliothek
Publication date: 01/01/2020
Field of study

We present GeneTrail 3, a major extension of our web service GeneTrail that offers rich functionality for the identification, analysis, and visualization of deregulated biological processes. Our web service provides a comprehensive collection of biological processes and signaling pathways for 12 model organisms that can be analyzed with a powerful framework for enrichment and network analysis of transcriptomic, miRNomic, proteomic, and genomic data sets. Moreover, GeneTrail offers novel workflows for the analysis of epigenetic marks, time series experiments, and single cell data. We demonstrate the capabilities of our web service in two case-studies, which highlight that GeneTrail is well equipped for uncovering complex molecular mechanisms. GeneTrail is freely accessible at: http://genetrail.bioinf.uni-sb.de

Universaar

Acronym

KOBAS 2.0: a web server for annotation and identification of enriched pathways and diseases

Author: Al-Shahrour
Alibes
Ashburner
Backes
Bauer
Becker
Benjamini
Benjamini
Berriz
Blalock
Carmona-Saez
Chen Xie
Chuan-Yun Li
Chung
Croft
Du
Ge Gao
Gentleman
Haider
Henegar
Hindorff
Hosack
Huang
Huang da
Huang da
Jiaju Huang
Jianmin Wu
Kanehisa
Kanehisa
Karp
Kersey
Lei Kong
Liping Wei
Maere
Mao
Masseroli
Matthews
Nogales-Cadenas
Osborne
Prifti
Reimand
Salomonis
Schaefer
Shan Dong
Shi
Sridhar
Storey
Thomas
Usadel
Wu
Xizeng Mao
Yang Ding
Zhang
Publication venue: Oxford University Press
Publication date: 01/01/2011
Field of study

High-throughput experimental technologies often identify dozens to hundreds of genes related to, or changed in, a biological or pathological process. From these genes one wants to identify biological pathways that may be involved and diseases that may be implicated. Here, we report a web server, KOBAS 2.0, which annotates an input set of genes with putative pathways and disease relationships based on mapping to genes with known annotations. It allows for both ID mapping and cross-species sequence similarity mapping. It then performs statistical tests to identify statistically significantly enriched pathways and diseases. KOBAS 2.0 incorporates knowledge across 1327 species from 5 pathway databases (KEGG PATHWAY, PID, BioCyc, Reactome and Panther) and 5 human disease databases (OMIM, KEGG DISEASE, FunDO, GAD and NHGRI GWAS Catalog). KOBAS 2.0 can be accessed at http://kobas.cbi.pku.edu.cn

Proceedings - University of Groningen

FUNAGE-Pro:comprehensive web server for gene set enrichment analysis of prokaryotes

Author: de Jong Anne
Kok Jan
Kuipers Oscar P
Publication venue: 'Oxford University Press (OUP)'
Publication date: 31/05/2022
Field of study

Recent advances in the field of high throughput (meta-)transcriptomics and proteomics call for easy and rapid methods enabling to explore not only single genes or proteins but also extended biological systems. Gene set enrichment analysis is commonly used to find relations in a set of genes and helps to uncover the biological meaning in results derived from high-throughput data. The basis for gene set enrichment analysis is a solid functional classification of genes. Here, we describe a comprehensive database containing multiple functional classifications of genes of all (>55 000) publicly available complete bacterial genomes. In addition to the most common functional classes such as COG and GO, also KEGG, InterPro, PFAM, eggnog and operon classes are supported. As classification data for features is often not available, we offer fast annotation and classification of proteins in any newly sequenced bacterial genome. The web server FUNAGE-Pro enables fast functional analysis on single gene sets, multiple experiments, time series data, clusters, and gene network modules for any prokaryote species or strain. FUNAGE-Pro is freely available at http://funagepro.molgenrug.nl

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

Novel autoantigens immunogenic in COPD patients

Author: A Agusti
A Davidson
C Bauer
CA Feghali-Bostwick
CJ Murray
D Mori
DI Jeoung
H Shiels
IK Demedts
J Greiner
JC Hogg
JG Dohlman
K Bussow
K Russo
KA Moore
L Casciola-Rosen
L Taraseviciene-Stewart
LM Ayer
LM Ayer
MZ Atassi
N Comtesse
N Comtesse
RJ Halbert
S Iwashita
SH Lee
TJ MacDonald
W MacNee
X Wang
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Chronic obstructive pulmonary disease (COPD) is a respiratory inflammatory condition with autoimmune features including IgG autoantibodies. In this study we analyze the complexity of the autoantibody response and reveal the nature of the antigens that are recognized by autoantibodies in COPD patients. Methods An array of 1827 gridded immunogenic peptide clones was established and screened with 17 sera of COPD patients and 60 healthy controls. Protein arrays were evaluated both by visual inspection and a recently developed computer aided image analysis technique. By this computer aided image analysis technique we computed the intensity values for each peptide clone and each serum and calculated the area under the receiver operator characteristics curve (AUC) for each clone and the separation COPD sera versus control sera. Results By visual evaluation we detected 381 peptide clones that reacted with autoantibodies of COPD patients including 17 clones that reacted with more than 60% of the COPD sera and seven clones that reacted with more than 90% of the COPD sera. The comparison of COPD sera and controls by the automated image analysis system identified 212 peptide clones with informative AUC values. By <it>in silico </it>sequence analysis we found an enrichment of sequence motives previously associated with immunogenicity. Conclusion The identification of a rather complex humoral immune response in COPD patients supports the idea of COPD as a disease with strong autoimmune features. The identification of novel immunogenic antigens is a first step towards a better understanding of the autoimmune component of COPD.</p