Linking the Epigenome to the Genome: Correlation of Different Features to DNA Methylation of CpG Islands

Author: A Barski
A Bird
A Henckel
A Jeltsch
A Meissner
A Siepel
AH Ting
Andreas Zell
AP Bird
B Rhead
BE Bernstein
Brock C. Christensen
C Bock
C Previti
C Wrzodek
CC Chang
CD Bustos
Clemens Wrzodek
D Jia
D Takai
D Zilberman
DE Schones
E Schilling
EJ Gardiner
ES Lander
F Antequera
F Eckhardt
F Fang
F Fuks
F Mohn
FA Feltus
Finja Büchel
Florian Mittag
GD Stormo
Georg Hinselmann
H Cedar
H Vikas
JF Costello
JG Cleary
Johannes Eichner
JT Bell
KL Thu
M Burset
M Esteller
M Gardiner-Garden
M Hall
M Oka
P Baldi
P Dehan
P Hajkova
PA Jones
R Das
R Fan
R Lister
RA Rollins
RM Brena
S Aerts
S Fan
S Kim
S Kochanek
SE Celniker
SKT Ooi
W Reik
WJ Kent
Y Wang
Y Zhang
Publication venue: Public Library of Science
Publication date: 01/01/2012

DNA methylation of CpG islands plays a crucial role in the regulation of gene expression. More than half of all human promoters contain CpG islands with a tissue-specific methylation pattern in differentiated cells. How DNA methyltransferases determine which regions to methylate is still not fully understood. Many hypotheses about which genomic features correlate with the epigenome have not yet been evaluated. Furthermore, many exploratory approaches to measuring DNA methylation are limited to a subset of the genome and thus cannot be employed, e.g., for genome-wide biomarker prediction. In this study, we evaluated the correlation of genetic, epigenetic and hypothesis-driven features with DNA methylation of CpG islands. To this end, various binary classifiers were trained and evaluated by cross-validation on a dataset comprising DNA methylation data for 190 CpG islands in HEPG2, HEK293, fibroblasts and leukocytes. We achieved an accuracy of up to 91% with a Matthews correlation coefficient (MCC) of 0.8 using ten repetitions of ten-fold cross-validation. With these models, we extended the existing dataset to the whole genome and thus predicted the methylation landscape for the given cell types. The method used for these predictions was also validated on another external whole-genome dataset. Our results reveal features correlated with DNA methylation and confirm or disprove various hypotheses about DNA methylation-related features. This study confirms correlations between DNA methylation and histone modifications, DNA structure, DNA sequence, genomic attributes and CpG island properties. Furthermore, the method has been validated on a genome-wide dataset from the ENCODE consortium. The developed software, the predicted datasets and a web service for comparing methylation states of CpG islands are available at http://www.cogsys.cs.uni-tuebingen.de/software/dna-methylation/

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Publikationsserver der Universität Tübingen

Predicting DNA-Binding Specificities of Eukaryotic Transcription Factors

Author: A Juncker
A Kel
A Moll
A Prakash
A Sandelin
A Sarai
Adrian Schröder
AM Leontovich
AM Waterhouse
Andreas Zell
BC Foat
BE Engelhardt
C Bock
C Wrzodek
Carsten Henneges
CJ Harrison
CJ Mungall
CM Bergman
CS Leslie
D Alamanova
D Wilson
D Zhou
DA Rodionov
DE Newburger
Dierk Wanke
DL Wheeler
E Boutet
E Kretschmann
E Wingender
G Badis
H Hegyi
H Li
H Saigo
HG Roider
J Kilian
J Kopp
J Supper
J Zhu
JA Gerlt
JC Bryne
JL Risler
Jochen Supper
Johannes Eichner
Jonas Eichner
JV Turatsinze
K Higo
K Liolios
K Niefind
K Pearson
L Liao
L Narlikar
L Wei
LJ Jensen
M Akerfelt
M Piipari
MA Andrade
MC Teixeira
MO Dayhoff
N Shental
P Baldi
P Bork
P Flicek
P Stegmaier
PH von Hippel
PK Mehta
PV Loo
R Bonneau
R Lüthy
RCG Holland
RV Davuluri
S Aerts
S Henikoff
S Kawashima
S Mahony
S Miyazawa
SB Needleman
SJ Maerkl
T Miyata
Tim J. Hubbard
TM Alleyne
U Gerland
UJ Pape
V Matys
XD Liu
Publication venue: Public Library of Science
Publication date: 01/01/2010

Annotated amino acid sequences are now readily available for a growing number of transcription factors (TFs). Quantitative information about their DNA-binding specificities, however, is hard to obtain. Position frequency matrices (PFMs), the most widely used models for representing binding specificities, have been experimentally characterized for only a small fraction of all TFs. Even for some of the most intensively studied eukaryotic organisms (i.e., human, rat and mouse), only roughly one-sixth of all proteins with an annotated DNA-binding domain have been characterized experimentally. Here, we present a new method based on support vector regression for predicting quantitative DNA-binding specificities of TFs in different eukaryotic species. This approach estimates a quantitative measure of the PFM similarity of two proteins, based on various features derived from their protein sequences. The method is trained and tested on a dataset containing 1,239 TFs with known DNA-binding specificity, and used to predict specific DNA target motifs for 645 TFs with high accuracy.

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

SBML Level 3: an extensible format for the exchange and reuse of biological models

Author: Akira Funahashi
Alex Gutteridge
Ali Ebrahim
Alida Palmisano
Allyson L Lister
Andreas Dräger
Andrew Finney
Anna Zhukova
Aurélien Naldi
Axel Kamp
Bas Teusink
Bastian R Angermann
Benjamin D Heavner
Bernhard Palsson
Bin Hu
Brett G Olivier
Bruce E Shapiro
Camille Laibe
Carole J Proctor
Chris D Cox
Chris J Myers
Chris T Evelo
Christian Knüpfer
Christoph Flamm
Claudine Chaouiya
Clemens Wrzodek
Clifford A Shaffer
Colin S Gillespie
Conor Lawless
Dagmar Waltemath
Damon Hachmeister
Daniel Lucio
Daniel R Hyduke
Darren J Wilkinson
David Tolnay
Denis Thieffry
Devin P Sullivan
Dirk Drasdo
Douglas B Kell
Edda Klipp
Emanuel Gonçalves
Emek Demir
Eric Mjolsness
Faeder JR
Falk Schreiber
Falko Krause
Fedor Kolpakov
Fengkai Zhang
Finja Wrzodek
Florian Mittag
Frank T Bergmann
Gary D Bader
Hamid Bolouri
Harish Dharuri
Harold F Gómez
Henkel R
Henning Hermjakob
Henning Schmidt
Herbert M Sauro
Hidde Jong
Hiroaki Kitano
Hovakim Grabski
Huaiyu Mi
Hucka M
Ibrahim Vazirabad
Ilya Kiselev
Ioannis Xenarios
Ion I Moraru
James C Schaff
James R Faeder
Jan Červený
Jean‐Baptiste Pettit
Jeremy Zucker
Joerg Stelling
Johan Elf
Johann M Rohwer
Johannes Eichner
John C Doyle
John Wagner
Jonathan R Karr
Julien Dorier
Julio Saez‐Rodriguez
Kacser H
Karthik Raman
Kedar Nath Natarajan
Kieran Smallbone
Kimberly Begley
Koichi Takahashi
Leandro Watanabe
Leonard A Harris
Leslie M Loew
Liebermeister W
Lu Li
Lucian P Smith
Lukas Endler
Maciej J Swat
Malik‐Sheriff RS
Marco Antoniotti
Martin Golebiewski
Martin Meier‐Schellersheim
Martin Scharm
Martina Fröhlich
Martina Kutmon
Matthias König
Melanie I Stefan
Michael Hucka
Michael L Blinov
Michael Schubert
Mihai Glont
Mélanie Courtot
Naoki Tanimura
Neil Swainston
Nicholas A Allen
Nick Juty
Nicolas Rodriguez
Norsigian CJ
Oliver A Ruebenacker
Pedro Mendes
Pedro T Monteiro
Peter D Karp
Peters M
Piero Dalle Pezze
Poul MF Nielsen
Rahuman S Malik‐Sheriff
Rainer Machne
Richard R Adams
Robert D Phair
Roland Keller
Roman Schulte
Ron Henkel
Ronan MT Fleming
Sarah M Keating
Stefan Hoops
Steffen Klamt
Stuart C Sealfon
Stuart L Moodie
Sven Sahle
Sylvain Soliman
Thomas M Hamm
Thomas Pfau
Tomas Radivoyevitch
Tomáš Helikar
Tramy Nguyen
Ulrike Wittig
Watanabe LH
William S Denney
William S Hlavacek
Wolfram Liebermeister
Yuichiro Inagaki
Yukiko Matsuoka
Publication venue: EMBO
Publication date: 01/08/2020

Systems biology has experienced dramatic growth in the number, size, and complexity of computational models. To reproduce simulation results and reuse models, researchers must exchange unambiguous model descriptions. We review the latest edition of the Systems Biology Markup Language (SBML), a format designed for this purpose. A community of modelers and software authors developed SBML Level 3 over the past decade. Its modular form consists of a core suited to representing reaction-based models and packages that extend the core with features suited to other model types including constraint-based models, reaction-diffusion models, logical network models, and rule-based models. The format leverages two decades of SBML and a rich software ecosystem that transformed how systems biologists build and interact with models. More recently, the rise of multiscale models of whole cells and organs, and new data sources such as single-cell measurements and live imaging, has precipitated new ways of integrating data with models. We provide our perspectives on the challenges presented by these developments and how SBML Level 3 provides the foundation needed to support this evolution.

VU Research Portal

Crossref

OIST Institutional Repository

INRIA a CCSD electronic archive server

KEGGViewer, a BioJS component to visualize KEGG Pathways

Author: C Wrzodek
H Mi
J Gómez
J Villaveces
M Kanehisa
S Kawashima
Z Hu
Publication venue: F1000 Research Ltd

Crossref

CyKEGGParser: tailoring KEGG pathways to fit into systems biology analysis workflows

Author: A Arakelyan
C von Mering
C Wrzodek
M Kanehisa
T Packard
Z Isik
Publication venue: F1000 Research Ltd

Crossref

Integrated enrichment analysis and pathway-centered visualization of metabolomics, proteomics, transcriptomics, and genomics data by using the InCroMAP software.

Author: Eichner J.
Häring H.-U.
Lehmann R.
Rosenbaum L.
Wrzodek C.
Zell A.
Publication venue: Elsevier BV
Publication date: 01/01/2014

In systems biology, combining multiple types of omics data, such as metabolomics, proteomics, transcriptomics, and genomics, yields more information on a biological process than analyzing a single type of data. Thus, data from different omics platforms are usually combined in one experimental setup to gain insight into a biological process or a disease state. In particular, high-accuracy metabolomics data from modern mass spectrometry instruments are increasingly integrated into biological studies. Reflecting this trend, we extended InCroMAP, a data integration, analysis and visualization tool for genomics, transcriptomics, and proteomics data. The tool can now perform an integrated enrichment analysis and pathway-based visualization of multi-omics data and is thus suitable for evaluating comprehensive systems biology studies.

Publikationsserver der Universität Tübingen

PuSH

TFpredict and SABINE: Sequence-Based Prediction of Structural and Functional Characteristics of Transcription Factors

Author: Alexey Porollo
Andreas Dräger
Andreas Zell
C-C Chang
Clemens Wrzodek
Dierk Wanke
Florian Topf
Johannes Eichner
P Stegmaier
SF Altschul
Publication venue: Public Library of Science (PLoS)

Crossref

Pathway-based visualization of cross-platform microarray datasets

Author: A. Zell
Bartel
C. Wrzodek
Cline
Gehlenborg
Golub
Hoheisel
J. Eichner
Kanehisa
Lim
Lopez-Romero
Maglott
Markowetz
Pirnia
Salomonis
Schena
Schumacher
Wu
Yates
Publication venue: Oxford University Press (OUP)

Crossref

Gene Annotation Easy Viewer (GAEV): Integrating KEGG’s Gene Function Annotations and Associated Molecular Pathways

Author: A Bateman
C Camacho
C Wrzodek
G Boratyn
J Mistry
K Moutselos
M Kanehisa
P Jones
R Finn
Z Ye
Publication venue: F1000 Research Ltd

Crossref

Gene Annotation Easy Viewer (GAEV): Integrating KEGG’s Gene Function Annotations and Associated Molecular Pathways

Author: A Bateman
C Camacho
C Wrzodek
D Huson
G Boratyn
H Trung
J Mistry
K Moutselos
M Kanehisa
P Jones
R Finn
S Camiolo
Y Ye
Z Ye
Publication venue: F1000 Research Ltd

Crossref