91 research outputs found

Information-driven protein–DNA docking using HADDOCK: it is a matter of flexibility

Author: Boelens Rolf
Bonvin Alexandre M. J. J.
Hsu Victor
van Dijk Aalt D. J.
van Dijk Marc
Publication venue: Oxford University Press
Publication date: 01/01/2006

Intrinsic flexibility of DNA has hampered the development of efficient protein–DNA docking methods. In this study we extend HADDOCK (High Ambiguity Driven DOCKing) [C. Dominguez, R. Boelens and A. M. J. J. Bonvin (2003) J. Am. Chem. Soc. 125, 1731–1737] to explicitly deal with DNA flexibility. HADDOCK uses non-structural experimental data to drive the docking during a rigid-body energy minimization, and semi-flexible and water refinement stages. The latter allow for flexibility of all DNA nucleotides and the residues of the protein at the predicted interface. We evaluated our approach on the monomeric repressor–DNA complexes formed by bacteriophage 434 Cro, the Escherichia coli Lac headpiece and bacteriophage P22 Arc. Starting from unbound proteins and canonical B-DNA we correctly predict the spatial disposition of the complexes and the specific conformation of the DNA in the published complexes. This information is subsequently used to generate a library of pre-bent and twisted DNA structures that served as input for a second docking round. The resulting top ranking solutions exhibit high similarity to the published complexes in terms of root mean square deviations, intermolecular contacts and DNA conformation. Our two-stage docking method is thus able to successfully predict protein–DNA complexes from unbound constituents using non-structural experimental data to drive the docking.

Crossref

VU Research Portal

PubMed Central

Utrecht University Repository

PRI-CAT: a web-tool for the analysis, storage and visualization of plant ChIP-seq experiments

Author: Aalt D. J. van Dijk
Buisine
Cairns
Cesaroni
Feuillet
Gentleman
Gibbons
Goecks
Ji
Jose M. Muiño
Kaufmann
Kaufmann
Kaufmann
Kozarewa
Lan
Li
Marlous Hoogstraat
Muiño
Nicol
Pepke
Quail
Roeland C. H. J. van Ham
Zacher
Zhang
Zhang
Publication venue: Oxford University Press
Publication date: 01/01/2011

Although several tools for the analysis of ChIP-seq data have been published recently, there is a growing demand, in particular in the plant research community, for computational resources with which such data can be processed, analyzed, stored, visualized and integrated within a single, user-friendly environment. To accommodate this demand, we have developed PRI-CAT (Plant Research International ChIP-seq analysis tool), a web-based workflow tool for the management and analysis of ChIP-seq experiments. PRI-CAT is currently focused on Arabidopsis, but will be extended with other plant species in the near future. Users can directly submit their sequencing data to PRI-CAT for automated analysis. A QuickLoad server compatible with genome browsers is implemented for the storage and visualization of DNA-binding maps. Submitted datasets and results can be made publicly available through PRI-CAT, a feature that will enable community-based integrative analysis and visualization of ChIP-seq experiments. Secondary analysis of data can be performed with the aid of GALAXY, an external framework for tool and data integration. PRI-CAT is freely available at http://www.ab.wur.nl/pricat. No login is required

Crossref

PubMed Central

Wageningen University & Research Publications

CAPICE: a computational method for Consequence-Agnostic Pathogenicity Interpretation of Clinical Exome variations

Author: Abbott Kristin
Charbon Bart
de Ridder Dick
Deelen Patrick
Hendriksen Dennis
Kerstjens-Frederikse Wilhelmina S
Li Shuang
Sikkema-Raddatz Birgit
Sinke Richard J
Soudis Dimitrios
Swertz Morris A
van der Velde K Joeri
van Diemen Cleo C
van Dijk Aalt D J
van Gijn Marielle E
Zwerwer Leslie R
Publication venue: Springer Science and Business Media LLC
Publication date: 24/08/2020

Exome sequencing is now mainstream in clinical practice. However, identification of pathogenic Mendelian variants remains time-consuming, in part because the limited accuracy of current computational prediction methods requires manual classification by experts. Here we introduce CAPICE, a new machine-learning-based method for prioritizing pathogenic variants, including SNVs and short InDels. CAPICE outperforms the best general (CADD, GAVIN) and consequence-type-specific (REVEL, ClinPred) computational prediction methods, for both rare and ultra-rare variants. CAPICE is easily added to diagnostic pipelines as a pre-computed score file or command-line software, or through the online MOLGENIS web service with API. CAPICE is free and open source (LGPLv3) and can be downloaded at https://github.com/molgenis/capice.

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

Simulation of Organ Patterning on the Floral Meristem Using a Polar Auxin Transport Model

Author: Aalt D. J. van Dijk
AG Rolland-Lagan
AW Woodward
D Reinhardt
D Reinhardt
DR Smyth
E Coen
EM Bayer
EM Kramer
EM Kramer
Gerco C. Angenent
GJ Mitchison
H Jönsson
Henrik Jönsson
J Friml
J Friml
Jaap Molenaar
K Kaufmann
K Kaufmann
K Wabnik
Kerstin Kaufmann
M Michniewicz
MG Heisler
MH Goldsmith
PB de Reuille
R Aloni
R Benjamins
R Tobena-Santamaria
RMH Merks
RMH Merks
Roeland M. H. Merks
RS Smith
RS Smith
S Bennett
S Stoma
Simon van Mourik
SL Urbanus
T Paciorek
W Crone
Y Cheng
Publication venue: Public Library of Science
Publication date: 01/01/2012

An intriguing phenomenon in plant development is the timing and positioning of lateral organ initiation, which is a fundamental aspect of plant architecture. Although important progress has been made in elucidating the role of auxin transport in the vegetative shoot to explain the phyllotaxis of leaf formation in a spiral fashion, a model study of the role of auxin transport in whorled organ patterning in the expanding floral meristem is not yet available. We present an initial simulation approach to study the mechanisms that are expected to play an important role. The starting point is a confocal imaging study of Arabidopsis floral meristems at consecutive time points during flower development. These images reveal auxin accumulation patterns at the positions of the organs, which strongly suggests that the role of auxin in the floral meristem is similar to the role it plays in the shoot apical meristem. This is the basis for a simulation study of auxin transport through a growing floral meristem, which may answer the question of whether auxin transport can in itself be responsible for the typical whorled floral pattern. We combined a cellular growth model for the meristem with a polar auxin transport model. The model predicts that sepals are initiated by auxin maxima arising early during meristem outgrowth. These form a pre-pattern relative to which a series of smaller auxin maxima are positioned, which partially overlap with the anlagen of petals, stamens, and carpels. We adjusted the model parameters corresponding to properties of floral mutants and found that the model predictions agree with the observed mutant patterns. The predicted timing of the primordia outgrowth and the timing and positioning of the sepal primordia show remarkable similarities with a developing flower in nature.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Wageningen University & Research Publications

FigShare

Bayesian Markov Random Field Analysis for Protein Function Prediction Based on Network Data

Author: A Kuzniar
A Vazquez
Aalt D. J. van Dijk
AJ Enright
C Moler
Cajo J. F. ter Braak
CJF Ter Braak
CJF Ter Braak
CM Federovitch
DJC MacKay
GD Bader
GR Lanckriet
H Lee
I Kosmidis
I Ulitsky
Iddo Friedberg
IM Cheeseman
J Besag
JA Hanley
L Milligan
L Peña Castillo
M Ashburner
M Deng
M Deng
M Punta
Marco C. A. M. Bink
N Nariai
NJ Mulder
P McCullagh
R Sharan
RI Kondor
Roeland C. H. J. van Ham
S Ferré
S Geman
S Letovsky
S Mostafavi
SF Altschul
SR Collins
SZ Li
T Gabaldon
U Karaoz
V Vethantham
XL Chen
Y Chen
Y Guan
Yiannis A. I. Kourmpetis
Z Barutcuoglu
Z Wei
Publication venue: Public Library of Science
Publication date: 01/01/2010

Inference of protein functions is one of the most important aims of modern biology. To fully exploit the large volumes of genomic data typically produced in modern-day genomic experiments, automated computational methods for protein function prediction are urgently needed. Established methods use sequence or structure similarity to infer functions but those types of data do not suffice to determine the biological context in which proteins act. Current high-throughput biological experiments produce large amounts of data on the interactions between proteins. Such data can be used to infer interaction networks and to predict the biological process that the protein is involved in. Here, we develop a probabilistic approach for protein function prediction using network data, such as protein-protein interaction measurements. We take a Bayesian approach to an existing Markov Random Field method by performing simultaneous estimation of the model parameters and prediction of protein functions. We use an adaptive Markov Chain Monte Carlo algorithm that leads to more accurate parameter estimates and consequently to improved prediction performance compared to the standard Markov Random Fields method. We tested our method using a high quality S. cerevisiae validation network with 1622 proteins against 90 Gene Ontology terms of different levels of abstraction. Compared to three other protein function prediction methods, our approach shows very good prediction performance. Our method can be directly applied to protein-protein interaction or coexpression networks, but can also be extended to use multiple data sources. We apply our method to physical protein interaction data from S. cerevisiae and provide novel predictions, using 340 Gene Ontology terms, for 1170 unannotated proteins and we evaluate the predictions using the available literature.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Wageningen University & Research Publications

Continuous-time modeling of cell fate determination in Arabidopsis flowers

Background: The genetic control of floral organ specification is currently being investigated by various approaches, both experimentally and through modeling. Models and simulations have mostly involved boolean or related methods, and so far a quantitative, continuous-time approach has not been explored. Results: We propose an ordinary differential equation (ODE) model that describes the gene expression dynamics of a gene regulatory network that controls floral organ formation in the model plant Arabidopsis thaliana. In this model, the dimerization of MADS-box transcription factors is incorporated explicitly. The unknown parameters are estimated from (known) experimental expression data. The model is validated by simulation studies of known mutant plants. Conclusions: The proposed model gives realistic predictions with respect to independent mutation data. A simulation study is carried out to predict the effects of a new type of mutation that has so far not been made in Arabidopsis, but that could be used as a severe test of the validity of the model. According to our predictions, the role of dimers is surprisingly important. Moreover, the functional loss of any dimer leads to one or more phenotypic alterations.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Wageningen University & Research Publications

Conserved and variable correlated mutations in the plant MADS protein network

Author: A Bairoch
A Becker
A Fuchs
A Lupas
A Sali
AA Fodor
Aalt DJ van Dijk
AD Han
ADJ van Dijk
AH Paterson
AK Ramani
AS Veron
AT Brunger
BA Krizek
C Espinosa-soto
CM Buslje
CS Goh
CS Miller
D Altschuh
D Juan
DA Afonnikov
DS Horner
E Santelli
EA Merritt
F Fornara
F Pazos
F Pazos
F Pazos
G Angenent
GA Tuskan
H Ashkenazy
HB Fraser
HY Shan
HY Shan
HY Yu
I Halperin
J Lim
J Sundstrom
JD Thompson
JG Caporaso
JL Riechmann
JMG Izarzugaza
K Hill
K Huang
K Kaufmann
K Kaufmann
L Hakes
L Mendoza
L Parenicova
L Pellegrini
LC Martin
LJ Cseke
LP Martinez-Castilla
M Hassler
M Ng
M Socolich
MA Fares
MJ Buck
N Shitsukawa
NA Kane
NJ Mulder
O Noivirt
PJ Kraulis
PJ Waddell
R Melzer
R Ming
R Velasco
RC Edgar
RGH Immink
RKP Kuipers
RM Clark
Roeland CHJ van Ham
S Ciannamea
S De Bodt
S de Folter
S Henikoff
S Mika
SA Goff
SA Rensing
SAA Travers
SAA Travers
SR Eddy
T Hernandez-Hernandez
T Sato
Y Mo
YZ Yang
YZ Yang
Publication venue: BioMed Central
Publication date: 01/01/2010

Background: Plant MADS domain proteins are involved in a variety of developmental processes for which their ability to form various interactions is a key requisite. However, not much is known about the structure of these proteins or their complexes, whereas such knowledge would be valuable for a better understanding of their function. Here, we analyze those proteins and the complexes they form using a correlated mutation approach in combination with available structural, bioinformatics and experimental data. Results: Correlated mutations are affected by several types of noise, which is difficult to disentangle from the real signal. In our analysis of the MADS domain proteins, we apply for the first time a correlated mutation analysis to a family of interacting proteins. This provides a unique way to investigate the amount of signal that is present in correlated mutations because it allows direct comparison of mutations in various family members and assessing their conservation. We show that correlated mutations in general are conserved within the various family members, and if not, the variability at the respective positions is less in the proteins in which the correlated mutation does not occur. Also, intermolecular correlated mutation signals for interacting pairs of proteins display clear overlap with other bioinformatics data, which is not the case for non-interacting protein pairs, an observation which validates the intermolecular correlated mutations. Having validated the correlated mutation results, we apply them to infer the structural organization of the MADS domain proteins. Conclusion: Our analysis enables understanding of the structural organization of the MADS domain proteins, including support for predicted helices based on correlated mutation patterns, and evidence for a specific interaction site in those proteins.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Wageningen University & Research Publications

Sequence Motifs in MADS Transcription Factors Responsible for Specificity and Diversification of Protein-Protein Interaction

Author: A Becker
A Sali
A Shmygelska
Aalt D. J. van Dijk
AD Han
ADJ van Dijk
AH Paterson
AJM Walhout
B Causier
B Davies
BA Krizek
BA Shoemaker
BJ Adamczyk
C Landgraf
CH Yeang
Christos Ouzounis
CT Rollins
D Li
D Weigel
DH Erwin
DJ Reiss
E Akiva
E Ferraro
E Santelli
ES Coen
G Ditta
G Grigoryan
G Theissen
GD Amoutzias
Gerco C. Angenent
Giuseppa Morabito
H Ma
H Wang
HY Yu
IE Sanchez
J DeBartolo
JD Klemm
JL Riechmann
JL Riechmann
JM Skerker
JR Chen
K Kaufmann
K Kaufmann
KB Levin
KL Morrison
L Breiman
L Burger
L Parenicova
L Yant
M Egea-Cortines
M Ng
M Socolich
M Vandenbussche
M Weigt
MA Fares
Martijn Fiers
ME Cusick
NJ Marianayagam
O Keller
OJ Ratcliffe
P Bradley
R Arora
R Diaz-Uriate
R Favaro
R Ming
R Velasco
RB Jones
RC Edgar
RD Finn
RD Gietz
RGH Immink
RGH Immink
RGH Immink
RGH Immink
RGH Immink
Richard G. H. Immink
Roeland C. H. J. van Ham
S Ciannamea
S De Bodt
S De Bodt
S De Bodt
S de Folter
S Drea
S Ferrario
S Ferrario
S Mika
S Pelaz
SA Kempin
SH Tan
SJ Liljegren
SJ Nurrish
SR Eddy
T Honma
U Hartmann
WP Lehrach
WP Russ
X Daura
Y Hanzawa
Y Ofran
YZ Yang
YZ Yang
Z Schwarzsommer
Z Wunderlich
Publication venue: Public Library of Science
Publication date: 01/01/2010

Protein sequences encompass tertiary structures and contain information about specific molecular interactions, which in turn determine biological functions of proteins. Knowledge about how protein sequences define interaction specificity is largely missing, in particular for paralogous protein families with high sequence similarity, such as the plant MADS domain transcription factor family. In comparison to the situation in mammalian species, this important family of transcription regulators has expanded enormously in plant species and contains over 100 members in the model plant species Arabidopsis thaliana. Here, we provide insight into the mechanisms that determine protein-protein interaction specificity for the Arabidopsis MADS domain transcription factor family, using an integrated computational and experimental approach. Plant MADS proteins have highly similar amino acid sequences, but their dimerization patterns vary substantially. Our computational analysis uncovered small sequence regions that explain observed differences in dimerization patterns with reasonable accuracy. Furthermore, we show the usefulness of the method for prediction of MADS domain transcription factor interaction networks in other plant species. Introduction of mutations in the predicted interaction motifs demonstrated that single amino acid mutations can have a large effect and lead to loss or gain of specific interactions. In addition, the various bioinformatics analyses we performed shed light on the way evolution has shaped MADS domain transcription factor interaction specificity. Identified protein-protein interaction motifs appeared to be strongly conserved among orthologs, indicating their evolutionary importance. We also provide evidence that mutations in these motifs can be a source for sub- or neo-functionalization. The analyses presented here take us a step forward in understanding protein-protein interactions and the interplay between protein sequences and network evolution.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Wageningen University & Research Publications

Prioritization of candidate genes in QTL regions based on associations between traits and biological processes

Author: A Sifrim
A Subramanian
Aalt DJ van Dijk
AM Hancock
AN Egan
B Han
C Chen
C Herold
C Jung
C Zhang
D Bornigen
D Shriner
DJ Schaid
DM Goodstein
E Durand
E Fridman
F Fornara
F Supek
F Tian
Gabino F Sanchez-Perez
H Wuriyanghan
I Lee
J Chen
J Jin
J Li
J Ni
JA Fawcett
Jan-Peter Nap
JN Cobb
Joachim W Bargsten
JW Bargsten
JX Shan
K Wang
K Youens-Clark
K Zhao
KC Falke
L Hou
L Sun
LA Hindorff
M Ashburner
M Falda
M Fujisawa
M Ikeda
M Mutwil
M Rauf
N Atias
P Armengaud
P Gour
P Holmans
P Radivojac
P Wang
PY Chibon
R Breitling
R Monclus
RDC Team
RK Varshney
RS Meyer
S Atwell
S Grossmann
S Ouyang
S Wang
SC Chantha
SY Rhee
T Lenser
T Liu
W Qi
W Wu
X Bai
X Dai
X Gao
X Huang
X Huang
X Wei
X Zhang
Y Benjamini
Y Liu
Y Makita
Y Makita
Y Moreau
Y Su
Y Xiong
YA Kourmpetis
YA Kourmpetis
Z Gao
Z Milec
ZK Li
Publication venue: Springer Science and Business Media LLC

Crossref

An Expanded Evaluation of Protein Function Prediction Methods Shows an Improvement In Accuracy

Author: Almeida-e-Silva Danillo C.
Altenhoff Adrian
Babbitt Patricia C.
Bankapur Asma R.
Bargsten Joachim W.
Ben-Hur Asa
Benso Alfredo
Bhat Prajwal
BKC Dukka
Bonneau Richard
Brenner Steven E.
Bryson Kevin
Cao Renzhi
Casadio Rita
Cejuela Juan M.
Chapan Samuel
Chen Ching-Tai
Cheng Jianlin
Cibrian-Uhalte Elenia
Clark Wyatt T.
Cozzetto Domenico
D'Andrea Daniel
Das Sayoni
Dawson Natalie L.
del Pozo Angela
Denny Paul
Dessimoz Christophe
Di Carlo Stefano
Dogan Tunca
ElShal Sarah
Falda Marco
Fang Hai
Feng Shou
Fernández José M.
Ferrari Carlo
Fontana Paolo
Foulger Rebecca E.
Friedberg Iddo
Funk Christopher S.
Gabaldon Toni
Gemovic Branislava
Gillis Jesse
Ginter Filip
Giollo Manuel
Glisic Sanja
Goldberg Tatyana
Gong Qingtian
Gough Julian
Greene Casey S.
Hakala Kai
Hamp Tobias
Hieta Reija
Holm Liisa
Hsu Wen-Lian
Huntley Rachael P.
Jiang Yuxiang
Jones David T.
Kaewphan Suwisa
Kahanda Indika
Kansakar Lakesh
Khan Ishita K.
Kihara Daisuke
Koo Da Chen Emily
Koskinen Patrik
Lavezzo Enrico
Lee David
Lees Jonathan G.
Legge Duncan
Lepore Rosalba
Li Biao
Lin Alexandra
Linial Michal
Lovering Ruth C.
Magrane Michele
Maietta Paolo
Marcet-Houben Marina
Martelli Pier Luigi
Martin Maria J.
Mehryar Farrokh
Melidoni Anna N.
Mesiti Marco
Minneci Federico
Mooney Sean D.
Moreau Yves
Mutowo-Meullenet Prudence
Nepusz Tamás
Ning Wei
O'Donovan Claire
Oates Matt
Ofer Dan
Orengo Christine A.
Oron Tal Ronnen
Paccanaro Alberto
Pavlidis Paul
Penfold-Brown Duncan
Perovic Vladmir
Pichler Klemens
Piovesan Damiano
Politano Gianfranco
Profiti Giuseppe
Radivojac Predrag
Rappoport Nadav
Re Matteo
Rehman Hafeez Ur
Richter Lothar
Robinson Peter N.
Romero Alfonso E.
Rost Burkhard
Sahraeian Sayed M.E.
Salakoski Tapio
Salamov Asaf
Sasidharan Rajkumar
Savino Alessandro
Sedeño-Cortés Adriana E.
Sharan Malvika
Shasha Dennis
Shypitsyna Aleksandra
Skunca Nives
Smithers Ben
Stern Amos
Sternberg Michael J.E.
Sillitoe Ian
Supek Fran
Tian Weidong
Toppo Stefano
Tosatto Silvio C.E.
Tramontano Anna
Tranchevent Léon-Charles
Tress Michael L.
Törönen Petri
Valencia Alfonso
Valentini Giorgio
van Dijk Aalt D.J.
Veljkovic Nevena
Veljkovic Veljko
Vencio Ricardo Z.N.
Verspoor Karin M.
Vogel Jörg
Vucetic Slobodan
Wang Zheng
Wass Mark N.
Yang Haixuan
Youngs Noah
Zakeri Pooya
Zhang Shanshan
Zhong Zhaolong
Zhou Yuanpeng
Publication venue: The Aquila Digital Community
Publication date: 07/09/2016

Background: A major bottleneck in our understanding of the molecular underpinnings of life is the assignment of function to proteins. While molecular experiments provide the most reliable annotation of proteins, their relatively low throughput and restricted purview have led to an increasing role for computational function prediction. However, assessing methods for protein function prediction and tracking progress in the field remain challenging. Results: We conducted the second critical assessment of functional annotation (CAFA), a timed challenge to assess computational methods that automatically assign protein function. We evaluated 126 methods from 56 research groups for their ability to predict biological functions using Gene Ontology and gene-disease associations using Human Phenotype Ontology on a set of 3681 proteins from 18 species. CAFA2 featured expanded analysis compared with CAFA1, with regards to data set size, variety, and assessment metrics. To review progress in the field, the analysis compared the best methods from CAFA1 to those of CAFA2. Conclusions: The top-performing methods in CAFA2 outperformed those from CAFA1. This increased accuracy can be attributed to a combination of the growing number of experimental annotations and improved methods for function prediction. The assessment also revealed that the definition of top-performing algorithms is ontology specific, that different performance metrics can be used to probe the nature of accurate predictions, and the relative diversity of predictions in the biological process and human phenotype ontologies. While there was methodological improvement between CAFA1 and CAFA2, the interpretation of results and usefulness of individual methods remain context-dependent

Aquila Digital Community (University of Southern Mississippi, USM)