Search CORE

60 research outputs found

PRI-CAT: a web-tool for the analysis, storage and visualization of plant ChIP-seq experiments

Author: Aalt D. J. van Dijk
Buisine
Cairns
Cesaroni
Feuillet
Gentleman
Gibbons
Goecks
Ji
Jose M. Muiño
Kaufmann
Kaufmann
Kaufmann
Kozarewa
Lan
Li
Marlous Hoogstraat
Muiño
Nicol
Pepke
Quail
Roeland C. H. J. van Ham
Zacher
Zhang
Zhang
Publication venue: Oxford University Press
Publication date: 01/01/2011
Field of study

Although several tools for the analysis of ChIP-seq data have been published recently, there is a growing demand, in particular in the plant research community, for computational resources with which such data can be processed, analyzed, stored, visualized and integrated within a single, user-friendly environment. To accommodate this demand, we have developed PRI-CAT (Plant Research International ChIP-seq analysis tool), a web-based workflow tool for the management and analysis of ChIP-seq experiments. PRI-CAT is currently focused on Arabidopsis, but will be extended with other plant species in the near future. Users can directly submit their sequencing data to PRI-CAT for automated analysis. A QuickLoad server compatible with genome browsers is implemented for the storage and visualization of DNA-binding maps. Submitted datasets and results can be made publicly available through PRI-CAT, a feature that will enable community-based integrative analysis and visualization of ChIP-seq experiments. Secondary analysis of data can be performed with the aid of GALAXY, an external framework for tool and data integration. PRI-CAT is freely available at http://www.ab.wur.nl/pricat. No login is required

Crossref

PubMed Central

Wageningen University & Research Publications

Sequencing the Potato Genome: Outline and First Results to Come from the Elucidation of the Sequence of the World’s Third Most Important Food Crop

Author: Boris Kuznetsov
Boris Sagredo
Christian W. B. Bachem
Dan Milbourne
Gisella Orjeda
Glenn J. Bryan
Jan M. de Boer
Jeanne M. E. Jacobs
Paulo E. de Melo
Richard G. F. Visser
Robert Gromadka
Roeland C. H. J. van Ham
Sanwen Huang
Sergio Feingold
Swarup K. Chakrabati
Xiaomin Tang
Publication venue: Springer Nature
Publication date: 01/01/2009
Field of study

Potato is a member of the Solanaceae, a plant family that includes several other economically important species, such as tomato, eggplant, petunia, tobacco and pepper. The Potato Genome Sequencing Consortium (PGSC) aims to elucidate the complete genome sequence of potato, the third most important food crop in the world. The PGSC is a collaboration between 13 research groups from China, India, Poland, Russia, the Netherlands, Ireland, Argentina, Brazil, Chile, Peru, USA, New Zealand and the UK. The potato genome consists of 12 chromosomes and has a (haploid) length of approximately 840 million base pairs, making it a medium-sized plant genome. The sequencing project builds on a diploid potato genomic bacterial artificial chromosome (BAC) clone library of 78000 clones, which has been fingerprinted and aligned into ~7000 physical map contigs. In addition, the BAC-ends have been sequenced and are publicly available. Approximately 30000 BACs are anchored to the Ultra High Density genetic map of potato, composed of 10000 unique AFLPTM markers. From this integrated genetic-physical map, between 50 to 150 seed BACs have currently been identified for every chromosome. Fluorescent in situ hybridization experiments on selected BAC clones confirm these anchor points. The seed clones provide the starting point for a BAC-by-BAC sequencing strategy. This strategy is being complemented by whole genome shotgun sequencing approaches using both 454 GS FLX and Illumina GA2 instruments. Assembly and annotation of the sequence data will be performed using publicly available and tailor-made tools. The availability of the annotated data will help to characterize germplasm collections based on allelic variance and to assist potato breeders to more fully exploit the genetic potential of potat

Springer - Publisher Connector

Unsupervised protein embeddings outperform hand-crafted sequence and structure features at predicting molecular function

Author: Alley
Altschul
Amelia Villegas-Morcillo
Anfinsen
Angel M Gomez
Arne Elofsson
Ashburner
Bartoli
Bepler
Berman
Bonetta
Cao
Cheng
Clark
Cozzetto
Devlin
Doersch
Duarte
Eddy
Fa
Fout
Fu
Gidaris
Gligorijevic
Heinzinger
Jiang
Jones
Kabsch
Kane
Kimura
Kingma
Kipf
Kulmanov
Kulmanov
Liu
Liu
Lyons
Marcel J T Reinders
Mathis
McCann
Pesquita
Peters
Radivojac
Rao
Rives
Roeland C H J van Ham
Srivastava
Stavros Makrodimitris
Sureyya Rifaioglu
Victoria Sanchez
Wang
Weinhold
Wilson
Zamora-Resendiz
Zheng
Zhou
Zhu
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2020
Field of study

This work was supported by Keygene N.V., a crop innovation company in the Netherlands and by the Spanish MINECO/FEDER Project TEC201680141-P with the associated FPI grant BES-2017-079792.The authors thank Dr. Elvin Isufi and Chirag Raman for their valuable comments and feedback.Motivation: Protein function prediction is a difficult bioinformatics problem. Many recent methods use deep neural networks to learn complex sequence representations and predict function from these. Deep supervised models require a lot of labeled training data which are not available for this task. However, a very large amount of protein sequences without functional labels is available. Results: We applied an existing deep sequence model that had been pretrained in an unsupervised setting on the supervised task of protein molecular function prediction. We found that this complex feature representation is effective for this task, outperforming hand-crafted features such as one-hot encoding of amino acids, k-mer counts, secondary structure and backbone angles. Also, it partly negates the need for complex prediction models, as a two-layer perceptron was enough to achieve competitive performance in the third Critical Assessment of Functional Annotation benchmark. We also show that combining this sequence representation with protein 3D structure information does not lead to performance improvement, hinting that 3D structure is also potentially learned during the unsupervised pretraining.Keygene N.V., a crop innovation company in the NetherlandsSpanish MINECO/FEDER TEC201680141-PFPI grant BES-2017-07979

Crossref

TU Delft Repository

Repositorio Institucional Universidad de Granada

Bayesian Markov Random Field Analysis for Protein Function Prediction Based on Network Data

Author: A Kuzniar
A Vazquez
Aalt D. J. van Dijk
AJ Enright
C Moler
Cajo J. F. ter Braak
CJF Ter Braak
CJF Ter Braak
CM Federovitch
DJC MacKay
GD Bader
GR Lanckriet
H Lee
I Kosmidis
I Ulitsky
Iddo Friedberg
IM Cheeseman
J Besag
JA Hanley
L Milligan
L Peña Castillo
M Ashburner
M Deng
M Deng
M Punta
Marco C. A. M. Bink
N Nariai
NJ Mulder
P McCullagh
R Sharan
RI Kondor
Roeland C. H. J. van Ham
S Ferré
S Geman
S Letovsky
S Mostafavi
SF Altschul
SR Collins
SZ Li
T Gabaldon
U Karaoz
V Vethantham
XL Chen
Y Chen
Y Guan
Yiannis A. I. Kourmpetis
Z Barutcuoglu
Z Wei
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Inference of protein functions is one of the most important aims of modern biology. To fully exploit the large volumes of genomic data typically produced in modern-day genomic experiments, automated computational methods for protein function prediction are urgently needed. Established methods use sequence or structure similarity to infer functions but those types of data do not suffice to determine the biological context in which proteins act. Current high-throughput biological experiments produce large amounts of data on the interactions between proteins. Such data can be used to infer interaction networks and to predict the biological process that the protein is involved in. Here, we develop a probabilistic approach for protein function prediction using network data, such as protein-protein interaction measurements. We take a Bayesian approach to an existing Markov Random Field method by performing simultaneous estimation of the model parameters and prediction of protein functions. We use an adaptive Markov Chain Monte Carlo algorithm that leads to more accurate parameter estimates and consequently to improved prediction performance compared to the standard Markov Random Fields method. We tested our method using a high quality S.cereviciae validation network with 1622 proteins against 90 Gene Ontology terms of different levels of abstraction. Compared to three other protein function prediction methods, our approach shows very good prediction performance. Our method can be directly applied to protein-protein interaction or coexpression networks, but also can be extended to use multiple data sources. We apply our method to physical protein interaction data from S. cerevisiae and provide novel predictions, using 340 Gene Ontology terms, for 1170 unannotated proteins and we evaluate the predictions using the available literature

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Wageningen University & Research Publications

Accurate mass error correction in liquid chromatography time-of-flight mass spectrometry based metabolomics

Author: A. Aharoni
A. Makarov
A. Saghatelian
C. A. Smith
C. Eckers
Chris Maliepaard
D. J. Kliebenstein
E. M. Thurman
E. Roepenack-Lahaye von
H. A. Verhoeven
H. C. Kofeler
H. Idborg
H. K. Lim
Harrie A. Verhoeven
I. D. Wilson
I. V. Chernushevich
J. J. B. Keurentjes
J. V. Olsen
K. Clauwaert
M. Colombo
M. Katajamaa
M. Reichelt
O. Vorst
Oscar Vorst
R. C. H. Vos De
R. J. Bino
Ric C. H. de Vos
Robert D. Hall
Roeland C. H. J. van Ham
S. J. Murch
S. M. Peterman
S. Moco
S. Moco
T. Cajka
Velitchka V. Mihaleva
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

The Genomes of the Fungal Plant Pathogens Cladosporium fulvum and Dothistroma septosporum Reveal Adaptation to Different Hosts and Lifestyles But Also Signatures of Common Ancestry.

We sequenced and compared the genomes of the Dothideomycete fungal plant pathogensCladosporium fulvum (Cfu) (syn. Passalora fulva) and Dothistroma septosporum (Dse) that are closely related phylogenetically, but have different lifestyles and hosts. Although both fungi grow extracellularly in close contact with host mesophyll cells, Cfu is a biotroph infecting tomato, while Dse is a hemibiotroph infecting pine. The genomes of these fungi have a similar set of genes (70% of gene content in both genomes are homologs), but differ significantly in size (Cfu \u3e61.1-Mb; Dse 31.2-Mb), which is mainly due to the difference in repeat content (47.2% in Cfu versus 3.2% in Dse). Recent adaptation to different lifestyles and hosts is suggested by diverged sets of genes. Cfu contains an α-tomatinase gene that we predict might be required for detoxification of tomatine, while this gene is absent in Dse. Many genes encoding secreted proteins are unique to each species and the repeat-rich areas in Cfu are enriched for these species-specific genes. In contrast, conserved genes suggest common host ancestry. Homologs of Cfu effector genes, including Ecp2 and Avr4, are present in Dse and induce a Cf-Ecp2- and Cf-4-mediated hypersensitive response, respectively. Strikingly, genes involved in production of the toxin dothistromin, a likely virulence factor for Dse, are conserved in Cfu, but their expression differs markedly with essentially no expression by Cfu in planta. Likewise, Cfu has a carbohydrate-degrading enzyme catalog that is more similar to that of necrotrophs or hemibiotrophs and a larger pectinolytic gene arsenal than Dse, but many of these genes are not expressed in planta or are pseudogenized. Overall, comparison of their genomes suggests that these closely related plant pathogens had a common ancestral host but since adapted to different hosts and lifestyles by a combination of differentiated gene content, pseudogenization, and gene regulation

Crossref

HAL AMU

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Purdue E-Pubs

ProdInra

Sequencing the Potato Genome: Outline and First Results to Come from the Elucidation of the Sequence of the World’s Third Most Important Food Crop

Author: A Ballvora
AJ Haverkort
Boris Kuznetsov
Boris Sagredo
C Bachem
C Gebhardt
C Soderlund
C Soderlund
Christian W. B. Bachem
CM Menendez
CM Ronning
Dan Milbourne
DM Spooner
DM Spooner
E Vossen van der
EA Vossen van der
EA Vossen van der
Gisella Orjeda
Glenn J. Bryan
H Kuang
H Os van
H Uhrig
HJ Eck van
HY Kim-Lee
I Hein
II Fock
J Paal
J Song
Jan M. de Boer
JE Bradshaw
Jeanne M. E. Jacobs
JY Song
L Li
M Iovene
MW Fiers
Paulo E. de Melo
RE Veilleux
Richard G. F. Visser
Robert Gromadka
Roeland C. H. J. van Ham
S Huang
SA Peters
Sanwen Huang
Sergio Feingold
Swarup K. Chakrabati
TJ Borm
TP Jesse
W Zhu
WA Rensink
X Tang
Xiaomin Tang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Sequence Motifs in MADS Transcription Factors Responsible for Specificity and Diversification of Protein-Protein Interaction

Author: A Becker
A Sali
A Shmygelska
Aalt D. J. van Dijk
AD Han
ADJ van Dijk
AH Paterson
AJM Walhout
B Causier
B Davies
BA Krizek
BA Shoemaker
BJ Adamczyk
C Landgraf
CH Yeang
Christos Ouzounis
CT Rollins
D Li
D Weigel
DH Erwin
DJ Reiss
E Akiva
E Ferraro
E Santelli
ES Coen
G Ditta
G Grigoryan
G Theissen
GD Amoutzias
Gerco C. Angenent
Giuseppa Morabito
H Ma
H Wang
HY Yu
IE Sanchez
J DeBartolo
JD Klemm
JL Riechmann
JL Riechmann
JM Skerker
JR Chen
K Kaufmann
K Kaufmann
KB Levin
KL Morrison
L Breiman
L Burger
L Parenicova
L Yant
M Egea-Cortines
M Ng
M Socolich
M Vandenbussche
M Weigt
MA Fares
Martijn Fiers
ME Cusick
NJ Marianayagam
O Keller
OJ Ratcliffe
P Bradley
R Arora
R Diaz-Uriate
R Favaro
R Ming
R Velasco
RB Jones
RC Edgar
RD Finn
RD Gietz
RGH Immink
RGH Immink
RGH Immink
RGH Immink
RGH Immink
Richard G. H. Immink
Roeland C. H. J. van Ham
S Ciannamea
S De Bodt
S De Bodt
S De Bodt
S de Folter
S Drea
S Ferrario
S Ferrario
S Mika
S Pelaz
SA Kempin
SH Tan
SJ Liljegren
SJ Nurrish
SR Eddy
T Honma
U Hartmann
WP Lehrach
WP Russ
X Daura
Y Hanzawa
Y Ofran
YZ Yang
YZ Yang
Z Schwarzsommer
Z Wunderlich
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Protein sequences encompass tertiary structures and contain information about specific molecular interactions, which in turn determine biological functions of proteins. Knowledge about how protein sequences define interaction specificity is largely missing, in particular for paralogous protein families with high sequence similarity, such as the plant MADS domain transcription factor family. In comparison to the situation in mammalian species, this important family of transcription regulators has expanded enormously in plant species and contains over 100 members in the model plant species Arabidopsis thaliana. Here, we provide insight into the mechanisms that determine protein-protein interaction specificity for the Arabidopsis MADS domain transcription factor family, using an integrated computational and experimental approach. Plant MADS proteins have highly similar amino acid sequences, but their dimerization patterns vary substantially. Our computational analysis uncovered small sequence regions that explain observed differences in dimerization patterns with reasonable accuracy. Furthermore, we show the usefulness of the method for prediction of MADS domain transcription factor interaction networks in other plant species. Introduction of mutations in the predicted interaction motifs demonstrated that single amino acid mutations can have a large effect and lead to loss or gain of specific interactions. In addition, various performed bioinformatics analyses shed light on the way evolution has shaped MADS domain transcription factor interaction specificity. Identified protein-protein interaction motifs appeared to be strongly conserved among orthologs, indicating their evolutionary importance. We also provide evidence that mutations in these motifs can be a source for sub- or neo-functionalization. The analyses presented here take us a step forward in understanding protein-protein interactions and the interplay between protein sequences and network evolution

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Wageningen University & Research Publications

A genome-wide genetic map of NB-LRR disease resistance loci in potato

Author: A Ballvora
A Ballvora
A Barone
A Bendahmane
A Bendahmane
A Finkers-Tomczak
A Kohler
A Konieczny
AA Lokossou
AF Bent
Aska Goverse
B Caromel
B Caromel
B Flis
BC Meyers
BC Meyers
C Feuillet
C Gebhardt
C Leonards-Schippers
C Moloney
CG Linden van der
D Leister
DB Goldstein
E Bakker
E Isidore
E Ritter
E Sinapidou
E Vossen van der
E Vossen van der
EAG Vossen van der
Edwin van der Vossen
Erin Bakker
F Celebi-Toprak
F Lin
F Zhou
FC Lanfermeijer
G Segal
GB Martin
Geert Smant
Gerard van der Linden
Gerda Uenk
GJ Lawrence
GT Bryan
H Os van
Herman van Eck
HH Flor
HH Kuang
I Kaloshian
J Bai
J Paal
J Rouppe van der Voort
J Sambrook
Jaap Bakker
Jack Vossen
Jan de Boer
JE Bradshaw
JE Parker
JG Ellis
JH Hamalainen
JM McDowell
JM Salmeron
JM Salmeron
JR Peart
K-T Sekine
KA Bakari
L Deslandes
L Huang
LA Mueller
LK McHale
M Mazourek
M Mindrinos
M Sato
MA Botella
Mariëlle Muskens
Marjon Arens
MB Cooley
MH Borhan
MI Spassova
MR Grant
MR Stevens
MW Bonierbale
MW Ganal
N Collins
N Goto
N Ori
N Yahiaoui
P Vos
PA Anderson
PD Bittner-Eddy
Pjotr Prins
PN Dodds
PN Dodds
R Chen
R Hehl
RC Grube
Rene Klein-Lankhorst
RF Warren
RGF Visser
Richard Visser
Roeland van Ham
RW Michelmore
S Cloutier
S Schornack
S Vidal
S Whitham
S Yang
S Yoshimura
SB Milligan
SF Altschul
SHEJ Gabriëls
SW Huang
SW Huang
T Ashfield
T Zhou
TH Park
TH Park
TH Tai
Theo Borm
TJ Vision
TJA Borm
TM Fulton
W Gassmann
W Gish
W Marczewski
W Marczewski
W Marczewski
WIL Tameling
X Li
XQ Liu
Y Shirano
YS Song
ZX Wang
Publication venue: Springer-Verlag
Publication date: 01/01/2011
Field of study

Like all plants, potato has evolved a surveillance system consisting of a large array of genes encoding for immune receptors that confer resistance to pathogens and pests. The majority of these so-called resistance or R proteins belong to the super-family that harbour a nucleotide binding and a leucine-rich-repeat domain (NB-LRR). Here, sequence information of the conserved NB domain was used to investigate the genome-wide genetic distribution of the NB-LRR resistance gene loci in potato. We analysed the sequences of 288 unique BAC clones selected using filter hybridisation screening of a BAC library of the diploid potato clone RH89-039-16 (S. tuberosum ssp. tuberosum) and a physical map of this BAC library. This resulted in the identification of 738 partial and full-length NB-LRR sequences. Based on homology of these sequences with known resistance genes, 280 and 448 sequences were classified as TIR-NB-LRR (TNL) and CC-NB-LRR (CNL) sequences, respectively. Genetic mapping revealed the presence of 15 TNL and 32 CNL loci. Thirty-six are novel, while three TNL loci and eight CNL loci are syntenic with previously identified functional resistance genes. The genetic map was complemented with 68 universal CAPS markers and 82 disease resistance trait loci described in literature, providing an excellent template for genetic studies and applied research in potato

Crossref

PubMed Central

Wageningen University & Research Publications

Conserved and variable correlated mutations in the plant MADS protein network

Author: A Bairoch
A Becker
A Fuchs
A Lupas
A Sali
AA Fodor
Aalt DJ van Dijk
AD Han
ADJ van Dijk
AH Paterson
AK Ramani
AS Veron
AT Brunger
BA Krizek
C Espinosa-soto
CM Buslje
CS Goh
CS Miller
D Altschuh
D Juan
DA Afonnikov
DS Horner
E Santelli
EA Merritt
F Fornara
F Pazos
F Pazos
F Pazos
G Angenent
GA Tuskan
H Ashkenazy
HB Fraser
HY Shan
HY Shan
HY Yu
I Halperin
J Lim
J Sundstrom
JD Thompson
JG Caporaso
JL Riechmann
JMG Izarzugaza
K Hill
K Huang
K Kaufmann
K Kaufmann
L Hakes
L Mendoza
L Parenicova
L Pellegrini
LC Martin
LJ Cseke
LP Martinez-Castilla
M Hassler
M Ng
M Socolich
MA Fares
MJ Buck
N Shitsukawa
NA Kane
NJ Mulder
O Noivirt
PJ Kraulis
PJ Waddell
R Melzer
R Ming
R Velasco
RC Edgar
RGH Immink
RKP Kuipers
RM Clark
Roeland CHJ van Ham
S Ciannamea
S De Bodt
S de Folter
S Henikoff
S Mika
SA Goff
SA Rensing
SAA Travers
SAA Travers
SR Eddy
T Hernandez-Hernandez
T Sato
Y Mo
YZ Yang
YZ Yang
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Plant MADS domain proteins are involved in a variety of developmental processes for which their ability to form various interactions is a key requisite. However, not much is known about the structure of these proteins or their complexes, whereas such knowledge would be valuable for a better understanding of their function. Here, we analyze those proteins and the complexes they form using a correlated mutation approach in combination with available structural, bioinformatics and experimental data. Results Correlated mutations are affected by several types of noise, which is difficult to disentangle from the real signal. In our analysis of the MADS domain proteins, we apply for the first time a correlated mutation analysis to a family of interacting proteins. This provides a unique way to investigate the amount of signal that is present in correlated mutations because it allows direct comparison of mutations in various family members and assessing their conservation. We show that correlated mutations in general are conserved within the various family members, and if not, the variability at the respective positions is less in the proteins in which the correlated mutation does not occur. Also, intermolecular correlated mutation signals for interacting pairs of proteins display clear overlap with other bioinformatics data, which is not the case for non-interacting protein pairs, an observation which validates the intermolecular correlated mutations. Having validated the correlated mutation results, we apply them to infer the structural organization of the MADS domain proteins. Conclusion Our analysis enables understanding of the structural organization of the MADS domain proteins, including support for predicted helices based on correlated mutation patterns, and evidence for a specific interaction site in those proteins.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Wageningen University & Research Publications