90 research outputs found
SLIDER: Mining correlated motifs in protein-protein interaction networks
Abstract—Correlated motif mining (CMM) is the problem to find overrepresented pairs of patterns, called motif pairs, in interacting protein sequences. Algorithmic solutions for CMM thereby provide a computational method for predicting binding sites for protein interaction. In this paper, we adopt a motif-driven approach where the support of candidate motif pairs is evaluated in the network. We experimentally establish the superiority of the Chi-square-based support measure over other support measures. Furthermore, we obtain that CMM is an NP-hard problem for a large class of support measures (including Chi-square) and reformulate the search for correlated motifs as a combinatorial optimization problem. We then present the method SLIDER which uses local search with a neighborhood function based on sliding motifs and employs the Chi-square-based support measure. We show that SLIDER outperforms existing motif-driven CMM methods and scales to large protein-protein interaction networks
Співвідношення категорій природного та позитивного права: сучасні погляди та концепції
В статье проведен анализ позитивных и негативных сторон естественно-правового и позитивного правопонимания и осуществлен поиск идеальной системы, объединяющий положительные моменты обеих концепций.In the article the analysis of positive and negative sides of absolute law and positive rights understanding is conducted and the search of the ideal system, uniting the positive moments of both conceptions, is carried out
Predicting the Impact of Alternative Splicing on Plant MADS Domain Protein Function
Several genome-wide studies demonstrated that alternative splicing (AS) significantly increases the transcriptome complexity in plants. However, the impact of AS on the functional diversity of proteins is difficult to assess using genome-wide approaches. The availability of detailed sequence annotations for specific genes and gene families allows for a more detailed assessment of the potential effect of AS on their function. One example is the plant MADS-domain transcription factor family, members of which interact to form protein complexes that function in transcription regulation. Here, we perform an in silico analysis of the potential impact of AS on the protein-protein interaction capabilities of MIKC-type MADS-domain proteins. We first confirmed the expression of transcript isoforms resulting from predicted AS events. Expressed transcript isoforms were considered functional if they were likely to be translated and if their corresponding AS events either had an effect on predicted dimerisation motifs or occurred in regions known to be involved in multimeric complex formation, or otherwise, if their effect was conserved in different species. Nine out of twelve MIKC MADS-box genes predicted to produce multiple protein isoforms harbored putative functional AS events according to those criteria. AS events with conserved effects were only found at the borders of or within the K-box domain. We illustrate how AS can contribute to the evolution of interaction networks through an example of selective inclusion of a recently evolved interaction motif in the MADS AFFECTING FLOWERING1-3 (MAF1–3) subclade. Furthermore, we demonstrate the potential effect of an AS event in SHORT VEGETATIVE PHASE (SVP), resulting in the deletion of a short sequence stretch including a predicted interaction motif, by overexpression of the fully spliced and the alternatively spliced SVP transcripts. For most of the AS events we were able to formulate hypotheses about the potential impact on the interaction capabilities of the encoded MIKC protein
Gene Ontology consistent protein function prediction: the FALCON algorithm applied to six eukaryotic genomes
Gene Ontology (GO) is a hierarchical vocabulary for the description of biological functions and locations, often employed by computational methods for protein function prediction. Due to the structure of GO, function predictions can be self- contradictory. For example, a protein may be predicted to belong to a detailed functional class, but not in a broader class that, due to the vocabulary structure, includes the predicted one.We present a novel discrete optimization algorithm called Functional Annotation with Labeling CONsistency (FALCON) that resolves such contradictions. The GO is modeled as a discrete Bayesian Network. For any given input of GO term membership probabilities, the algorithm returns the most probable GO term assignments that are in accordance with the Gene Ontology structure. The optimization is done using the Differential Evolution algorithm. Performance is evaluated on simulated and also real data from Arabidopsis thaliana showing improvement compared to related approaches. We finally applied the FALCON algorithm to obtain genome-wide function predictions for six eukaryotic species based on data provided by the CAFA (Critical Assessment of Function Annotation) projec
Interactome-Wide Prediction of Protein-Protein Binding Sites Reveals Effects of Protein Sequence Variation in Arabidopsis thaliana
The specificity of protein-protein interactions is encoded in those parts of the sequence that compose the binding interface. Therefore, understanding how changes in protein sequence influence interaction specificity, and possibly the phenotype, requires knowing the location of binding sites in those sequences. However, large-scale detection of protein interfaces remains a challenge. Here, we present a sequence- and interactome-based approach to mine interaction motifs from the recently published Arabidopsis thaliana interactome. The resultant proteome-wide predictions are available via www.ab.wur.nl/sliderbio and set the stage for further investigations of protein-protein binding sites. To assess our method, we first show that, by using a priori information calculated from protein sequences, such as evolutionary conservation and residue surface accessibility, we improve the performance of interface prediction compared to using only interactome data. Next, we present evidence for the functional importance of the predicted sites, which are under stronger selective pressure than the rest of protein sequence. We also observe a tendency for compensatory mutations in the binding sites of interacting proteins. Subsequently, we interrogated the interactome data to formulate testable hypotheses for the molecular mechanisms underlying effects of protein sequence mutations. Examples include proteins relevant for various developmental processes. Finally, we observed, by analysing pairs of paralogs, a correlation between functional divergence and sequence divergence in interaction sites. This analysis suggests that large-scale prediction of binding sites can cast light on evolutionary processes that shape protein-protein interaction network
Simulation of organ patterning on the floral meristem using a polar auxin transport model
An intriguing phenomenon in plant development is the timing and positioning of lateral organ initiation,
which is a fundamental aspect of plant architecture. Although important progress has been made in
elucidating the role of auxin transport in the vegetative shoot to explain the phyllotaxis of leaf formation
in a spiral fashion, a model that explains the whorled organ patterning in the expanding floral meristem
is not available yet. We present an initial simulation approach to study the mechanisms that are expected
to play an important role. Starting point is a confocal imaging study of Arabidopsis floral meristems
at consecutive time points during flower development. These images reveal auxin accumulation patterns
at the positions of the organs, which strongly suggests that the role of auxin in the floral meristem is
similar to the role it plays in the shoot apical meristem. This is the basis for a simulation study of auxin
transport through a growing floral meristem, which may answer the question whether auxin transport can
in itself be responsible for the typical whorled floral pattern. We combined a cellular growth model for
the meristem with a polar auxin transport model. The model predicts that sepals are initiated by auxin maxima arising early during meristem outgrowth. These form a pre-pattern relative to which a series of
smaller auxin maxima are positioned, which partially overlap with the anlagen of petals, stamens, and
carpels. We adjusted the model parameters corresponding to properties of floral mutants and found that
the model predictions agree with the observed mutant patterns. The predicted timing of the primordia
outgrowth and the timing and positioning of the sepal primordia show remarkable similarities with a
developing flower in nature
Inferring the Gene Network Underlying the Branching of Tomato Inflorescence
The architecture of tomato inflorescence strongly affects flower production and subsequent crop yield. To understand the genetic activities involved, insight into the underlying network of genes that initiate and control the sympodial growth in the tomato is essential. In this paper, we show how the structure of this network can be derived from available data of the expressions of the involved genes. Our approach starts from employing biological expert knowledge to select the most probable gene candidates behind branching behavior. To find how these genes interact, we develop a stepwise procedure for computational inference of the network structure. Our data consists of expression levels from primary shoot meristems, measured at different developmental stages on three different genotypes of tomato. With the network inferred by our algorithm, we can explain the dynamics corresponding to all three genotypes simultaneously, despite their apparent dissimilarities. We also correctly predict the chronological order of expression peaks for the main hubs in the network. Based on the inferred network, using optimal experimental design criteria, we are able to suggest an informative set of experiments for further investigation of the mechanisms underlying branching behavio
Novel routes towards bioplastics from plants: elucidation of the methylperillate biosynthesis pathway from <i>Salvia dorisiana </i>trichomes
Plants produce a large variety of highly functionalized terpenoids. Functional groups such as partially unsaturated rings and carboxyl groups provide handles to use these compounds as feedstock for biobased commodity chemicals. For instance, methylperillate, a monoterpenoid found in Salvia dorisiana, may be used for this purpose, as it carries both an unsaturated ring and a methylated carboxyl group. The biosynthetic pathway of methylperillate in plants is still unclear. In this work, we identified glandular trichomes from S. dorisiana as the location of biosynthesis and storage of methylperillate. mRNA from purified trichomes was used to identify four genes that can encode the pathway from geranyl diphosphate towards methylperillate. This pathway includes a (–)-limonene synthase (SdLS), a limonene 7-hydroxylase (SdL7H, CYP71A76), and a perillyl alcohol dehydrogenase (SdPOHDH). We also identified a terpene acid methyltransferase, perillic acid O-methyltransferase (SdPAOMT), with homology to salicylic acid OMTs. Transient expression in Nicotiana benthamiana of these four genes, in combination with a geranyl diphosphate synthase to boost precursor formation, resulted in production of methylperillate. This demonstrates the potential of these enzymes for metabolic engineering of a feedstock for biobased commodity chemicals
- …