Search CORE

2,581 research outputs found

SLIDER: Mining correlated motifs in protein-protein interaction networks

Author: Boyen P.
Dijk A.D.J., van
Ham R.C.H.J., van
Neven F.
Publication venue
Publication date: 01/01/2009
Field of study

Abstract—Correlated motif mining (CMM) is the problem to find overrepresented pairs of patterns, called motif pairs, in interacting protein sequences. Algorithmic solutions for CMM thereby provide a computational method for predicting binding sites for protein interaction. In this paper, we adopt a motif-driven approach where the support of candidate motif pairs is evaluated in the network. We experimentally establish the superiority of the Chi-square-based support measure over other support measures. Furthermore, we obtain that CMM is an NP-hard problem for a large class of support measures (including Chi-square) and reformulate the search for correlated motifs as a combinatorial optimization problem. We then present the method SLIDER which uses local search with a neighborhood function based on sliding motifs and employs the Chi-square-based support measure. We show that SLIDER outperforms existing motif-driven CMM methods and scales to large protein-protein interaction networks

Wageningen University & Research Publications

Algorithms for the analysis of molecular sequences

Author: Vayani Fatima
Publication venue
Publication date: 01/12/2019
Field of study

King's Research Portal

Signal search analysis server

Author: Ambrosini Giovanna
Bucher Philipp
Jagannathan Vidhya
Praz Viviane
Publication venue
Publication date: 02/08/2017
Field of study

Signal search analysis is a general method to discover and characterize sequence motifs that are positionally correlated with a functional site (e.g. a transcription or translation start site). The method has played an instrumental role in the analysis of eukaryotic promoter elements. The signal search analysis server provides access to four different computer programs as well as to a large number of precompiled functional site collections. The programs offered allow: (i) the identification of non-random sequence regions under evolutionary constraint; (ii) the detection of consensus sequence-based motifs that are over- or under-represented at a particular distance from a functional site; (iii) the analysis of the positional distribution of a consensus sequence- or weight matrix-based sequence motif around a functional site; and (iv) the optimization of a weight matrix description of a locally over-represented sequence motif. These programs can be accessed at: http://www.isrec.isb-sib.ch/ss

RERO DOC Digital Library

Mining approximate motifs in time series

Author: Azevedo Paulo J.
Brito Rui M. M.
Ferreira Pedro Gabriel
Silva Cândida G.
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2006
Field of study

The problem of discovering previously unknown frequent patterns in time series, also called motifs, has been recently introduced. A motif is a subseries pattern that appears a significant number of times. Results demonstrate that motifs may provide valuable insights about the data and have a wide range of applications in data mining tasks. The main motivation for this study was the need to mine time series data from protein folding/unfolding simulations. We propose an algorithm that extracts approximate motifs, i.e. motifs that capture portions of time series with a similar and eventually symmetric behaviour. Preliminary results on the analysis of protein unfolding data support this proposal as a valuable tool. Additional experiments demonstrate that the application of utility of our algorithm is not limited to this particular problem. Rather it can be an interesting tool to be applied in many real world problems.Fundação para a Ciência e a Tecnologia (FCT).Fundo Europeu de Desenvolvimento Regional (FEDER) - POCTI/BME/49583/2002; SFRH/BD/13462/2003; SFRH/BD/16888/2004

Universidade do Minho: RepositoriUM