3,151 research outputs found

Characterization of complex networks: A survey of measurements

Author: Altaf-Ul-Amin M
Anderberg MR
Arenas A
Baker WE
Baker WE
Baldi P
Bar-Yam Y
Barabási A-L
Barabási A-L
Batagelj V
Ben-Naim E
Benkler Y
Boccara N
Boguñá M
Bollobás B
Bollobás B
Bornholdt S
Brillouin L
Buchanan M
Bunde A
Bunde A
Carrington PJ
Castells M
Codenotti B
Costa L DA F
Csermely P
Danon L
Dawson Ross
di Bernardo M
Diestel R
Dodge M
Dodge M
Dorogovtsev SN
Duda RO
Edwards AL
Erdős P
Erdős P
F. A. Rodrigues
Fiedler M
Freeman LC
Fukunaga K
G. Travieso
Garrido PL
Hair JF
Hayes B
Hayes B
Huberman BA
Jain AK
Johnson RA
Kochen M
L. da F. Costa
McLachlan GJ
McNeill RR
Mehta ML
Messner D
Milgram S
Monasson R
Monge PR
Newman MEJ
Newman MEJ
Newman MEJ
P. R. Villas Boas
Pastor-Satorras R
Reichl LE
Reif F
Romesburg HC
Schlosser G
Scott JP
Shannon CE
Stauffer D
Stoyan D
Strogatz S
Tyler JR
Wasserman S
Watts DJ
Watts DJ
West DB
Westland C
Publication venue: Informa UK Limited
Publication date: 01/01/2005

Each complex network (or class of networks) presents specific topological features which characterize its connectivity and highly influence the dynamics of processes executed on the network. The analysis, discrimination, and synthesis of complex networks therefore rely on the use of measurements capable of expressing the most relevant topological features. This article presents a survey of such measurements. It includes general considerations about complex network characterization, a brief review of the principal models, and the presentation of the main existing measurements. Important related issues covered in this work comprise the representation of the evolution of complex networks in terms of trajectories in several measurement spaces, the analysis of the correlations between some of the most traditional measurements, perturbation analysis, as well as the use of multivariate statistics for feature selection and network classification. Depending on the network and the analysis task one has in mind, a specific set of features may be chosen. It is hoped that the present survey will help the proper application and interpretation of measurements. Comment: A working manuscript with 78 pages, 32 figures. Suggestions of measurements for inclusion are welcomed by the author.

arXiv.org e-Print Archive

CiteSeerX

Crossref

EXMOTIF: efficient structured motif extraction

Author: A Apostolico
A Apostolico
A Brazma
A Carvalho
A Carvalho
A Policriti
AM Carvalho
D Thakurta
E Eskin
E Eskin
G Benson
G Pavesi
G Pavesi
J van Helden
J Zhu
L Marsan
M Friberg
M Zhang
MF Sagot
MJ Zaki
Mohammed J Zaki
N Pisanti
P Michailidis
S Sinha
S Sinha
TL Bailey
Yongqiang Zhang
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: Extracting motifs from sequences is a mainstay of bioinformatics. We look at the problem of mining structured motifs, which allow variable length gaps between simple motif components. We propose an efficient algorithm, called EXMOTIF, that given some sequence(s), and a structured motif template, extracts all frequent structured motifs that have quorum q. Potential applications of our method include the extraction of single/composite regulatory binding sites in DNA sequences. RESULTS: EXMOTIF is efficient in terms of both time and space and is shown empirically to outperform RISO, a state-of-the-art algorithm. It is also successful in finding potential single/composite transcription factor binding sites. CONCLUSION: EXMOTIF is a useful and efficient tool in discovering structured motifs, especially in DNA sequences. The algorithm is available as open-source at:

CiteSeerX

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central
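
The EXMOTIF abstract above describes extracting structured motifs (simple components separated by variable-length gaps) that occur in at least q sequences. The following is a minimal brute-force sketch of that problem statement, not the EXMOTIF algorithm itself. The function name, the template encoding (`lengths`, `gaps`), and the toy sequences are assumptions made for illustration, and only exact matches are considered.

```python
from collections import defaultdict


def frequent_structured_motifs(sequences, lengths, gaps, quorum):
    """Naive extraction of structured motifs with exact component matches.

    lengths: length of each simple component, e.g. [3, 3]
    gaps:    (min, max) gap allowed between consecutive components
    quorum:  minimum number of distinct sequences that must contain the motif
    """
    support = defaultdict(set)  # motif (tuple of components) -> sequence ids

    def extend(seq, seq_id, pos, index, parts):
        # Component `index` must start at `pos`; give up if it runs off the end.
        end = pos + lengths[index]
        if end > len(seq):
            return
        parts = parts + (seq[pos:end],)
        if index == len(lengths) - 1:
            support[parts].add(seq_id)
            return
        low, high = gaps[index]
        for gap in range(low, high + 1):
            extend(seq, seq_id, end + gap, index + 1, parts)

    for seq_id, seq in enumerate(sequences):
        for start in range(len(seq)):
            extend(seq, seq_id, start, 0, ())

    return {m: len(ids) for m, ids in support.items() if len(ids) >= quorum}


# Toy input, invented for illustration.
sequences = ["ACGTTTTAGG", "CCACGTTTAC", "GGGGGGGGGG"]
result = frequent_structured_motifs(sequences, lengths=[3, 3], gaps=[(1, 2)], quorum=2)
print(result)
```

In this toy input the pair ACG and TTA is separated by two bases in the first sequence and by one base in the second, so it is counted once per sequence and meets a quorum of 2. The enumeration visits every start position and every gap combination, so it is only practical for small inputs.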

Parallel Position Weight Matrices Algorithms

Author: Giraud Mathieu
Varré Jean-Stéphane
Publication venue: Elsevier BV
Publication date: 01/01/2011

Position Weight Matrices (PWMs) are broadly used in computational biology. The basic problems, Scan and MultipleScan, aim to find all the occurrences of a given PWM or a set of PWMs in long sequences. Some other PWM tasks share a common NP-hard subproblem, ScoreDistribution. The existing algorithms rely on the enumeration of a large set of scores or words, and they are mostly not suitable for parallelization. We propose a new algorithm, BucketScoreDistribution, that is both very efficient and suitable for parallelization. We bound the error induced by this algorithm. We implemented a GPU prototype for Scan, MultipleScan and BucketScoreDistribution with the CUDA libraries, and report speedups larger than 10× for the different problems on several Nvidia cards.

HAL - Lille 3

INRIA a CCSD electronic archive server
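
The Scan problem named in the abstract above (finding all occurrences of a given PWM in a long sequence) can be illustrated with a sequential sliding-window sketch. This is not the parallel GPU method of the paper. The log-odds construction, the pseudocount, the uniform background, the threshold, and all function names are assumptions chosen for the example.

```python
import math


def log_odds_pwm(counts, background=0.25, pseudocount=1.0):
    """Convert per-position base counts into a log-odds PWM (list of dicts)."""
    pwm = []
    for column in counts:
        total = sum(column.values()) + 4 * pseudocount
        pwm.append({
            base: math.log2(((column.get(base, 0) + pseudocount) / total) / background)
            for base in "ACGT"
        })
    return pwm


def scan(pwm, sequence, threshold):
    """Return (position, score) for every window scoring at least `threshold`.

    Assumes the sequence contains only the letters A, C, G and T.
    """
    width = len(pwm)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        score = sum(pwm[i][base] for i, base in enumerate(window))
        if score >= threshold:
            hits.append((start, score))
    return hits


# Toy count matrix for the motif "ACG", invented for illustration.
counts = [
    {"A": 8, "C": 0, "G": 0, "T": 0},
    {"A": 0, "C": 8, "G": 0, "T": 0},
    {"A": 0, "C": 0, "G": 8, "T": 0},
]
pwm = log_odds_pwm(counts)
hits = scan(pwm, "TTACGTTACGA", threshold=3.0)
print(hits)
```

Each window is scored independently of the others, which is the property that makes Scan a natural candidate for data-parallel execution.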

Prediction and Analysis of Gene Regulatory Networks in Prokaryotic Genomes

Author: Dieter Jahn
Johannes Klein
Richard Münch
Publication venue: IntechOpen
Publication date: 15/09/2011

IntechOpen

MODER2: First-order Markov Modeling and Discovery of Monomeric and Dimeric Binding Motifs

Author: Das Pratyush
Taipale Jussi
Toivonen Jarkko
Ukkonen Esko
Publication date: 01/05/2020

Motivation: Position-specific probability matrices (PPMs, also called position-specific weight matrices) have been the dominating model for transcription factor (TF)-binding motifs in DNA. There is, however, increasing recent evidence of better performance of higher order models such as Markov models of order one, also called adjacent dinucleotide matrices (ADMs). ADMs can model dependencies between adjacent nucleotides, unlike PPMs. A modeling technique and software tool that would estimate such models simultaneously both for monomers and their dimers have been missing. Results: We present an ADM-based mixture model for monomeric and dimeric TF-binding motifs and an expectation maximization algorithm MODER2 for learning such models from training data and seeds. The model is a mixture that includes monomers and dimers, built from the monomers, with a description of the dimeric structure (spacing, orientation). The technique is modular, meaning that the co-operative effect of dimerization is made explicit by evaluating the difference between expected and observed models. The model is validated using HT-SELEX and generated datasets, and by comparing to some earlier PPM and ADM techniques. The ADM models explain data slightly better than PPM models for 314 tested TFs (or their DNA-binding domains) from four families (bHLH, bZIP, ETS and Homeodomain), the ADM mixture models by MODER2 being the best on average.

Helsingin yliopiston digitaalinen arkisto
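
The MODER2 abstract above contrasts PPMs, which treat positions independently, with first-order Markov models (ADMs), which condition each nucleotide on its predecessor. The sketch below only illustrates that modeling difference on a toy two-position motif. It is not MODER2 and involves no mixture model or expectation maximization. All probabilities and names are invented for the example.

```python
BASES = "ACGT"


def column(**probs):
    """Build a probability row over A, C, G, T; unspecified bases get 0."""
    return {base: probs.get(base, 0.0) for base in BASES}


def ppm_probability(ppm, site):
    """Probability of `site` when every position is independent."""
    prob = 1.0
    for i, base in enumerate(site):
        prob *= ppm[i][base]
    return prob


def adm_probability(initial, transitions, site):
    """Probability of `site` under a position-specific first-order Markov model.

    transitions[i][prev][cur] is P(cur at position i+1 | prev at position i).
    """
    prob = initial[site[0]]
    for i in range(1, len(site)):
        prob *= transitions[i - 1][site[i - 1]][site[i]]
    return prob


# Toy motif (assumption): binding sites are "AT" or "TA" with equal frequency.
ppm = [column(A=0.5, T=0.5), column(A=0.5, T=0.5)]
initial = column(A=0.5, T=0.5)
transitions = [{
    "A": column(T=1.0),  # after A, the next base is always T
    "C": column(),       # C and G never start a site; rows kept as placeholders
    "G": column(),
    "T": column(A=1.0),  # after T, the next base is always A
}]

for site in ("AT", "TA", "AA"):
    print(site, ppm_probability(ppm, site), adm_probability(initial, transitions, site))
```

With these toy numbers the PPM gives "AA" the same probability as the real sites "AT" and "TA", while the ADM gives "AA" zero probability because it captures the dependency between the two positions.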

Practical Strategies for Discovering Regulatory DNA Sequence Motifs

Author: Fraenkel Ernest
MacIsaac Kenzie D
Publication venue: Public Library of Science
Publication date: 01/04/2006

Crossref

Directory of Open Access Journals

PubMed Central

Efficient exact motif discovery

Author: Ettwiller
Fratkin
Li
Lladser
Pavesi
Reinert
S. Rahmann
Sandve
Sandve
Sinha
T. Marschall
Tompa
Publication venue: Oxford University Press
Publication date: 01/01/2009

Motivation: The motif discovery problem consists of finding over-represented patterns in a collection of biosequences. It is one of the classical sequence analysis problems, but still has not been satisfactorily solved in an exact and efficient manner. This is partly due to the large number of possibilities of defining the motif search space and the notion of over-representation. Even for well-defined formalizations, the problem is frequently solved in an ad hoc manner with heuristics that are not guaranteed to find the best motif.

CiteSeerX

Crossref

PubMed Central