Search CORE

120 research outputs found

Model-en data-analyse ten behoeve van betere tij‐verwachtingen: deelrapport 1. Data-analyse

Author: Boeckx L.
D'Haeseleer E.
Deschamps M.
Meire D.
Mostaert F.
Nossent J.
Vanderkimpen P.
Verwaest T.
Publication venue
Publication date: 01/01/2017
Field of study

Efficient algorithms for accurate hierarchical clustering of huge datasets: tackling the entire protein space

Author: Ashburner
Bridges
D'haeseleer
E. Portugaly
Finn
Fitch
Kaplan
Kaplan
Liu
M. Fromer
M. Linial
Mulder
Murzin
Shachar
Sneath
Tatusov
Y. Loewenstein
Publication venue: Oxford University Press
Publication date
Field of study

Motivation: UPGMA (average linking) is probably the most popular algorithm for hierarchical data clustering, especially in computational biology. However, UPGMA requires the entire dissimilarity matrix in memory. Due to this prohibitive requirement, UPGMA is not scalable to very large datasets

Crossref

PubMed Central

RegPredict: an integrated system for regulon inference in prokaryotes by comparative genomics approach

Author: A. A. Mironov
A. E. Kazakov
A. P. Arkin
Alkema
Baumbach
D'haeseleer
D. A. Rodionov
E. D. Stavrovskaya
E. S. Novichkova
Fredrickson
Gelfand
Gelfand
I. Dubchak
M. S. Gelfand
Manson McGuire
McCue
Overbeek
P. S. Novichkov
Price
Rodionov
Rodionov
Rodionov
Rodionov
Rodionov
Tan
Publication venue: Oxford University Press
Publication date: 26/05/2010
Field of study

RegPredict web server is designed to provide comparative genomics tools for reconstruction and analysis of microbial regulons using comparative genomics approach. The server allows the user to rapidly generate reference sets of regulons and regulatory motif profiles in a group of prokaryotic genomes. The new concept of a cluster of co-regulated orthologous operons allows the user to distribute the analysis of large regulons and to perform the comparative analysis of multiple clusters independently. Two major workflows currently implemented in RegPredict are: (i) regulon reconstruction for a known regulatory motif and (ii) ab initio inference of a novel regulon using several scenarios for the generation of starting gene sets. RegPredict provides a comprehensive collection of manually curated positional weight matrices of regulatory motifs. It is based on genomic sequences, ortholog and operon predictions from the MicrobesOnline. An interactive web interface of RegPredict integrates and presents diverse genomic and functional information about the candidate regulon members from several web resources. RegPredict is freely accessible at http://regpredict.lbl.gov

Crossref

PubMed Central

UNT Digital Library

Bayesian hierarchical clustering for studying cancer gene expression data with unknown statistics

Author: A Su
B Frey
C Nutt
C Rasmussen
C Rasmussen
D Arango
D Jiang
D Singh
David R. J. Snead
E Cooke
Ferdinando Di Cunto
G Brock
J Ihmels
J Yao
K Yeung
Korsuk Sirinukunwattana
L Hubert
L McQuitty
LF Wu
M De Souto
M Eisen
M Shipp
Muhammad F. Bari
Nasir M. Rajpoot
P D'haeseleer
P Laiho
R Neal
R Savage
R Sokal
Richard S. Savage
S Armstrong
S Datta
S Eschrich
S Falcon
S Matsui
S Pomeroy
S Ramaswamy
S Varambally
T Golub
Y Wang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

Clustering analysis is an important tool in studying gene expression data. The Bayesian hierarchical clustering (BHC) algorithm can automatically infer the number of clusters and uses Bayesian model selection to improve clustering quality. In this paper, we present an extension of the BHC algorithm. Our Gaussian BHC (GBHC) algorithm represents data as a mixture of Gaussian distributions. It uses normal-gamma distribution as a conjugate prior on the mean and precision of each of the Gaussian components. We tested GBHC over 11 cancer and 3 synthetic datasets. The results on cancer datasets show that in sample clustering, GBHC on average produces a clustering partition that is more concordant with the ground truth than those obtained from other commonly used algorithms. Furthermore, GBHC frequently infers the number of clusters that is often close to the ground truth. In gene clustering, GBHC also produces a clustering partition that is more biologically plausible than several other state-of-the-art methods. This suggests GBHC as an alternative tool for studying gene expression data. The implementation of GBHC is available at https://sites. google.com/site/gaussianbhc

CiteSeerX

Public Library of Science (PLOS)

Qatar University Institutional Repository

Crossref

Directory of Open Access Journals

PubMed Central

Warwick Research Archives Portal Repository

Parallel mutual information estimation for inferring gene regulatory networks on GPUs

Author: AJ Butte
AM Fraser
Bertil Schmidt
CO Daub
E Lindholm
Haixiang Shi
I Arsic
J Schäfer
J Wilson
J Zola
J Zola
JPW Pluim
M Tebmann
N CUDA
N Friedman
P D'Haeseleer
SA Manavski
W Liu
Weiguo Liu
Wolfgang Müller-Wittig
X Chen
X Zhou
X Zhou
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Mutual information is a measure of similarity between two variables. It has been widely used in various application domains including computational biology, machine learning, statistics, image processing, and financial computing. Previously used simple histogram based mutual information estimators lack the precision in quality compared to kernel based methods. The recently introduced B-spline function based mutual information estimation method is competitive to the kernel based methods in terms of quality but at a lower computational complexity. Results We present a new approach to accelerate the B-spline function based mutual information estimation algorithm with commodity graphics hardware. To derive an efficient mapping onto this type of architecture, we have used the Compute Unified Device Architecture (CUDA) programming model to design and implement a new parallel algorithm. Our implementation, called CUDA-MI, can achieve speedups of up to 82 using double precision on a single GPU compared to a multi-threaded implementation on a quad-core CPU for large microarray datasets. We have used the results obtained by CUDA-MI to infer gene regulatory networks (GRNs) from microarray data. The comparisons to existing methods including ARACNE and TINGe show that CUDA-MI produces GRNs of higher quality in less time. Conclusions CUDA-MI is publicly available open-source software, written in CUDA and C++ programming languages. It obtains significant speedup over sequential multi-threaded implementation by fully exploiting the compute capability of commonly used CUDA-enabled low-cost GPUs.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Equilibrium reconstruction for Single Helical Axis reversed field pinch plasmas

Author: A Alfier
A Canton
A Fassina
B Momo
Bodin H A B
Cappello S Paccagnella R Sindoni E
Carraro L
D Terranova
D'Haeseleer W D
E Martines
F Bonomo
Fitzpatrick R
Franz P
Ji H
Marrelli L
Martini S
Menmuir S
Ortolani S
P Franz
P Innocente
P Zanca
Pereverzev G Yushmanov P N
Piovesan P
Puiatti M E
Pustovitov V D
R Lorenzini
Valisa M
Zanca P
Publication venue: 'IOP Publishing'
Publication date: 11/01/2011
Field of study

Single Helical Axis (SHAx) configurations are emerging as the natural state for high current reversed field pinch (RFP) plasmas. These states feature the presence of transport barriers in the core plasma. Here we present a method for computing the equilibrium magnetic surfaces for these states in the force-free approximation, which has been implemented in the SHEq code. The method is based on the superposition of a zeroth order axisymmetric equilibrium and of a first order helical perturbation computed according to Newcomb's equation supplemented with edge magnetic field measurements. The mapping of the measured electron temperature profiles, soft X-ray emission and interferometric density measurements on the computed magnetic surfaces demonstrates the quality of the equilibrium reconstruction. The procedure for computing flux surface averages is illustrated, and applied to the evaluation of the thermal conductivity profile. The consistency of the evaluated equilibria with Ohm's law is also discussed.Comment: Submitted to Plasma Physics and Controlled Fusio

arXiv.org e-Print Archive

Crossref

Unraveling gene regulatory networks from time-resolved gene expression data -- a measures comparison study

Peer reviewedPublisher PD

Aberdeen University Research

Crossref

Springer - Publisher Connector

PubMed Central

Repositorium für Naturwissenschaften und Technik

MPG.PuRe

Eigengene networks for studying the relationships between co-expression modules

Author: A Barabási
A Ghazalpour
A Li
A Yip
B Zhang
D Reiss
E Ravasz
E Segal
G Dennis
H Hotelling
H Wei
J Dong
JM Stuart
L Hartwell
M Oldham
O Alter
P D'haeseleer
P Khaitovich
P Langfelder
Peter Langfelder
R Albert
RA Fisher
RI Jennrich
S Carter
S Horvath
Steve Horvath
T Fuller
WS Wu
X Xu
X Zhou
Y Ye
Z Bar-Joseph
Publication venue: BioMed Central
Publication date: 01/11/2007
Field of study

Abstract Background There is evidence that genes and their protein products are organized into functional modules according to cellular processes and pathways. Gene co-expression networks have been used to describe the relationships between gene transcripts. Ample literature exists on how to detect biologically meaningful modules in networks but there is a need for methods that allow one to study the relationships between modules. Results We show that network methods can also be used to describe the relationships between co-expression modules and present the following methodology. First, we describe several methods for detecting modules that are shared by two or more networks (referred to as consensus modules). We represent the gene expression profiles of each module by an eigengene. Second, we propose a method for constructing an eigengene network, where the edges are undirected but maintain information on the sign of the co-expression information. Third, we propose methods for differential eigengene network analysis that allow one to assess the preservation of network properties across different data sets. We illustrate the value of eigengene networks in studying the relationships between consensus modules in human and chimpanzee brains; the relationships between consensus modules in brain, muscle, liver, and adipose mouse tissues; and the relationships between male-female mouse consensus modules and clinical traits. In some applications, we find that module eigengenes can be organized into higher level clusters which we refer to as meta-modules. Conclusion Eigengene networks can be effective and biologically meaningful tools for studying the relationships between modules of a gene co-expression network. The proposed methods may reveal a higher order organization of the transcriptome. R software tutorials, the data, and supplementary material can be found at the following webpage: <url>http://www.genetics.ucla.edu/labs/horvath/CoexpressionNetwork/EigengeneNetwork</url>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Demonstration of TVoIP services in a multimedia broadband enabled access network

Author: D'HAESELEER S
De Vleeschauwer Bart
GEILHARDT F
GILON E
HOET J
LE MANSEC G
MAILLET A
NAGEL B
PEñA C
Simoens Pieter
Van de Meerssche Wim
Publication venue
Publication date: 01/01/2007
Field of study

Ghent University Academic Bibliography

A ChIP-Seq Benchmark Shows That Sequence Conservation Mainly Improves Detection of Strong Transcription Factor Binding Sites

Author: A Moses
A Siepel
A Stark
BT Naughton
D Boffelli
D Karolchik
DT Odom
E Birney
Finn Drabløs
G Badis
G Sandve
J Bryne
J Ernst
J Hawkins
JA Hanley
K Klepper
L Elnitski
M Rye
M Tompa
Morten Beck Rye
P D'haeseleer
P Kheradpour
PJ Park
Pål Sætrom
R Jothi
Sridhar Hannenhalli
T Vavouri
Tony Håndstad
V Matys
WW Wasserman
X Xie
Y Zhang
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Transcription factors are important controllers of gene expression and mapping transcription factor binding sites (TFBS) is key to inferring transcription factor regulatory networks. Several methods for predicting TFBS exist, but there are no standard genome-wide datasets on which to assess the performance of these prediction methods. Also, it is believed that information about sequence conservation across different genomes can generally improve accuracy of motif-based predictors, but it is not clear under what circumstances use of conservation is most beneficial.Here we use published ChIP-seq data and an improved peak detection method to create comprehensive benchmark datasets for prediction methods which use known descriptors or binding motifs to detect TFBS in genomic sequences. We use this benchmark to assess the performance of five different prediction methods and find that the methods that use information about sequence conservation generally perform better than simpler motif-scanning methods. The difference is greater on high-affinity peaks and when using short and information-poor motifs. However, if the motifs are specific and information-rich, we find that simple motif-scanning methods can perform better than conservation-based methods.Our benchmark provides a comprehensive test that can be used to rank the relative performance of transcription factor binding site prediction methods. Moreover, our results show that, contrary to previous reports, sequence conservation is better suited for predicting strong than weak transcription factor binding sites

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

NORA - Norwegian Open Research Archives