23 research outputs found

Parallel mutual information estimation for inferring gene regulatory networks on GPUs

Authors: AJ Butte, AM Fraser, Bertil Schmidt, CO Daub, E Lindholm, Haixiang Shi, I Arsic, J Schäfer, J Wilson, J Zola, JPW Pluim, M Tebmann, N CUDA, N Friedman, P D'Haeseleer, SA Manavski, W Liu, Weiguo Liu, Wolfgang Müller-Wittig, X Chen, X Zhou
Publication venue: BioMed Central
Publication date: 01/01/2011

Abstract. Background: Mutual information is a measure of similarity between two variables. It has been widely used in various application domains including computational biology, machine learning, statistics, image processing, and financial computing. Previously used simple histogram-based mutual information estimators lack the precision of kernel-based methods. The recently introduced B-spline function based mutual information estimation method is competitive with the kernel-based methods in terms of quality, but at a lower computational complexity. Results: We present a new approach to accelerate the B-spline function based mutual information estimation algorithm with commodity graphics hardware. To derive an efficient mapping onto this type of architecture, we have used the Compute Unified Device Architecture (CUDA) programming model to design and implement a new parallel algorithm. Our implementation, called CUDA-MI, can achieve speedups of up to 82-fold using double precision on a single GPU compared to a multi-threaded implementation on a quad-core CPU for large microarray datasets. We have used the results obtained by CUDA-MI to infer gene regulatory networks (GRNs) from microarray data. The comparisons to existing methods including ARACNE and TINGe show that CUDA-MI produces GRNs of higher quality in less time. Conclusions: CUDA-MI is publicly available open-source software, written in the CUDA and C++ programming languages. It obtains significant speedup over a multi-threaded CPU implementation by fully exploiting the compute capability of commonly used CUDA-enabled low-cost GPUs.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central
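
As a point of reference for the abstract above, the following is a minimal sketch of the simple histogram-based estimator that the abstract names as the less precise baseline. It is not the B-spline method and not CUDA-MI; the function name `histogram_mutual_information`, the equal-width binning, and the default of 4 bins are assumptions made only for this illustration.

```python
import math
from collections import Counter


def histogram_mutual_information(x, y, bins=4):
    """Estimate I(X;Y) in bits by equal-width binning of two samples.

    Illustrative baseline only (hypothetical helper, not from the paper).
    """
    if len(x) != len(y) or not x:
        raise ValueError("x and y must be non-empty and of equal length")

    def bin_index(values):
        lo, hi = min(values), max(values)
        # A constant sample has zero range; fall back to width 1.0.
        width = (hi - lo) / bins or 1.0
        # Clamp so that the maximum value lands in the last bin.
        return [min(int((v - lo) / width), bins - 1) for v in values]

    bx, by = bin_index(x), bin_index(y)
    n = len(x)
    joint = Counter(zip(bx, by))   # counts for each (bin_x, bin_y) pair
    px, py = Counter(bx), Counter(by)  # marginal counts

    mi = 0.0
    for (i, j), c in joint.items():
        # p(i,j) * log2( p(i,j) / (p(i) * p(j)) ), written with raw counts
        mi += (c / n) * math.log2(c * n / (px[i] * py[j]))
    return mi
```

With a sample compared against itself, the estimate equals the entropy of the binned sample; for samples whose binned values are independent, it is zero. Estimating this quantity for every pair of genes is the all-pairs workload that the abstract describes accelerating on a GPU.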

CUDASW++2.0: enhanced Smith-Waterman protein database search on CUDA-enabled GPUs based on SIMT and virtualized SIMD abstractions

Authors: A Szalkowski, A Wirawan, A Wozniak, Bertil Schmidt, Douglas L Maskell, E Lindholm, G Peris, J Nickolls, JD Thompson, JP Comet, M Farrar, MA Larkin, O Bastien, O Gotoh, SA Manavski, SF Altschul, T Oliver, T Rognes, T Smith, TI Li, W Liu, WR Pearson, Y Liu, Yongchao Liu
Publication venue: BioMed Central
Publication date: 01/01/2010

Abstract. Background: Due to its high sensitivity, the Smith-Waterman algorithm is widely used for biological database searches. Unfortunately, the quadratic time complexity of this algorithm makes it highly time-consuming, and the exponential growth of biological databases makes the situation worse. To accelerate this algorithm, many efforts have been made to develop techniques for high-performance architectures, especially the recently emerging many-core architectures and their associated programming models. Findings: This paper describes the latest release of the CUDASW++ software, CUDASW++ 2.0, which makes new contributions to Smith-Waterman protein database searches using the Compute Unified Device Architecture (CUDA). A parallel Smith-Waterman algorithm is proposed to further optimize the performance of CUDASW++ 1.0 based on the single instruction, multiple thread (SIMT) abstraction. For the first time, we have investigated a partitioned vectorized Smith-Waterman algorithm using CUDA based on the virtualized single instruction, multiple data (SIMD) abstraction. The optimized SIMT and the partitioned vectorized algorithms were benchmarked and, remarkably, have similar performance characteristics. CUDASW++ 2.0 improves on the performance of CUDASW++ 1.0 by as much as 1.74 (1.72) times using the optimized SIMT algorithm and up to 1.77 (1.66) times using the partitioned vectorized algorithm, with a performance of up to 17 (30) billion cell updates per second (GCUPS) on a single-GPU GeForce GTX 280 (dual-GPU GeForce GTX 295) graphics card. Conclusions: CUDASW++ 2.0 is publicly available open-source software, written in the CUDA and C++ programming languages. It obtains significant performance improvement over CUDASW++ 1.0 using either the optimized SIMT algorithm or the partitioned vectorized algorithm for Smith-Waterman protein database searches by fully exploiting the compute capability of commonly used CUDA-enabled low-cost GPUs.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central
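
To make the quadratic cost mentioned in the abstract above concrete, here is a minimal sequential sketch of the Smith-Waterman local alignment score. It is not the CUDASW++ algorithm: the function name `smith_waterman_score`, the simple match/mismatch scores, and the linear gap penalty are assumptions chosen for brevity (protein searches normally use a substitution matrix and affine gap penalties).

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Return the best local alignment score of strings a and b.

    Illustrative sketch with assumed scoring parameters; fills a
    len(a) x len(b) dynamic-programming table one row at a time.
    """
    prev = [0] * (len(b) + 1)  # previous row of the table
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            # Extend the diagonal with a match or mismatch.
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, hence the floor of 0.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best
```

Each call updates `len(a) * len(b)` cells, which is the quantity counted by the cell-updates-per-second (CUPS) figures quoted in the abstract.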

CBESW: Sequence Alignment on the Playstation 3

Authors: A Stamatakis, A Wozniak, Adrianto Wirawan, Bertil Schmidt, Chee Keong Kwoh, D Pham, DA Benson, IBM, International Business Machines, ITS Li, JA Kahle, M Farrar, Nim Tri Hieu, O Gotoh, R Durbin, SA Manavski, T Rognes, T Smith, TF Oliver, V Pande, V Sachdeva, W Liu, WR Pearson
Publication venue: BioMed Central
Publication date: 01/01/2008

Abstract. Background: The exponential growth of available biological data is rapidly moving bioinformatics towards a data-intensive, computational science. As a result, the computational power needed by bioinformatics applications is growing exponentially as well. The recent emergence of accelerator technologies has made it possible to achieve an excellent improvement in execution time for many bioinformatics applications, compared to current general-purpose platforms. In this paper, we demonstrate how the PlayStation® 3, powered by the Cell Broadband Engine, can be used as a computational platform to accelerate the Smith-Waterman algorithm. Results: For large datasets, our implementation on the PlayStation® 3 provides a significant improvement in running time compared to other implementations such as SSEARCH, Striped Smith-Waterman and CUDA. Our implementation achieves a peak performance of up to 3,646 MCUPS. Conclusion: The results from our experiments demonstrate that the PlayStation® 3 console can be used as an efficient low-cost computational platform for high-performance sequence alignment applications.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

An efficient algorithm for systematic analysis of nucleotide strings suitable for siRNA design

Authors: A Apostolico, A Verdel, AC Hsieh, AL Jackson, AM Chalk, Ancha Baranova, CF Hung, E Ukkonen, EM McCreight, F Fernandes, F Tilesi, Ganiraju Manyam, IT Li, J Na, Jonathan Bode, K Ui-Tei, M Scherr, Maria Emelianenko, MH Schulz, P Saetrom, P Svoboda, P Weiner, PB Hajeri, PC Scacheri, R Giegerich, SA Manavski, T Alsheddi, W Cui, X Dai, Y Naito
Publication venue: BioMed Central
Publication date: 01/01/2011

Abstract. Background: The "off-target" silencing effect hinders the development of siRNA-based therapeutic and research applications. Existing solutions for finding possible locations of siRNA seats within a large database of genes are either too slow, miss a portion of the targets, or are simply not designed to handle a very large number of queries. We propose a new approach that reduces the computational time compared to existing techniques. Findings: The proposed method employs tree-based storage in the form of a modified truncated suffix tree to sort all possible short substrings within a given set of strings (i.e., a transcriptome). Using the new algorithm, we pre-computed a list of the best siRNA locations within each human gene ("siRNA seats"). siRNAs designed to reside within siRNA seats are less likely to hybridize off-target. These siRNA seats could be used as an input for the traditional "set-of-rules" type of siRNA design software. The list of siRNA seats is available through a public database located at http://web.cos.gmu.edu/~gmanyam/siRNA_db/search.php. Conclusions: In an attempt to perform top-down prediction of human siRNAs with minimized off-target hybridization, we developed an efficient algorithm that employs suffix-tree-based storage of the substrings. Applications of this approach are not limited to optimal siRNA design, but can also be useful for other tasks involving selection of characteristic strings specific to individual genes. These strings could then be used as siRNA seats, as specific probes for gene expression studies by oligonucleotide-based microarrays, for the design of molecular beacon probes for real-time PCR and, generally, for any type of PCR primer.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central
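
The idea in the abstract above of selecting short strings specific to individual genes can be illustrated with a much simpler data structure. The sketch below swaps the paper's modified truncated suffix tree for a plain dictionary of fixed-length substrings (k-mers), so it shows the selection criterion only, not the authors' algorithm or its performance. The function name `unique_kmers` and the toy gene names are hypothetical.

```python
from collections import defaultdict


def unique_kmers(sequences, k):
    """Map each sequence name to the sorted k-mers found in no other sequence.

    Illustrative stand-in for a truncated suffix tree (hypothetical helper).
    `sequences` is a dict of name -> nucleotide string.
    """
    owners = defaultdict(set)  # k-mer -> names of sequences containing it
    for name, seq in sequences.items():
        for i in range(len(seq) - k + 1):
            owners[seq[i:i + k]].add(name)

    result = {name: [] for name in sequences}
    for kmer, names in owners.items():
        if len(names) == 1:  # occurs in exactly one sequence
            result[next(iter(names))].append(kmer)
    return {name: sorted(kmers) for name, kmers in result.items()}


# Toy input: two overlapping sequences, 4-mers.
specific = unique_kmers({"geneA": "ACGTAC", "geneB": "CGTACG"}, 4)
```

In this toy input, the 4-mers `CGTA` and `GTAC` occur in both sequences and are excluded, leaving one gene-specific 4-mer per sequence. A dictionary of this kind stores every k-mer explicitly, which is one reason a tree-based structure may be preferable at transcriptome scale.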

On the Virtualization of CUDA Based GPU Remoting on ARM and X86 Machines in the GVirtuS Framework

Authors: A Herrera, C Shuai, C Yang, Carlo Palmieri, Carmine Ferraro, Cheol-Ho Hong, Dimitrios S. Nikolopoulos, G Giunta, Giuliano Laccetti, Giulio Giunta, Ivor Spence, L Dagum, L Shi, M Garland, Marco Lapegna, N Rajovic, R Montella, Raffaele Montella, SA Manavski, Valentina Pelliccia, Y Liu
Publication venue: Springer Science and Business Media LLC

Crossref

Fast and accurate protein substructure searching with simulated annealing and GPUs

Abstract. Background: Searching a database of protein structures for matches to a query structure, or occurrences of a structural motif, is an important task in structural biology and bioinformatics. While there are many existing methods for structural similarity searching, faster and more accurate approaches are still required, and few current methods are capable of substructure (motif) searching. Results: We developed an improved heuristic for tableau-based protein structure and substructure searching using simulated annealing that is as fast as or faster than, and comparable in accuracy to, some widely used existing methods. Furthermore, we created a parallel implementation on a modern graphics processing unit (GPU). Conclusions: The GPU implementation achieves up to a 34-fold speedup over the CPU implementation of tableau-based structure search with simulated annealing, making it one of the fastest available methods. To the best of our knowledge, this is the first application of a GPU to the protein structural search problem.

CiteSeerX

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

University of Melbourne Institutional Repository