Search CORE

362 research outputs found

Fine-grained parallelization of fitness functions in bioinformatics optimization problems: gene selection for cancer classification and biclustering of gene expression data

Author: Cerrada Barrios José Luis
Crawford Broderick
Fernández Díaz Ramón
Gómez Pulido Juan Antonio
Lanza Gutiérrez José Manuel
Soto Guzmán Ricardo
Trinidad Amado Sebastián
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

ANTECEDENTES: las metaheurísticas se utilizan ampliamente para resolver grandes problemas de optimización combinatoria en bioinformática debido al enorme conjunto de posibles soluciones. Dos problemas representativos son la selección de genes para la clasificación del cáncer y el agrupamiento de los datos de expresión génica. En la mayoría de los casos, estas metaheurísticas, así como otras técnicas no lineales, aplican una función de adecuación a cada solución posible con una población de tamaño limitado, y ese paso involucra latencias más altas que otras partes de los algoritmos, lo cual es la razón por la cual el tiempo de ejecución de las aplicaciones dependerá principalmente del tiempo de ejecución de la función de aptitud. Además, es habitual encontrar formulaciones aritméticas de punto flotante para las funciones de fitness. De esta manera, una paralelización cuidadosa de estas funciones utilizando la tecnología de hardware reconfigurable acelerará el cálculo, especialmente si se aplican en paralelo a varias soluciones de la población. RESULTADOS: una paralelización de grano fino de dos funciones de aptitud de punto flotante de diferentes complejidades y características involucradas en el biclustering de los datos de expresión génica y la selección de genes para la clasificación del cáncer permitió obtener mayores aceleraciones y cómputos de potencia reducida con respecto a los microprocesadores habituales. CONCLUSIONES: Los resultados muestran mejores rendimientos utilizando tecnología de hardware reconfigurable en lugar de los microprocesadores habituales, en términos de tiempo de consumo y consumo de energía, no solo debido a la paralelización de las operaciones aritméticas, sino también gracias a la evaluación de aptitud concurrente para varios individuos de la población en La metaheurística. Esta es una buena base para crear soluciones aceleradas y de bajo consumo de energía para escenarios informáticos intensivos.BACKGROUND: Metaheuristics are widely used to solve large combinatorial optimization problems in bioinformatics because of the huge set of possible solutions. Two representative problems are gene selection for cancer classification and biclustering of gene expression data. In most cases, these metaheuristics, as well as other non-linear techniques, apply a fitness function to each possible solution with a size-limited population, and that step involves higher latencies than other parts of the algorithms, which is the reason why the execution time of the applications will mainly depend on the execution time of the fitness function. In addition, it is usual to find floating-point arithmetic formulations for the fitness functions. This way, a careful parallelization of these functions using the reconfigurable hardware technology will accelerate the computation, specially if they are applied in parallel to several solutions of the population. RESULTS: A fine-grained parallelization of two floating-point fitness functions of different complexities and features involved in biclustering of gene expression data and gene selection for cancer classification allowed for obtaining higher speedups and power-reduced computation with regard to usual microprocessors. CONCLUSIONS: The results show better performances using reconfigurable hardware technology instead of usual microprocessors, in computing time and power consumption terms, not only because of the parallelization of the arithmetic operations, but also thanks to the concurrent fitness evaluation for several individuals of the population in the metaheuristic. This is a good basis for building accelerated and low-energy solutions for intensive computing scenarios.• Ministerio de Economía y Competitividad y Fondos FEDER. Contrato TIN2012-30685 (I+D+i) • Gobierno de Extremadura. Ayuda GR15011 para grupos TIC015 • CONICYT/FONDECYT/REGULAR/1160455. Beca para Ricardo Soto Guzmán • CONICYT/FONDECYT/REGULAR/1140897. Beca para Broderick CrawfordpeerReviewe

Springer - Publisher Connector

Dehesa. Repositorio Institucional de la Universidad de Extremadura

Hardware acceleration of genomics data analysis: challenges and opportunities

Author: Abdallah
Al Kawam
Al-Absi
Alser
Alser
Altschul
Angerer
Antipov
Arram
Arram
Audano
Ayling
Bahrebar
Banerjee
Bao
Bao
Barron
Behjati
Bohannan
Brittain
Broad Institute
Broad Institute
Cardon
Carrillo
Carrillo
Challis
Chen
Chen
Ciccolella
Cingolani
Clark
Croville
Das
Denti
Doan
Dobin
Du
Fei
Fleckhaus
Fonseca
Genome Research Ltd
Ghurye
Golosova
Goodwin
Goyal
Gök
Hackl
Hasnain
Houtgast
Hu
Illumina Inc
Jackson
Javed
Joardar
Joshi
Jourdren
Kaplan
Kent
Kim
Kim
Kosuri
Langmead
Langmead
Langmead
Lesk
Li
Li
Li
Li
Li
Li
Li
Lightbody
Lightbody
Liu
Liu
Liu
Lv
Margulies
Maruyama
Mcvicar
Milward
Muir
NCBI
Niedringhaus
Nsame
Orth
Oxford Nanopore Technologies
Park
Patel
Payne
Peddie
Rizzo
Robinson
Sarkar
Sboner
Schatz
Shang
Shang
Sharifi
Subbulakshmi
Sundfeld
Tian
Tsai
Turakhia
Turakhia
Wang
Wang
Ward
xilinx
Yano
Zaharia
Zokaee
Publication venue: 'Oxford University Press (OUP)'
Publication date: 25/05/2021
Field of study

Ulster University's Research Portal

Custom Hardware Versus Cloud Computing in Big Data

Author: A O’Driscoll
A Papadopoulos
AM Townsend
CLP Chen
D Qiao
D Schneider
D Singh
E O’Dwyer
EL Dijk Van
G Tan
I Aldridge
I Foster
IAT Hashem
J Fan
J Gubbi
J Shen
JJJM Seddon
M Almalki
M Armbrust
M Baker
M Batty
M Chen
M Kühnemund
M Swan
ML Metzker
MM Najafabadi
NJ Loman
P Klus
PC Zikopoulos
R Kitchin
Y Erlich
Y Liu
Y Wang
Z Xu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 27/07/2017
Field of study

The computational and data handling challenges in big data are immense yet a market is steadily growing traditionally supported by technologies such as Hadoop for management and processing of huge and unstructured datasets. With this ever increasing deluge of data we now need the algorithms, tools and computing infrastructure to handle the extremely computationally intense data analytics, looking for patterns and information pertinent to creating a market edge for a range of applications. Cloud computing has provided opportunities for scalable high-performance solutions without the initial outlay of developing and creating the core infrastructure. One vendor in particular, Amazon Web Services, has been leading this field. However, other solutions exist to take on the computational load of big data analytics. This chapter provides an overview of the extent of applications in which big data analytics is used. Then an overview is given of some of the high-performance computing options that are available, ranging from multiple Central Processing Unit (CPU) setups, Graphical Processing Units (GPUs), Field Programmable Gate Arrays (FPGAs) and cloud solutions. The chapter concludes by looking at some of the state of the art solutions for deep learning platforms in which custom hardware such as FPGAs and Application Specific Integrated Circuits (ASICs) are used within a cloud platform for key computational bottlenecks

FPGA acceleration of DNA sequence alignment: design analysis and optimization

Author: Ng Ho-Cheung
Publication venue: Computing, Imperial College London
Publication date: 01/01/2022
Field of study

Existing FPGA accelerators for short read mapping often fail to utilize the complete biological information in sequencing data for simple hardware design, leading to missed or incorrect alignment. In this work, we propose a runtime reconfigurable alignment pipeline that considers all information in sequencing data for the biologically accurate acceleration of short read mapping. We focus our efforts on accelerating two string matching techniques: FM-index and the Smith-Waterman algorithm with the affine-gap model which are commonly used in short read mapping. We further optimize the FPGA hardware using a design analyzer and merger to improve alignment performance. The contributions of this work are as follows. 1. We accelerate the exact-match and mismatch alignment by leveraging the FM-index technique. We optimize memory access by compressing the data structure and interleaving the access with multiple short reads. The FM-index hardware also considers complete information in the read data to maximize accuracy. 2. We propose a seed-and-extend model to accelerate alignment with indels. The FM-index hardware is extended to support the seeding stage while a Smith-Waterman implementation with the affine-gap model is developed on FPGA for the extension stage. This model can improve the efficiency of indel alignment with comparable accuracy versus state-of-the-art software. 3. We present an approach for merging multiple FPGA designs into a single hardware design, so that multiple place-and-route tasks can be replaced by a single task to speed up functional evaluation of designs. We first experiment with this approach to demonstrate its feasibility for different designs. Then we apply this approach to optimize one of the proposed FPGA aligners for better alignment performance.Open Acces

Spiral - Imperial College Digital Repository