Memory-Optimised Parallel Processing of Hi-C Data

Aldinucci, Marco; Drocco, Maurizio; Misale, Claudia; Peretti Pezzi, Guilherme; Tordini, Fabio

Memory-Optimised Parallel Processing of Hi-C Data

Authors: Marco Aldinucci
Maurizio Drocco
Claudia Misale
Guilherme Peretti Pezzi
Fabio Tordini
Publication date: 1 January 2015
Publisher: 'Institute of Electrical and Electronics Engineers (IEEE)'
Doi

Abstract

Abstract—This paper presents the optimisation efforts on the creation of a graph-based mapping representation of gene adjacency. The method is based on the Hi-C process, starting from Next Generation Sequencing data, and it analyses a huge amount of static data in order to produce maps for one or more genes. Straightforward parallelisation of this scheme does not yield acceptable performance on multicore architectures since the scalability is rather limited due to the memory bound nature of the problem. This work focuses on the memory optimisations that can be applied to the graph construction algorithm and its (complex) data structures to derive a cache-oblivious algorithm and eventually to improve the memory bandwidth utilisation. We used as running example NuChart-II, a tool for annotation and statistic analysis of Hi-C data that creates a gene-centric neigh-borhood graph. The proposed approach, which is exemplified for Hi-C, addresses several common issue in the parallelisation of memory bound algorithms for multicore. Results show that the proposed approach is able to increase the parallel speedup from 7x to 22x (on a 32-core platform). Finally, the proposed C++ implementation outperforms the first R NuChart prototype, by which it was not possible to complete the graph generation because of strong memory-saturation problems. I

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Institutional Research Information System University of Turin

oai:iris.unito.it:2318/1521910

Last time updated on 18/04/2020

CiteSeerX

oai:CiteSeerX.psu:10.1.1.735.6...

Last time updated on 30/10/2017