Search CORE

7 research outputs found

Temporal workload-aware replicated partitioning for social networks

Author: Aykanat Cevdet
Ferhatosmanoglu Hakan
Selvitopi R. Oguz
Turk Ata
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 23/01/2014
Field of study

Most frequent and expensive queries in social networks involve multi-user operations such as requesting the latest tweets or news-feeds of friends. The performance of such queries are heavily dependent on the data partitioning and replication methodologies adopted by the underlying systems. Existing solutions for data distribution in these systems involve hashor graph-based approaches that ignore the multi-way relations among data. In this work, we propose a novel data partitioning and selective replication method that utilizes the temporal information in prior workloads to predict future query patterns. Our method utilizes the social network structure and the temporality of the interactions among its users to construct a hypergraph that correctly models multi-user operations. It then performs simultaneous partitioning and replication of this hypergraph to reduce the query span while respecting load balance and I/O load constraints under replication. To test our model, we enhance the Cassandra NoSQL system to support selective replication and we implement a social network application (a Twitter clone) utilizing our enhanced Cassandra. We conduct experiments on a cloud computing environment (Amazon EC2) to test the developed systems. Comparison of the proposed method with hash- and enhanced graph-based schemes indicate that it significantly improves latency and throughput

Crossref

Bilkent University Institutional Repository

Warwick Research Archives Portal Repository

A Novel Method for Scaling Iterative Solvers: Avoiding Latency Overhead of Parallel Sparse-Matrix Vector Multiplies

Author: Cevdet Aykanat
Muhammet Mustafa Ozdal
R. Oguz Selvitopi
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Recommended from our members

Combinatorial BLAS 2.0: Scaling Combinatorial Algorithms on Distributed-Memory Systems

Author: Azad Ariful
Buluç Aydın
Gilbert John R
Hussain Taufique
Selvitopi Oguz
Publication venue: eScholarship, University of California
Publication date: 01/01/2022
Field of study

Combinatorial algorithms such as those that arise in graph analysis, modeling of discrete systems, bioinformatics, and chemistry, are often hard to parallelize. The Combinatorial BLAS library implements key computational primitives for rapid development of combinatorial algorithms in distributed-memory systems. During the decade since its first introduction, the Combinatorial BLAS library has evolved and expanded significantly. This article details many of the key technical features of Combinatorial BLAS version 2.0, such as communication avoidance, hierarchical parallelism via in-node multithreading, accelerator support via GPU kernels, generalized semiring support, implementations of key data structures and functions, and scalable distributed I/O operations for human-readable files. Our article also presents several rules of thumb for choosing the right data structures and functions in Combinatorial BLAS 2.0, under various common application scenarios

eScholarship - University of California

Temporal Workload-Aware Replicated Partitioning for Social Networks

Author: Ata Turk
Cevdet Aykanat
Hakan Ferhatosmanoglu
R. Oguz Selvitopi
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Replicated partitioning for undirected hypergraphs

Author: Alpert
Ata Turk
Aykanat
Aykanat
Barroso
Cahoon
Cambazoglu
Cambazoglu
Catalyurek
Cevdet Aykanat
Demir
Demir
Devine
Dingle
Enos
Fiduccia
Gilbert
Hendrickson
Hwang
Jeong
Johnson
Karp
Karypis
Karypis
Kernighan
Kužnar
Lengauer
Liu
Liu
MacFarlane
Moffat
Moffat
Pass
R. Oguz Selvitopi
Shekhar
Tomasic
Tomasic
Trifunović
Uçar
Vastenhouw
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

ADEPT: a domain independent sequence alignment strategy for gpu architectures

Author: A Azad
A Ba
A Enright
A Wozniak
Aydin Buluc
B Buchfink
B Langmead
D Doerfler
E Georganas
E Georganas
E Georganas
FdO Edans
Jack Deslippe
K Reinert
Katherine Yelick
Leonid Oliker
M Awan
M de la Bastide
M Farrar
M Shakya
M Steinegger
Muaaz G. Awan
N Ahmed
N Fox
O Gotoh
Oguz Selvitopi
R Rahn
S Altschul
S Hofmeyr
S Manavski
Steven Hofmeyr
T Rognes
T Rognes
T Smith
T Wittkop
W Pearson
Y Liu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/09/2020
Field of study

BackgroundBioinformatic workflows frequently make use of automated genome assembly and protein clustering tools. At the core of most of these tools, a significant portion of execution time is spent in determining optimal local alignment between two sequences. This task is performed with the Smith-Waterman algorithm, which is a dynamic programming based method. With the advent of modern sequencing technologies and increasing size of both genome and protein databases, a need for faster Smith-Waterman implementations has emerged. Multiple SIMD strategies for the Smith-Waterman algorithm are available for CPUs. However, with the move of HPC facilities towards accelerator based architectures, a need for an efficient GPU accelerated strategy has emerged. Existing GPU based strategies have either been optimized for a specific type of characters (Nucleotides or Amino Acids) or for only a handful of application use-cases.ResultsIn this paper, we present ADEPT, a new sequence alignment strategy for GPU architectures that is domain independent, supporting alignment of sequences from both genomes and proteins. Our proposed strategy uses GPU specific optimizations that do not rely on the nature of sequence. We demonstrate the feasibility of this strategy by implementing the Smith-Waterman algorithm and comparing it to similar CPU strategies as well as the fastest known GPU methods for each domain. ADEPT's driver enables it to scale across multiple GPUs and allows easy integration into software pipelines which utilize large scale computational systems. We have shown that the ADEPT based Smith-Waterman algorithm demonstrates a peak performance of 360 GCUPS and 497 GCUPs for protein based and DNA based datasets respectively on a single GPU node (8 GPUs) of the Cori Supercomputer. Overall ADEPT shows 10x faster performance in a node-to-node comparison against a corresponding SIMD CPU implementation.ConclusionsADEPT demonstrates a performance that is either comparable or better than existing GPU strategies. We demonstrated the efficacy of ADEPT in supporting existing bionformatics software pipelines by integrating ADEPT in MetaHipMer a high-performance denovo metagenome assembler and PASTIS a high-performance protein similarity graph construction pipeline. Our results show 10% and 30% boost of performance in MetaHipMer and PASTIS respectively

Crossref

eScholarship - University of California