Search CORE

5,267 research outputs found

Upward Three-Dimensional Grid Drawings of Graphs

Author: A. Garg
C. Ware
David R. Wood
E. Giacomo Di
F. Harary
F.T. Leighton
G. Battista Di
G. Fertin
J. Nešetřil
K. Edwards
K. Edwards
L.S. Heath
L.S. Heath
L.S. Heath
L.S. Heath
M. Kaufmann
M.D. Hutton
P. Bertolazzi
P. Bose
P. Morin
R.F. Cohen
S. Felsner
T. Calamoneri
V. Dujmović
V. Dujmović
V. Dujmović
V. Dujmović
Vida Dujmović
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2005
Field of study

A \emph{three-dimensional grid drawing} of a graph is a placement of the vertices at distinct points with integer coordinates, such that the straight line segments representing the edges do not cross. Our aim is to produce three-dimensional grid drawings with small bounding box volume. We prove that every

n

-vertex graph with bounded degeneracy has a three-dimensional grid drawing with

O(n^{3/2})

volume. This is the broadest class of graphs admiting such drawings. A three-dimensional grid drawing of a directed graph is \emph{upward} if every arc points up in the z-direction. We prove that every directed acyclic graph has an upward three-dimensional grid drawing with

(n^3)

volume, which is tight for the complete dag. The previous best upper bound was

O(n^4)

. Our main result is that every

c

-colourable directed acyclic graph (

c

constant) has an upward three-dimensional grid drawing with

O(n^2)

volume. This result matches the bound in the undirected case, and improves the best known bound from

O(n^3)

for many classes of directed acyclic graphs, including planar, series parallel, and outerplanar

arXiv.org e-Print Archive

CiteSeerX

Crossref

Carleton University's Institutional Repository

Measuring and Understanding Throughput of Network Topologies

Author: Godfrey P. Brighten
Jyothi Sangeetha Abdu
Kolla Alexandra
Singla Ankit
Publication venue
Publication date: 14/11/2016
Field of study

High throughput is of particular interest in data center and HPC networks. Although myriad network topologies have been proposed, a broad head-to-head comparison across topologies and across traffic patterns is absent, and the right way to compare worst-case throughput performance is a subtle problem. In this paper, we develop a framework to benchmark the throughput of network topologies, using a two-pronged approach. First, we study performance on a variety of synthetic and experimentally-measured traffic matrices (TMs). Second, we show how to measure worst-case throughput by generating a near-worst-case TM for any given topology. We apply the framework to study the performance of these TMs in a wide range of network topologies, revealing insights into the performance of topologies with scaling, robustness of performance across TMs, and the effect of scattered workload placement. Our evaluation code is freely available

arXiv.org e-Print Archive

CiteSeerX

Crossref

Almost-Tight Distributed Minimum Cut Algorithms

Author: A. Das Sarma
D. Pritchard
D.R. Karger
H. Nagamochi
H.N. Gabow
J.A. Garay
M. Ghaffari
M. Khan
M. Stoer
M. Thorup
R. Thurimella
S. Kutten
W.T. Tutte
Z. Lotker
Publication venue
Publication date: 01/01/2014
Field of study

We study the problem of computing the minimum cut in a weighted distributed message-passing networks (the CONGEST model). Let

\lambda

be the minimum cut,

n

be the number of nodes in the network, and

D

be the network diameter. Our algorithm can compute

\lambda

exactly in

O((\sqrt{n} \log^{*} n+D)\lambda^4 \log^2 n)

time. To the best of our knowledge, this is the first paper that explicitly studies computing the exact minimum cut in the distributed setting. Previously, non-trivial sublinear time algorithms for this problem are known only for unweighted graphs when

\lambda\leq 3

due to Pritchard and Thurimella's

O(D)

-time and

O(D+n^{1/2}\log^* n)

-time algorithms for computing

2

-edge-connected and

3

-edge-connected components. By using the edge sampling technique of Karger's, we can convert this algorithm into a

(1+\epsilon)

-approximation

O((\sqrt{n}\log^{*} n+D)\epsilon^{-5}\log^3 n)

-time algorithm for any

\epsilon>0

. This improves over the previous

(2+\epsilon)

-approximation

O((\sqrt{n}\log^{*} n+D)\epsilon^{-5}\log^2 n\log\log n)

-time algorithm and

O(\epsilon^{-1})

-approximation

O(D+n^{\frac{1}{2}+\epsilon} \mathrm{poly}\log n)

-time algorithm of Ghaffari and Kuhn. Due to the lower bound of

\Omega(D+n^{1/2}/\log n)

by Das Sarma et al. which holds for any approximation algorithm, this running time is tight up to a

\mathrm{poly}\log n

factor. To get the stated running time, we developed an approximation algorithm which combines the ideas of Thorup's algorithm and Matula's contraction algorithm. It saves an

\epsilon^{-9}\log^{7} n

factor as compared to applying Thorup's tree packing theorem directly. Then, we combine Kutten and Peleg's tree partitioning algorithm and Karger's dynamic programming to achieve an efficient distributed algorithm that finds the minimum cut when we are given a spanning tree that crosses the minimum cut exactly once

arXiv.org e-Print Archive

Crossref

An efficient multi-core implementation of a novel HSS-structured multifrontal solver using randomized sampling

Author: Ghysels Pieter
Li Xiaoye S.
Napov Artem
Rouet Francois-Henry
Williams Samuel
Publication venue
Publication date: 25/02/2015
Field of study

We present a sparse linear system solver that is based on a multifrontal variant of Gaussian elimination, and exploits low-rank approximation of the resulting dense frontal matrices. We use hierarchically semiseparable (HSS) matrices, which have low-rank off-diagonal blocks, to approximate the frontal matrices. For HSS matrix construction, a randomized sampling algorithm is used together with interpolative decompositions. The combination of the randomized compression with a fast ULV HSS factorization leads to a solver with lower computational complexity than the standard multifrontal method for many applications, resulting in speedups up to 7 fold for problems in our test suite. The implementation targets many-core systems by using task parallelism with dynamic runtime scheduling. Numerical experiments show performance improvements over state-of-the-art sparse direct solvers. The implementation achieves high performance and good scalability on a range of modern shared memory parallel systems, including the Intel Xeon Phi (MIC). The code is part of a software package called STRUMPACK -- STRUctured Matrices PACKage, which also has a distributed memory component for dense rank-structured matrices

arXiv.org e-Print Archive

eScholarship - University of California

DI-fusion