21 research outputs found
Coresets Meet EDCS: Algorithms for Matching and Vertex Cover on Massive Graphs
As massive graphs become more prevalent, there is a rapidly growing need for
scalable algorithms that solve classical graph problems, such as maximum
matching and minimum vertex cover, on large datasets. For massive inputs,
several different computational models have been introduced, including the
streaming model, the distributed communication model, and the massively
parallel computation (MPC) model that is a common abstraction of
MapReduce-style computation. In each model, algorithms are analyzed in terms of
resources such as space used or rounds of communication needed, in addition to
the more traditional approximation ratio.
In this paper, we give a single unified approach that yields better
approximation algorithms for matching and vertex cover in all these models. The
highlights include:
* The first one pass, significantly-better-than-2-approximation for matching
in random arrival streams that uses subquadratic space, namely a
-approximation streaming algorithm that uses space
for constant .
* The first 2-round, better-than-2-approximation for matching in the MPC
model that uses subquadratic space per machine, namely a
-approximation algorithm with memory per
machine for constant .
By building on our unified approach, we further develop parallel algorithms
in the MPC model that give a -approximation to matching and an
-approximation to vertex cover in only MPC rounds and
memory per machine. These results settle multiple open
questions posed in the recent paper of Czumaj~et.al. [STOC 2018]
REDS: Random Ensemble Deep Spatial prediction
There has been a great deal of recent interest in the development of spatial
prediction algorithms for very large datasets and/or prediction domains. These
methods have primarily been developed in the spatial statistics community, but
there has been growing interest in the machine learning community for such
methods, primarily driven by the success of deep Gaussian process regression
approaches and deep convolutional neural networks. These methods are often
computationally expensive to train and implement and consequently, there has
been a resurgence of interest in random projections and deep learning models
based on random weights -- so called reservoir computing methods. Here, we
combine several of these ideas to develop the Random Ensemble Deep Spatial
(REDS) approach to predict spatial data. The procedure uses random Fourier
features as inputs to an extreme learning machine (a deep neural model with
random weights), and with calibrated ensembles of outputs from this model based
on different random weights, it provides a simple uncertainty quantification.
The REDS method is demonstrated on simulated data and on a classic large
satellite data set