28 research outputs found
Compressing Sparse Sequences under Local Decodability Constraints
We consider a variable-length source coding problem subject to local
decodability constraints. In particular, we investigate the blocklength scaling
behavior attainable by encodings of -sparse binary sequences, under the
constraint that any source bit can be correctly decoded upon probing at most
codeword bits. We consider both adaptive and non-adaptive access models,
and derive upper and lower bounds that often coincide up to constant factors.
Notably, such a characterization for the fixed-blocklength analog of our
problem remains unknown, despite considerable research over the last three
decades. Connections to communication complexity are also briefly discussed.Comment: 8 pages, 1 figure. First five pages to appear in 2015 International
Symposium on Information Theory. This version contains supplementary materia
On the Complexity of Making a Distinguished Vertex Minimum or Maximum Degree by Vertex Deletion
In this paper, we investigate the approximability of two node deletion
problems. Given a vertex weighted graph and a specified, or
"distinguished" vertex , MDD(min) is the problem of finding a minimum
weight vertex set such that becomes the
minimum degree vertex in ; and MDD(max) is the problem of
finding a minimum weight vertex set such that
becomes the maximum degree vertex in . These are known
-complete problems and have been studied from the parameterized complexity
point of view in previous work. Here, we prove that for any ,
both the problems cannot be approximated within a factor , unless . We also show that for any
, MDD(min) cannot be approximated within a factor on bipartite graphs, unless , and that for any , MDD(max) cannot be approximated within a
factor on bipartite graphs, unless . We give an factor approximation algorithm
for MDD(max) on general graphs, provided the degree of is . We
then show that if the degree of is , a similar result holds
for MDD(min). We prove that MDD(max) is -complete on 3-regular unweighted
graphs and provide an approximation algorithm with ratio when is a
3-regular unweighted graph. In addition, we show that MDD(min) can be solved in
polynomial time when is a regular graph of constant degree.Comment: 16 pages, 4 figures, submitted to Elsevier's Journal of Discrete
Algorithm
Maximizing Utility Among Selfish Users in Social Groups
We consider the problem of a social group of users trying to obtain a
"universe" of files, first from a server and then via exchange amongst
themselves. We consider the selfish file-exchange paradigm of give-and-take,
whereby two users can exchange files only if each has something unique to offer
the other. We are interested in maximizing the number of users who can obtain
the universe through a schedule of file-exchanges. We first present a practical
paradigm of file acquisition. We then present an algorithm which ensures that
at least half the users obtain the universe with high probability for files
and users when , thereby showing an
approximation ratio of 2. Extending these ideas, we show a -
approximation algorithm for , and a - approximation algorithm for , , .
Finally, we show that for any , there exists a schedule of file
exchanges which ensures that at least half the users obtain the universe.Comment: 11 pages, 3 figures; submitted for review to the National Conference
on Communications (NCC) 201
The Online Disjoint Set Cover Problem and its Applications
Given a universe of elements and a collection of subsets
of , the maximum disjoint set cover problem (DSCP) is to
partition into as many set covers as possible, where a set cover
is defined as a collection of subsets whose union is . We consider the
online DSCP, in which the subsets arrive one by one (possibly in an order
chosen by an adversary), and must be irrevocably assigned to some partition on
arrival with the objective of minimizing the competitive ratio. The competitive
ratio of an online DSCP algorithm is defined as the maximum ratio of the
number of disjoint set covers obtained by the optimal offline algorithm to the
number of disjoint set covers obtained by across all inputs. We propose an
online algorithm for solving the DSCP with competitive ratio . We then
show a lower bound of on the competitive ratio for any
online DSCP algorithm. The online disjoint set cover problem has wide ranging
applications in practice, including the online crowd-sourcing problem, the
online coverage lifetime maximization problem in wireless sensor networks, and
in online resource allocation problems.Comment: To appear in IEEE INFOCOM 201
Modeling and Correcting Bias in Sequential Evaluation
We consider the problem of sequential evaluation, in which an evaluator
observes candidates in a sequence and assigns scores to these candidates in an
online, irrevocable fashion. Motivated by the psychology literature that has
studied sequential bias in such settings -- namely, dependencies between the
evaluation outcome and the order in which the candidates appear -- we propose a
natural model for the evaluator's rating process that captures the lack of
calibration inherent to such a task. We conduct crowdsourcing experiments to
demonstrate various facets of our model. We then proceed to study how to
correct sequential bias under our model by posing this as a statistical
inference problem. We propose a near-linear time, online algorithm for this
task and prove guarantees in terms of two canonical ranking metrics. We also
prove that our algorithm is information theoretically optimal, by establishing
matching lower bounds in both metrics. Finally, we perform a host of numerical
experiments to show that our algorithm often outperforms the de facto method of
using the rankings induced by the reported scores, both in simulation and on
the crowdsourcing data that we collected
Do algorithms and barriers for sparse principal component analysis extend to other structured settings?
We study a principal component analysis problem under the spiked Wishart
model in which the structure in the signal is captured by a class of
union-of-subspace models. This general class includes vanilla sparse PCA as
well as its variants with graph sparsity. With the goal of studying these
problems under a unified statistical and computational lens, we establish
fundamental limits that depend on the geometry of the problem instance, and
show that a natural projected power method exhibits local convergence to the
statistically near-optimal neighborhood of the solution. We complement these
results with end-to-end analyses of two important special cases given by path
and tree sparsity in a general basis, showing initialization methods and
matching evidence of computational hardness. Overall, our results indicate that
several of the phenomena observed for vanilla sparse PCA extend in a natural
fashion to its structured counterparts