28 research outputs found

    Compressing Sparse Sequences under Local Decodability Constraints

    Full text link
    We consider a variable-length source coding problem subject to local decodability constraints. In particular, we investigate the blocklength scaling behavior attainable by encodings of rr-sparse binary sequences, under the constraint that any source bit can be correctly decoded upon probing at most dd codeword bits. We consider both adaptive and non-adaptive access models, and derive upper and lower bounds that often coincide up to constant factors. Notably, such a characterization for the fixed-blocklength analog of our problem remains unknown, despite considerable research over the last three decades. Connections to communication complexity are also briefly discussed.Comment: 8 pages, 1 figure. First five pages to appear in 2015 International Symposium on Information Theory. This version contains supplementary materia

    On the Complexity of Making a Distinguished Vertex Minimum or Maximum Degree by Vertex Deletion

    Full text link
    In this paper, we investigate the approximability of two node deletion problems. Given a vertex weighted graph G=(V,E)G=(V,E) and a specified, or "distinguished" vertex p∈Vp \in V, MDD(min) is the problem of finding a minimum weight vertex set SβŠ†Vβˆ–{p}S \subseteq V\setminus \{p\} such that pp becomes the minimum degree vertex in G[Vβˆ–S]G[V \setminus S]; and MDD(max) is the problem of finding a minimum weight vertex set SβŠ†Vβˆ–{p}S \subseteq V\setminus \{p\} such that pp becomes the maximum degree vertex in G[Vβˆ–S]G[V \setminus S]. These are known NPNP-complete problems and have been studied from the parameterized complexity point of view in previous work. Here, we prove that for any Ο΅>0\epsilon > 0, both the problems cannot be approximated within a factor (1βˆ’Ο΅)log⁑n(1 - \epsilon)\log n, unless NPβŠ†DTIME(nlog⁑log⁑n)NP \subseteq DTIME(n^{\log\log n}). We also show that for any Ο΅>0\epsilon > 0, MDD(min) cannot be approximated within a factor (1βˆ’Ο΅)log⁑n(1 -\epsilon)\log n on bipartite graphs, unless NPβŠ†DTIME(nlog⁑log⁑n)NP \subseteq DTIME(n^{\log\log n}), and that for any Ο΅>0\epsilon > 0, MDD(max) cannot be approximated within a factor (1/2βˆ’Ο΅)log⁑n(1/2 - \epsilon)\log n on bipartite graphs, unless NPβŠ†DTIME(nlog⁑log⁑n)NP \subseteq DTIME(n^{\log\log n}). We give an O(log⁑n)O(\log n) factor approximation algorithm for MDD(max) on general graphs, provided the degree of pp is O(log⁑n)O(\log n). We then show that if the degree of pp is nβˆ’O(log⁑n)n-O(\log n), a similar result holds for MDD(min). We prove that MDD(max) is APXAPX-complete on 3-regular unweighted graphs and provide an approximation algorithm with ratio 1.5831.583 when GG is a 3-regular unweighted graph. In addition, we show that MDD(min) can be solved in polynomial time when GG is a regular graph of constant degree.Comment: 16 pages, 4 figures, submitted to Elsevier's Journal of Discrete Algorithm

    Maximizing Utility Among Selfish Users in Social Groups

    Full text link
    We consider the problem of a social group of users trying to obtain a "universe" of files, first from a server and then via exchange amongst themselves. We consider the selfish file-exchange paradigm of give-and-take, whereby two users can exchange files only if each has something unique to offer the other. We are interested in maximizing the number of users who can obtain the universe through a schedule of file-exchanges. We first present a practical paradigm of file acquisition. We then present an algorithm which ensures that at least half the users obtain the universe with high probability for nn files and m=O(log⁑n)m=O(\log n) users when nβ†’βˆžn\rightarrow\infty, thereby showing an approximation ratio of 2. Extending these ideas, we show a 1+Ο΅11+\epsilon_1 - approximation algorithm for m=O(n)m=O(n), Ο΅1>0\epsilon_1>0 and a (1+z)/2+Ο΅2(1+z)/2 +\epsilon_2 - approximation algorithm for m=O(nz)m=O(n^z), z>1z>1, Ο΅2>0\epsilon_2>0. Finally, we show that for any m=O(eo(n))m=O(e^{o(n)}), there exists a schedule of file exchanges which ensures that at least half the users obtain the universe.Comment: 11 pages, 3 figures; submitted for review to the National Conference on Communications (NCC) 201

    The Online Disjoint Set Cover Problem and its Applications

    Full text link
    Given a universe UU of nn elements and a collection of subsets S\mathcal{S} of UU, the maximum disjoint set cover problem (DSCP) is to partition S\mathcal{S} into as many set covers as possible, where a set cover is defined as a collection of subsets whose union is UU. We consider the online DSCP, in which the subsets arrive one by one (possibly in an order chosen by an adversary), and must be irrevocably assigned to some partition on arrival with the objective of minimizing the competitive ratio. The competitive ratio of an online DSCP algorithm AA is defined as the maximum ratio of the number of disjoint set covers obtained by the optimal offline algorithm to the number of disjoint set covers obtained by AA across all inputs. We propose an online algorithm for solving the DSCP with competitive ratio ln⁑n\ln n. We then show a lower bound of Ω(ln⁑n)\Omega(\sqrt{\ln n}) on the competitive ratio for any online DSCP algorithm. The online disjoint set cover problem has wide ranging applications in practice, including the online crowd-sourcing problem, the online coverage lifetime maximization problem in wireless sensor networks, and in online resource allocation problems.Comment: To appear in IEEE INFOCOM 201

    Modeling and Correcting Bias in Sequential Evaluation

    Full text link
    We consider the problem of sequential evaluation, in which an evaluator observes candidates in a sequence and assigns scores to these candidates in an online, irrevocable fashion. Motivated by the psychology literature that has studied sequential bias in such settings -- namely, dependencies between the evaluation outcome and the order in which the candidates appear -- we propose a natural model for the evaluator's rating process that captures the lack of calibration inherent to such a task. We conduct crowdsourcing experiments to demonstrate various facets of our model. We then proceed to study how to correct sequential bias under our model by posing this as a statistical inference problem. We propose a near-linear time, online algorithm for this task and prove guarantees in terms of two canonical ranking metrics. We also prove that our algorithm is information theoretically optimal, by establishing matching lower bounds in both metrics. Finally, we perform a host of numerical experiments to show that our algorithm often outperforms the de facto method of using the rankings induced by the reported scores, both in simulation and on the crowdsourcing data that we collected

    Do algorithms and barriers for sparse principal component analysis extend to other structured settings?

    Full text link
    We study a principal component analysis problem under the spiked Wishart model in which the structure in the signal is captured by a class of union-of-subspace models. This general class includes vanilla sparse PCA as well as its variants with graph sparsity. With the goal of studying these problems under a unified statistical and computational lens, we establish fundamental limits that depend on the geometry of the problem instance, and show that a natural projected power method exhibits local convergence to the statistically near-optimal neighborhood of the solution. We complement these results with end-to-end analyses of two important special cases given by path and tree sparsity in a general basis, showing initialization methods and matching evidence of computational hardness. Overall, our results indicate that several of the phenomena observed for vanilla sparse PCA extend in a natural fashion to its structured counterparts
    corecore