7 research outputs found
Hard Properties with (Very) Short PCPPs and Their Applications
We show that there exist properties that are maximally hard for testing, while still admitting PCPPs with a proof size very close to linear. Specifically, for every fixed ?, we construct a property P^(?)? {0,1}^n satisfying the following: Any testing algorithm for P^(?) requires ?(n) many queries, and yet P^(?) has a constant query PCPP whose proof size is O(n?log^(?)n), where log^(?) denotes the ? times iterated log function (e.g., log^(2)n = log log n). The best previously known upper bound on the PCPP proof size for a maximally hard to test property was O(n?polylog(n)).
As an immediate application, we obtain stronger separations between the standard testing model and both the tolerant testing model and the erasure-resilient testing model: for every fixed ?, we construct a property that has a constant-query tester, but requires ?(n/log^(?)(n)) queries for every tolerant or erasure-resilient tester
Erasure-Resilient Sublinear-Time Graph Algorithms
We investigate sublinear-time algorithms that take partially erased graphs represented by adjacency lists as input. Our algorithms make degree and neighbor queries to the input graph and work with a specified fraction of adversarial erasures in adjacency entries. We focus on two computational tasks: testing if a graph is connected or ?-far from connected and estimating the average degree. For testing connectedness, we discover a threshold phenomenon: when the fraction of erasures is less than ?, this property can be tested efficiently (in time independent of the size of the graph); when the fraction of erasures is at least ?, then a number of queries linear in the size of the graph representation is required. Our erasure-resilient algorithm (for the special case with no erasures) is an improvement over the previously known algorithm for connectedness in the standard property testing model and has optimal dependence on the proximity parameter ?. For estimating the average degree, our results provide an "interpolation" between the query complexity for this computational task in the model with no erasures in two different settings: with only degree queries, investigated by Feige (SIAM J. Comput. `06), and with degree queries and neighbor queries, investigated by Goldreich and Ron (Random Struct. Algorithms `08) and Eden et al. (ICALP `17). We conclude with a discussion of our model and open questions raised by our work
Sublinear-Time Computation in the Presence of Online Erasures
We initiate the study of sublinear-time algorithms that access their input
via an online adversarial erasure oracle. After answering each query to the
input object, such an oracle can erase input values. Our goal is to
understand the complexity of basic computational tasks in extremely adversarial
situations, where the algorithm's access to data is blocked during the
execution of the algorithm in response to its actions. Specifically, we focus
on property testing in the model with online erasures. We show that two
fundamental properties of functions, linearity and quadraticity, can be tested
for constant with asymptotically the same complexity as in the standard
property testing model. For linearity testing, we prove tight bounds in terms
of , showing that the query complexity is . In contrast to
linearity and quadraticity, some other properties, including sortedness and the
Lipschitz property of sequences, cannot be tested at all, even for . Our
investigation leads to a deeper understanding of the structure of violations of
linearity and other widely studied properties. We also consider implications of
our results for algorithms that are resilient to online adversarial corruptions
instead of erasures
Sample-based distance-approximation for subsequence-freeness
In this work, we study the problem of approximating the distance to
subsequence-freeness in the sample-based distribution-free model. For a given
subsequence (word) , a sequence (text)
is said to contain if there exist indices
such that for every . Otherwise, is
-free. Ron and Rosin (ACM TOCT 2022) showed that the number of samples both
necessary and sufficient for one-sided error testing of subsequence-freeness in
the sample-based distribution-free model is . Denoting by
the distance of to -freeness under a distribution , we are interested in obtaining an estimate ,
such that with probability at
least , for a given distance parameter . Our main result is an
algorithm whose sample complexity is . We first
present an algorithm that works when the underlying distribution is
uniform, and then show how it can be modified to work for any (unknown)
distribution . We also show that a quadratic dependence on is
necessary
On Tolerant Testing and Tolerant Junta Testing
Over the past few decades property testing has became an active field of study in theoretical computer science. The algorithmic task is to determine, given access to an unknown large object (e.g., function, graph, probability distribution), whether it has some fixed property, or it is far from any object having the property. The approximate nature of these algorithms allows in many cases to achieve a significant saving in running time, and obtain \emph{sublinear} running time. Nevertheless, in various settings and applications, accepting only inputs that exactly have a certain property is too restrictive, and it is more beneficial to distinguish between inputs that are close to having the property, and those that are far from it. The framework of \emph{tolerant} testing tackles this exact problem. In this thesis, we will focus on one of the most fundamental properties of Boolean functions: the property of being a \emph{-junta} (i.e., being dependent on at most variables).
The first chapter focuses on algorithms for tolerant junta testing. In particular, we show that there exists a \poly(k) query algorithm distinguishing functions close to -juntas and functions that are far from -juntas. We also show how to obtain a trade-off between the ``tolerance" of the algorithm and its query complexity.
The second chapter focuses on establishing a query lower bound for tolerant junta testing. In particular, we show that any non-adaptive tolerant junta tester, is required to make at least \Omega(k^2/\polylog k) queries.
The third chapter considers tolerant testing in a more general context, and asks whether tolerant testing is strictly harder than standard testing. In particular, we show that for any constant , there exists a property \calP_\ell such that \calP_\ell can be tested in queries, but any tolerant tester for \calP_\ell is required to make at least queries (where denote the times iterated log function).
The final chapter focuses on applications. We show how to leverage the techniques developed in previous chapters to obtain results on tolerant isomorphism testing, unateness testing, and erasure resilient testing
LIPIcs, Volume 261, ICALP 2023, Complete Volume
LIPIcs, Volume 261, ICALP 2023, Complete Volum