
    Sample Complexity Bounds on Differentially Private Learning via Communication Complexity

    In this work we analyze the sample complexity of classification by differentially private algorithms. Differential privacy is a strong and well-studied notion of privacy introduced by Dwork et al. (2006) that ensures that the output of an algorithm leaks little information about the data point provided by any of the participating individuals. The sample complexity of private PAC and agnostic learning was studied in a number of prior works starting with Kasiviswanathan et al. (2008), but a number of basic questions still remain open, most notably whether learning with privacy requires more samples than learning without privacy. We show that the sample complexity of learning with (pure) differential privacy can be arbitrarily higher than the sample complexity of learning without the privacy constraint or the sample complexity of learning with approximate differential privacy. Our second contribution, and the main tool, is an equivalence between the sample complexity of (pure) differentially private learning of a concept class $C$ (denoted $SCDP(C)$) and the randomized one-way communication complexity of the evaluation problem for concepts from $C$. Using this equivalence we prove the following bounds: 1. $SCDP(C) = \Omega(LDim(C))$, where $LDim(C)$ is Littlestone's (1987) dimension characterizing the number of mistakes in the online mistake-bound learning model. Known bounds on $LDim(C)$ then imply that $SCDP(C)$ can be much higher than the VC dimension of $C$. 2. For any $t$, there exists a class $C$ such that $LDim(C) = 2$ but $SCDP(C) \geq t$. 3. For any $t$, there exists a class $C$ such that the sample complexity of (pure) $\alpha$-differentially private PAC learning is $\Omega(t/\alpha)$ but the sample complexity of the relaxed $(\alpha, \beta)$-differentially private PAC learning is $O(\log(1/\beta)/\alpha)$. This resolves an open problem of Beimel et al. (2013b).
    Comment: Extended abstract appears in Conference on Learning Theory (COLT) 201
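
    The generic learner whose pure-DP sample complexity these bounds concern can be illustrated with the exponential mechanism, as in the private-learning framework of Kasiviswanathan et al. (2008): score each hypothesis by its empirical accuracy and sample one with probability proportional to exp(alpha * score / 2). The Python sketch below is a minimal illustration of that mechanism, not the paper's construction; the finite hypothesis list `H`, the dataset format, and the toy threshold class are assumptions made for the example.

    ```python
    import math
    import random

    def exp_mech_learner(data, H, alpha):
        """Minimal sketch of a pure alpha-differentially private learner via the
        exponential mechanism. `data` is a list of (x, y) pairs, `H` a finite list
        of hypotheses h: x -> y, `alpha` the privacy parameter. The score (number
        of correctly labelled examples) has sensitivity 1, so sampling h with
        probability proportional to exp(alpha * score / 2) is alpha-DP."""
        scores = [sum(1 for x, y in data if h(x) == y) for h in H]
        m = max(scores)  # shift scores for numerical stability; ratios are unchanged
        weights = [math.exp(alpha * (s - m) / 2.0) for s in scores]
        return random.choices(H, weights=weights, k=1)[0]

    # Toy usage: privately learn a threshold over {0, ..., 9}.
    if __name__ == "__main__":
        H = [lambda x, t=t: int(x >= t) for t in range(11)]
        data = [(x, int(x >= 4)) for x in range(10)]
        h = exp_mech_learner(data, H, alpha=1.0)
        print([h(x) for x in range(10)])
    ```

    The sample complexity of this generic learner scales with log |H|; the paper's lower bound shows that for pure DP no learner can beat the Littlestone dimension, which can far exceed the VC dimension.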

    What Circuit Classes Can Be Learned with Non-Trivial Savings?

    Despite decades of intensive research, efficient - or even sub-exponential time - distribution-free PAC learning algorithms are not known for many important Boolean function classes. In this work we suggest a new perspective on these learning problems, inspired by a surge of recent research in complexity theory, in which the goal is to determine whether and how much of a savings over a naive 2^n runtime can be achieved. We establish a range of exploratory results towards this end. In more detail, (1) We first observe that a simple approach building on known uniform-distribution learning results gives non-trivial distribution-free learning algorithms for several well-studied classes including AC0, arbitrary functions of a few linear threshold functions (LTFs), and AC0 augmented with mod_p gates. (2) Next we present an approach, based on the method of random restrictions from circuit complexity, which can be used to obtain several distribution-free learning algorithms that do not appear to be achievable by approach (1) above. The results achieved in this way include learning algorithms with non-trivial savings for LTF-of-AC0 circuits and improved savings for learning parity-of-AC0 circuits. (3) Finally, our third contribution is a generic technique for converting lower bounds proved using Neciporuk's method to learning algorithms with non-trivial savings. This technique, which is the most involved of our three approaches, yields distribution-free learning algorithms for a range of classes where previously even non-trivial uniform-distribution learning algorithms were not known; these classes include full-basis formulas, branching programs, span programs, etc. up to some fixed polynomial size.
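
    A concrete way to read "a naive 2^n runtime": in the realizable, distribution-free setting any class over {0,1}^n can be learned in roughly 2^n time by simply memorizing on the order of 2^n/eps labelled samples and answering from the lookup table, since the unseen inputs then carry little probability mass. The sketch below is only this folklore baseline, not one of the paper's algorithms; the default label for unseen inputs is an arbitrary choice made for illustration.

    ```python
    import random

    def trivial_learner(sample, default=0):
        """Folklore baseline: memorize every labelled example and answer unseen
        inputs with a default label. With on the order of (2^n / eps) samples
        (up to factors depending on the confidence delta), the unseen inputs have
        total mass at most eps, so the table is an eps-accurate hypothesis. Time
        and space are O(2^n) -- the trivial bound that "non-trivial savings"
        algorithms improve on."""
        table = {x: y for x, y in sample}
        return lambda x: table.get(x, default)

    # Toy usage over n = 4 bits with the parity function as target.
    if __name__ == "__main__":
        n, m = 4, 200
        def target(x): return sum(x) % 2
        draw = lambda: tuple(random.randint(0, 1) for _ in range(n))
        h = trivial_learner([(x, target(x)) for x in (draw() for _ in range(m))])
        test = [draw() for _ in range(1000)]
        print("empirical error:", sum(h(x) != target(x) for x in test) / len(test))
    ```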

    Efficient Transductive Online Learning via Randomized Rounding

    Most traditional online learning algorithms are based on variants of mirror descent or follow-the-leader. In this paper, we present an online algorithm based on a completely different approach, tailored for transductive settings, which combines "random playout" and randomized rounding of loss subgradients. As an application of our approach, we present the first computationally efficient online algorithm for collaborative filtering with trace-norm constrained matrices. As a second application, we solve an open question linking batch learning and transductive online learning.
    Comment: To appear in a Festschrift in honor of V.N. Vapnik. Preliminary version presented in NIPS 201
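
    The randomized-rounding ingredient can be shown in isolation: a real value (for instance a coordinate of a loss subgradient) is snapped to one of the two nearest points on a coarse grid, with probabilities chosen so that the rounding is unbiased. The snippet below is only this generic primitive, not the paper's transductive algorithm, which combines it with "random playout"; the grid resolution `step` is an arbitrary choice for the example.

    ```python
    import random

    def randomized_round(v, step=0.25):
        """Round v to a multiple of `step`, picking the lower or upper grid point
        with probabilities that make the result unbiased: E[output] == v."""
        lo = step * (v // step)      # largest grid point <= v
        hi = lo + step
        p_up = (v - lo) / step       # probability of rounding up, in [0, 1]
        return hi if random.random() < p_up else lo

    # The empirical mean of many roundings of 0.6 should be close to 0.6.
    if __name__ == "__main__":
        print(sum(randomized_round(0.6) for _ in range(100_000)) / 100_000)
    ```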

    Order-Revealing Encryption and the Hardness of Private Learning

    An order-revealing encryption scheme gives a public procedure by which two ciphertexts can be compared to reveal the ordering of their underlying plaintexts. We show how to use order-revealing encryption to separate computationally efficient PAC learning from efficient $(\epsilon, \delta)$-differentially private PAC learning. That is, we construct a concept class that is efficiently PAC learnable, but for which every efficient learner fails to be differentially private. This answers a question of Kasiviswanathan et al. (FOCS '08, SIAM J. Comput. '11). To prove our result, we give a generic transformation from an order-revealing encryption scheme into one with strongly correct comparison, which enables the consistent comparison of ciphertexts that are not obtained as the valid encryption of any message. We believe this construction may be of independent interest.
    Comment: 28 pages
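
    To make the order-revealing functionality concrete, here is a toy (and deliberately insecure) sketch: the key is a random strictly increasing map, so ciphertexts can be compared by anyone using plain numeric comparison. This is really order-preserving encryption, the simplest special case of ORE, and it is only an assumed illustration of the interface, not the scheme or the transformation used in the paper.

    ```python
    import secrets
    from bisect import bisect_right

    def keygen(N=256):
        """Key = cumulative sums of random positive gaps: a random strictly
        increasing map from the message space {0, ..., N-1} to integers."""
        key, acc = [], 0
        for _ in range(N):
            acc += 1 + secrets.randbelow(1000)
            key.append(acc)
        return key

    def encrypt(key, m):
        return key[m]

    def compare(c1, c2):
        """Public comparison procedure: needs no key."""
        return (c1 > c2) - (c1 < c2)   # -1, 0, or 1

    def decrypt(key, c):
        i = bisect_right(key, c) - 1
        assert key[i] == c, "not a valid ciphertext"
        return i

    if __name__ == "__main__":
        k = keygen()
        c5, c9 = encrypt(k, 5), encrypt(k, 9)
        print(compare(c5, c9), decrypt(k, c5))   # -1 5
    ```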

    Learning Parities in the Mistake-Bound model

    We study the problem of learning parity functions that depend on at most $k$ variables ($k$-parities) attribute-efficiently in the mistake-bound model. We design a simple, deterministic, polynomial-time algorithm for learning $k$-parities with mistake bound $O(n^{1 - \frac{c}{k}})$, for any constant $c > 0$. This is the first polynomial-time algorithm that learns $\omega(1)$-parities in the mistake-bound model with mistake bound $o(n)$. Using the standard conversion techniques from the mistake-bound model to the PAC model, our algorithm can also be used for learning $k$-parities in the PAC model. In particular, this implies a slight improvement on the results of Klivans and Servedio for learning $k$-parities in the PAC model. We also show that the $\widetilde{O}(n^{k/2})$-time algorithm of Klivans and Servedio that PAC-learns $k$-parities with optimal sample complexity can be extended to the mistake-bound model.
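
    For context, the $n$-mistake baseline that these attribute-efficient bounds improve on is the folklore online parity learner based on Gaussian elimination over GF(2): whenever the queried point lies in the span of previously seen points its label is forced, so every mistake raises the rank of the constraint system, giving at most $n$ mistakes. The sketch below is that baseline under an assumed bitmask encoding, not the paper's $O(n^{1-c/k})$-mistake algorithm for $k$-parities.

    ```python
    import random

    class ParityLearner:
        """Folklore mistake-bound learner for parities over GF(2): keep a basis of
        linear constraints x . w = y (mod 2) built from the examples seen so far.
        If a new point reduces to zero against the basis its label is forced;
        otherwise guess arbitrarily and add the reduced constraint (<= n mistakes)."""

        def __init__(self, n):
            self.n = n
            self.basis = {}          # pivot bit -> (reduced vector, its label)

        def _reduce(self, x):
            """Return (residual of x against the basis, xor of labels used)."""
            lab = 0
            while x:
                p = x.bit_length() - 1
                if p not in self.basis:
                    break
                v, b = self.basis[p]
                x, lab = x ^ v, lab ^ b
            return x, lab

        def predict(self, x):
            r, lab = self._reduce(x)
            return lab if r == 0 else 0   # forced label when x is in the span

        def update(self, x, y):
            """Called with the true label y after predicting on x."""
            r, lab = self._reduce(x)
            if r:
                self.basis[r.bit_length() - 1] = (r, lab ^ y)

    if __name__ == "__main__":
        n, secret = 20, 0b1010010        # target parity chi_S, with S as a bitmask
        learner, mistakes = ParityLearner(n), 0
        for _ in range(500):
            x = random.getrandbits(n)
            y = bin(x & secret).count("1") % 2
            mistakes += learner.predict(x) != y
            learner.update(x, y)
        print("mistakes:", mistakes)     # never more than n
    ```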

    Agnostic Membership Query Learning with Nontrivial Savings: New Results, Techniques

    (Abridged) Designing computationally efficient algorithms in the agnostic learning model (Haussler, 1992; Kearns et al., 1994) is notoriously difficult. In this work, we consider agnostic learning with membership queries for touchstone classes at the frontier of agnostic learning, with a focus on how much computation can be saved over the trivial runtime of $2^n$. This approach is inspired by and continues the study of "learning with nontrivial savings" (Servedio and Tan, 2017). To this end, we establish multiple agnostic learning algorithms, highlighted by: 1. An agnostic learning algorithm for circuits consisting of a sublinear number of gates, each of which can be any function computable by a polynomial threshold function of sublogarithmic degree $k$ (the depth of the circuit is bounded only by its size). This algorithm runs in time $2^{n-s(n)}$ for $s(n) \approx n/(k+1)$, and learns over the uniform distribution over unlabelled examples on $\{0,1\}^n$. 2. An agnostic learning algorithm for circuits consisting of a sublinear number of gates, where each gate can be any function computable by a $\mathsf{SYM}^+$ circuit of subexponential size and sublogarithmic degree $k$. This algorithm runs in time $2^{n-s(n)}$ for $s(n) \approx n/(k+1)$, and learns over distributions of unlabelled examples that are products of $k+1$ arbitrary and unknown distributions, each over $\{0,1\}^{n/(k+1)}$ (assume without loss of generality that $k+1$ divides $n$).

    On the scaling limits of planar percolation

    We prove Tsirelson's conjecture that any scaling limit of the critical planar percolation is a black noise. Our theorems apply to a number of percolation models, including site percolation on the triangular grid and any subsequential scaling limit of bond percolation on the square grid. We also suggest a natural construction for the scaling limit of planar percolation, and more generally of any discrete planar model describing connectivity properties.
    Comment: With an Appendix by Christophe Garban. Published at http://dx.doi.org/10.1214/11-AOP659 in the Annals of Probability (http://www.imstat.org/aop/) by the Institute of Mathematical Statistics (http://www.imstat.org)
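
    For readers who want to experiment with the discrete models behind these scaling limits, critical site percolation on the triangular grid (each site open with probability 1/2) is easy to simulate, and crossing probabilities can be estimated by Monte Carlo. The sketch below checks left-to-right open crossings of an L-by-L rhombus in axial coordinates; the lattice size and trial count are arbitrary choices, and this only illustrates the discrete model, not the scaling-limit or black-noise constructions from the paper.

    ```python
    import random
    from collections import deque

    # Triangular lattice in axial coordinates: each site has six neighbours.
    NEIGHBORS = [(1, 0), (-1, 0), (0, 1), (0, -1), (1, -1), (-1, 1)]

    def crosses(L, p=0.5):
        """One sample of site percolation on an L x L rhombus of the triangular
        grid: is there a path of open sites from the left column to the right?"""
        open_site = [[random.random() < p for _ in range(L)] for _ in range(L)]
        queue = deque((i, 0) for i in range(L) if open_site[i][0])
        seen = set(queue)
        while queue:
            i, j = queue.popleft()
            if j == L - 1:
                return True
            for di, dj in NEIGHBORS:
                ni, nj = i + di, j + dj
                if 0 <= ni < L and 0 <= nj < L and open_site[ni][nj] and (ni, nj) not in seen:
                    seen.add((ni, nj))
                    queue.append((ni, nj))
        return False

    if __name__ == "__main__":
        L, trials = 50, 200
        print("estimated crossing probability:",
              sum(crosses(L) for _ in range(trials)) / trials)
    ```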