Search CORE

1,354 research outputs found

Finding Skewed Subcubes Under a Distribution

Author: Gopalan Parikshit
Levin Roie
Wieder Udi
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 11th Innovations in Theoretical Computer Science Conference (ITCS 2020)
Publication date: 01/01/2020
Field of study

Say that we are given samples from a distribution ? over an n-dimensional space. We expect or desire ? to behave like a product distribution (or a k-wise independent distribution over its marginals for small k). We propose the problem of enumerating/list-decoding all large subcubes where the distribution ? deviates markedly from what we expect; we refer to such subcubes as skewed subcubes. Skewed subcubes are certificates of dependencies between small subsets of variables in ?. We motivate this problem by showing that it arises naturally in the context of algorithmic fairness and anomaly detection. In this work we focus on the special but important case where the space is the Boolean hypercube, and the expected marginals are uniform. We show that the obvious definition of skewed subcubes can lead to intractable list sizes, and propose a better definition of a minimal skewed subcube, which are subcubes whose skew cannot be attributed to a larger subcube that contains it. Our main technical contribution is a list-size bound for this definition and an algorithm to efficiently find all such subcubes. Both the bound and the algorithm rely on Fourier-analytic techniques, especially the powerful hypercontractive inequality. On the lower bounds side, we show that finding skewed subcubes is as hard as the sparse noisy parity problem, and hence our algorithms cannot be improved on substantially without a breakthrough on this problem which is believed to be intractable. Motivated by this, we study alternate models allowing query access to ? where finding skewed subcubes might be easier

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

List decoding Reed-Muller codes over small fields

Author: Bhowmick Abhishek
Lovett Shachar
Publication venue
Publication date: 17/07/2014
Field of study

The list decoding problem for a code asks for the maximal radius up to which any ball of that radius contains only a constant number of codewords. The list decoding radius is not well understood even for well studied codes, like Reed-Solomon or Reed-Muller codes. Fix a finite field

\mathbb{F}

. The Reed-Muller code

\mathrm{RM}_{\mathbb{F}}(n,d)

is defined by

n

-variate degree-

d

polynomials over

\mathbb{F}

. In this work, we study the list decoding radius of Reed-Muller codes over a constant prime field

\mathbb{F}=\mathbb{F}_p

, constant degree

d

and large

n

. We show that the list decoding radius is equal to the minimal distance of the code. That is, if we denote by

\delta(d)

the normalized minimal distance of

\mathrm{RM}_{\mathbb{F}}(n,d)

, then the number of codewords in any ball of radius

\delta(d)-\varepsilon

is bounded by

c=c(p,d,\varepsilon)

independent of

n

. This resolves a conjecture of Gopalan-Klivans-Zuckerman [STOC 2008], who among other results proved it in the special case of

\mathbb{F}=\mathbb{F}_2

; and extends the work of Gopalan [FOCS 2010] who proved the conjecture in the case of

d=2

. We also analyse the number of codewords in balls of radius exceeding the minimal distance of the code. For

e \leq d

, we show that the number of codewords of

\mathrm{RM}_{\mathbb{F}}(n,d)

in a ball of radius

\delta(e) - \varepsilon

is bounded by

\exp(c \cdot n^{d-e})

, where

c=c(p,d,\varepsilon)

is independent of

n

. The dependence on

n

is tight. This extends the work of Kaufman-Lovett-Porat [IEEE Inf. Theory 2012] who proved similar bounds over

\mathbb{F}_2

. The proof relies on several new ingredients: an extension of the Frieze-Kannan weak regularity to general function spaces, higher-order Fourier analysis, and an extension of the Schwartz-Zippel lemma to compositions of polynomials.Comment: fixed a bug in the proof of claim 5.6 (now lemma 5.5

arXiv.org e-Print Archive

CiteSeerX

Low Density Lattice Codes

Author: Meir Feder
Naftali Sommer
Ofir Shalvi
Senior Member
Publication venue
Publication date: 11/04/2007
Field of study

Low density lattice codes (LDLC) are novel lattice codes that can be decoded efficiently and approach the capacity of the additive white Gaussian noise (AWGN) channel. In LDLC a codeword x is generated directly at the n-dimensional Euclidean space as a linear transformation of a corresponding integer message vector b, i.e., x = Gb, where H, the inverse of G, is restricted to be sparse. The fact that H is sparse is utilized to develop a linear-time iterative decoding scheme which attains, as demonstrated by simulations, good error performance within ~0.5dB from capacity at block length of n = 100,000 symbols. The paper also discusses convergence results and implementation considerations.Comment: 24 pages, 4 figures. Submitted for publication in IEEE transactions on Information Theor

arXiv.org e-Print Archive

CiteSeerX