CORE
πΊπ¦Β
Β make metadata, not war
Services
Services overview
Explore all CORE services
Access to raw data
API
Dataset
FastSync
Content discovery
Recommender
Discovery
OAI identifiers
OAI Resolver
Managing content
Dashboard
Bespoke contracts
Consultancy services
Support us
Support us
Membership
Sponsorship
Research partnership
About
About
About us
Our mission
Team
Blog
FAQs
Contact us
Community governance
Governance
Advisory Board
Board of supporters
Research network
Innovations
Our research
Labs
research
Recursive Sketching For Frequency Moments
Authors
Vladimir Braverman
Rafail Ostrovsky
Publication date
11 November 2010
Publisher
View
on
arXiv
Abstract
In a ground-breaking paper, Indyk and Woodruff (STOC 05) showed how to compute
F
k
F_k
F
k
β
(for
k
>
2
k>2
k
>
2
) in space complexity
O(\mbox{\em poly-log}(n,m)\cdot n^{1-\frac2k})
, which is optimal up to (large) poly-logarithmic factors in
n
n
n
and
m
m
m
, where
m
m
m
is the length of the stream and
n
n
n
is the upper bound on the number of distinct elements in a stream. The best known lower bound for large moments is
Ξ©
(
log
β‘
(
n
)
n
1
β
2
k
)
\Omega(\log(n)n^{1-\frac2k})
Ξ©
(
lo
g
(
n
)
n
1
β
k
2
β
)
. A follow-up work of Bhuvanagiri, Ganguly, Kesh and Saha (SODA 2006) reduced the poly-logarithmic factors of Indyk and Woodruff to
O
(
log
β‘
2
(
m
)
β
(
log
β‘
n
+
log
β‘
m
)
β
n
1
β
2
k
)
O(\log^2(m)\cdot (\log n+ \log m)\cdot n^{1-{2\over k}})
O
(
lo
g
2
(
m
)
β
(
lo
g
n
+
lo
g
m
)
β
n
1
β
k
2
β
)
. Further reduction of poly-log factors has been an elusive goal since 2006, when Indyk and Woodruff method seemed to hit a natural "barrier." Using our simple recursive sketch, we provide a different yet simple approach to obtain a
O
(
log
β‘
(
m
)
log
β‘
(
n
m
)
β
(
log
β‘
log
β‘
n
)
4
β
n
1
β
2
k
)
O(\log(m)\log(nm)\cdot (\log\log n)^4\cdot n^{1-{2\over k}})
O
(
lo
g
(
m
)
lo
g
(
nm
)
β
(
lo
g
lo
g
n
)
4
β
n
1
β
k
2
β
)
algorithm for constant
Ο΅
\epsilon
Ο΅
(our bound is, in fact, somewhat stronger, where the
(
log
β‘
log
β‘
n
)
(\log\log n)
(
lo
g
lo
g
n
)
term can be replaced by any constant number of
log
β‘
\log
lo
g
iterations instead of just two or three, thus approaching
l
o
g
β
n
log^*n
l
o
g
β
n
. Our bound also works for non-constant
Ο΅
\epsilon
Ο΅
(for details see the body of the paper). Further, our algorithm requires only
4
4
4
-wise independence, in contrast to existing methods that use pseudo-random generators for computing large frequency moments
Similar works
Full text
Available Versions
CiteSeerX
See this paper in CORE
Go to the repository landing page
Download from data provider
oai:CiteSeerX.psu:10.1.1.755.3...
Last time updated on 30/10/2017