Search CORE

1,157 research outputs found

Space-Optimal Profile Estimation in Data Streams with Applications to Symmetric Functions

Author: Chen Justin Y.
Indyk Piotr
Woodruff David P.
Publication venue
Publication date: 29/11/2023
Field of study

We revisit the problem of estimating the profile (also known as the rarity) in the data stream model. Given a sequence of

m

elements from a universe of size

n

, its profile is a vector

\phi

whose

i

-th entry

\phi_i

represents the number of distinct elements that appear in the stream exactly

i

times. A classic paper by Datar and Muthukrishan from 2002 gave an algorithm which estimates any entry

\phi_i

up to an additive error of

\pm \epsilon D

using

O(1/\epsilon^2 (\log n + \log m))

bits of space, where

D

is the number of distinct elements in the stream. In this paper, we considerably improve on this result by designing an algorithm which simultaneously estimates many coordinates of the profile vector

\phi

up to small overall error. We give an algorithm which, with constant probability, produces an estimated profile

\hat\phi

with the following guarantees in terms of space and estimation error: - For any constant

\tau

, with

O(1 / \epsilon^2 + \log n)

bits of space,

\sum_{i=1}^\tau |\phi_i - \hat\phi_i| \leq \epsilon D

. - With

O(1/ \epsilon^2\log (1/\epsilon) + \log n + \log \log m)

bits of space,

\sum_{i=1}^m |\phi_i - \hat\phi_i| \leq \epsilon m

. In addition to bounding the error across multiple coordinates, our space bounds separate the terms that depend on

1/\epsilon

and those that depend on

n

and

m

. We prove matching lower bounds on space in both regimes. Application of our profile estimation algorithm gives estimates within error

\pm \epsilon D

of several symmetric functions of frequencies in

O(1/\epsilon^2 + \log n)

bits. This generalizes space-optimal algorithms for the distinct elements problems to other problems including estimating the Huber and Tukey losses as well as frequency cap statistics.Comment: To appear in ITCS 202

arXiv.org e-Print Archive

Improved Frequency Estimation Algorithms with and without Predictions

Author: Aamand Anders
Chen Justin Y.
Nguyen Huy Lê
Silwal Sandeep
Vakilian Ali
Publication venue
Publication date: 12/12/2023
Field of study

Estimating frequencies of elements appearing in a data stream is a key task in large-scale data analysis. Popular sketching approaches to this problem (e.g., CountMin and CountSketch) come with worst-case guarantees that probabilistically bound the error of the estimated frequencies for any possible input. The work of Hsu et al. (2019) introduced the idea of using machine learning to tailor sketching algorithms to the specific data distribution they are being run on. In particular, their learning-augmented frequency estimation algorithm uses a learned heavy-hitter oracle which predicts which elements will appear many times in the stream. We give a novel algorithm, which in some parameter regimes, already theoretically outperforms the learning based algorithm of Hsu et al. without the use of any predictions. Augmenting our algorithm with heavy-hitter predictions further reduces the error and improves upon the state of the art. Empirically, our algorithms achieve superior performance in all experiments compared to prior approaches.Comment: NeurIPS 202

arXiv.org e-Print Archive

Neither dust nor black carbon causing apparent albedo decline in Greenland\u27s dry snow zone: Implications for MODIS C5 surface reflectance

Author: Bergin Mike
Chen Justin Y
Courville Zoe R
Dibb Jack E.
Flanner Mark G
Lai Alexandra M
Polashenski Chris
Schauer James J
Shafer Martin M
Publication venue: University of New Hampshire Scholars\u27 Repository
Publication date: 03/11/2015
Field of study

Remote sensing observations suggest Greenland ice sheet (GrIS) albedo has declined since 2001, even in the dry snow zone. We seek to explain the apparent dry snow albedo decline. We analyze samples representing 2012–2014 snowfall across NW Greenland for black carbon and dust light-absorbing impurities (LAI) and model their impacts on snow albedo. Albedo reductions due to LAI are small, averaging 0.003, with episodic enhancements resulting in reductions of 0.01–0.02. No significant increase in black carbon or dust concentrations relative to recent decades is found. Enhanced deposition of LAI is not, therefore, causing significant dry snow albedo reduction or driving melt events. Analysis of Collection 5 Moderate Resolution Imaging Spectroradiometer (MODIS) surface reflectance data indicates that the decline and spectral shift in dry snow albedo contains important contributions from uncorrected Terra sensor degradation. Though discrepancies are mostly below the stated accuracy of MODIS products, they will require revisiting some prior conclusions with C6 data

Crossref

UNH Scholars' Repository

Increased Expression of M1 and M2 Phenotypic Markers in Isolated Microglia After Four-Day Binge Alcohol Exposure in Male Rats

Author: Chen Kevin Y.
McClain Justin A.
Nickell Chelsea Rhea Geil
Nixon Kimberly
Peng Hui
Publication venue: UKnowledge
Publication date: 01/08/2017
Field of study

Microglia activation and neuroinflammation are common features of neurodegenerative conditions, including alcohol use disorders (AUDs). When activated, microglia span a continuum of diverse phenotypes ranging from classically activated, pro-inflammatory (M1) microglia/macrophages to alternatively activated, growth-promoting (M2) microglia/macrophages. Identifying microglia phenotypes is critical for understanding the role of microglia in the pathogenesis of AUDs. Therefore, male rats were gavaged with 25% (w/v) ethanol or isocaloric control diet every 8 h for 4 days and sacrificed at 0, 2, 4, and 7 days after alcohol exposure (e.g., T0, T2, etc.). Microglia were isolated from hippocampus and entorhinal cortices by Percoll density gradient centrifugation. Cells were labeled with microglia surface antigens and analyzed by flow cytometry. Consistent with prior studies, isolated cells yielded a highly enriched population of brain macrophages/microglia (\u3e 95% pure), evidenced by staining for the macrophage/microglia antigen CD11b. Polarization states of CD11b+CD45low microglia were evaluated by expression of M1 surface markers, major histocompatibility complex (MHC) II, CD32, CD86, and M2 surface marker, CD206 (mannose receptor). Ethanol-treated animals begin to show increased expression of M1 and M2 markers at T0 (p = n.s.), with significant changes at the T2 time point. At T2, expression of M1 markers, MHC-II, CD86, and CD32 were increased (p \u3c 0.05) in hippocampus and entorhinal cortices, while M2 marker, CD206, was increased significantly only in entorhinal cortices (p \u3c 0.05). All effects resolved to control levels by T4. In summary, four-day binge alcohol exposure produces a transient increase in both M1 (MHC-II, CD32, and CD86) and M2 (CD206) populations of microglia isolated from the entorhinal cortex and hippocampus. Thus, these findings that both pro-inflammatory and potentially beneficial, recovery-promoting microglia phenotypes can be observed after a damaging exposure of alcohol are critically important to our understanding of the role of microglia in the pathogenesis of AUDs

Crossref

University of Kentucky

The Renormalization Group and Singular Perturbations: Multiple-Scales, Boundary Layers and Reductive Perturbation Theory

Author: A. C. Newell
A. J. Roberts
C. M. Bender
D. J. Amit
E. J. Hinch
E. Stueckelberg
G. C. Paquette
G. Green
G. I. Barenblatt
G. Paquette
J. A. Murdock
J. Bricmont
J. Bricmont
J. Carr
J. Carr
J. Swift
J. Zinn-Justin
K. Fujimura
K. G. Wilson
K. G. Wilson
K. Wilson
L. D. Landau
L.-Y. Chen
L.-Y. Chen
L.-Y. Chen
L.-Y. Chen
L.-Y. Chen
L.-Y. Chen
Lin-Yuan Chen
M. C. Cross
M. Frasca
M. Gell-Mann
N. D. Goldenfeld
N. Goldenfeld
N. Goldenfeld
N. Goldenfeld
N. Goldenfeld
N. Goldenfeld
N. N. Bogoliubov
N. N. Bogoliubov
N. N. Bogoliubov
Nigel Goldenfeld
R. E. Langer
R. E. Langer
R. E. Langer
S. L. Woodruff
T. Taniuti
Y. Kuramoto
Y. Oono
Y. Oono
Publication venue: 'American Physical Society (APS)'
Publication date: 23/06/1995
Field of study

Perturbative renormalization group theory is developed as a unified tool for global asymptotic analysis. With numerous examples, we illustrate its application to ordinary differential equation problems involving multiple scales, boundary layers with technically difficult asymptotic matching, and WKB analysis. In contrast to conventional methods, the renormalization group approach requires neither {\it ad hoc\/} assumptions about the structure of perturbation series nor the use of asymptotic matching. Our renormalization group approach provides approximate solutions which are practically superior to those obtained conventionally, although the latter can be reproduced, if desired, by appropriate expansion of the renormalization group approximant. We show that the renormalization group equation may be interpreted as an amplitude equation, and from this point of view develop reductive perturbation theory for partial differential equations describing spatially-extended systems near bifurcation points, deriving both amplitude equations and the center manifold.Comment: 44 pages, 2 Postscript figures, macro \uiucmac.tex available at macro archives or at ftp://gijoe.mrl.uiuc.edu/pu

arXiv.org e-Print Archive

Crossref

Renormalization Group Theory for Global Asymptotic Analysis

Author: A. C. Newell
C. M. Bender
E. C. G. Stueckelberg
E. J. Hinch
G. I. Barenblatt
G. Paquette
J. Bricmont
J. D. Crawford
J. Kevorkian
J. Swift
J. Zinn-Justin
L. Y. Chen
Lin Yuan Chen
M. C. Cross
M. Gell-Mann
N. D. Goldenfeld
N. Goldenfeld
N. Goldenfeld
Nigel Goldenfeld
S. L. Woodruff
T. Taniuti
Y. Kuramoto
Y. Oono
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/1994
Field of study

We show with several examples that renormalization group (RG) theory can be used to understand singular and reductive perturbation methods in a unified fashion. Amplitude equations describing slow motion dynamics in nonequilibrium phenomena are RG equations. The renormalized perturbation approach may be simpler to use than other approaches, because it does not require the use of asymptotic matching, and yields practically superior approximations.Comment: 13 pages, plain tex + uiucmac.tex (available from babbage.sissa.it), one PostScript figure appended at end. Or (easier) get compressed postscript file by anon ftp from gijoe.mrl.uiuc.edu (128.174.119.153), file /pub/rg_sing_prl.ps.

arXiv.org e-Print Archive

CiteSeerX

Crossref

Exponentially Improving the Complexity of Simulating the Weisfeiler-Lehman Test with Graph Neural Networks

Author: Aamand Anders
Chen Justin Y.
Indyk Piotr
Narayanan Shyam
Rubinfeld Ronitt
Schiefer Nicholas
Silwal Sandeep
Wagner Tal
Publication venue
Publication date: 06/11/2022
Field of study

Recent work shows that the expressive power of Graph Neural Networks (GNNs) in distinguishing non-isomorphic graphs is exactly the same as that of the Weisfeiler-Lehman (WL) graph test. In particular, they show that the WL test can be simulated by GNNs. However, those simulations involve neural networks for the 'combine' function of size polynomial or even exponential in the number of graph nodes

n

, as well as feature vectors of length linear in

n

. We present an improved simulation of the WL test on GNNs with \emph{exponentially} lower complexity. In particular, the neural network implementing the combine function in each node has only a polylogarithmic number of parameters in

n

, and the feature vectors exchanged by the nodes of GNN consists of only

O(\log n)

bits. We also give logarithmic lower bounds for the feature vector length and the size of the neural networks, showing the (near)-optimality of our construction.Comment: 22 pages,5 figures, accepted at NeurIPS 202

arXiv.org e-Print Archive

Recommended from our members

An ANGPTL4-ceramide-protein kinase Cζ axis mediates chronic glucocorticoid exposure-induced hepatic steatosis and hypertriglyceridemia in mice.

Author: Cheang Rachel T
Chen Tzu-Chieh
Fitch Mark D
Gray Nora E
Hellerstein Marc K
Kanamaluru Deepthi
Lee Justin Y
Lee Rebecca A
Tan Jenna H
Tsai Sam L
Wang Jen-Chywan
Yiv Nicholas
Publication venue: eScholarship, University of California
Publication date: 01/06/2019
Field of study

Chronic or excess glucocorticoid exposure causes lipid disorders such as hypertriglyceridemia and hepatic steatosis. Angptl4 (angiopoietin-like 4), a primary target gene of the glucocorticoid receptor in hepatocytes and adipocytes, is required for hypertriglyceridemia and hepatic steatosis induced by the synthetic glucocorticoid dexamethasone. Angptl4 has also been shown to be required for dexamethasone-induced hepatic ceramide production. Here, we further examined the role of ceramide-mediated signaling in hepatic dyslipidemia caused by chronic glucocorticoid exposure. Using a stable isotope-labeling technique, we found that dexamethasone treatment induced the rate of hepatic de novo lipogenesis and triglyceride synthesis. These dexamethasone responses were compromised in Angptl4-null mice (Angptl4-/-). Treating mice with myriocin, an inhibitor of the rate-controlling enzyme of de novo ceramide synthesis, serine palmitoyltransferase long-chain base subunit 1 (SPTLC1)/SPTLC2, decreased dexamethasone-induced plasma and liver triglyceride levels in WT but not Angptl4-/- mice. We noted similar results in mice infected with adeno-associated virus-expressing small hairpin RNAs targeting Sptlc2. Protein phosphatase 2 phosphatase activator (PP2A) and protein kinase Cζ (PKCζ) are two known downstream effectors of ceramides. We found here that mice treated with an inhibitor of PKCζ, 2-acetyl-1,3-cyclopentanedione (ACPD), had lower levels of dexamethasone-induced triglyceride accumulation in plasma and liver. However, small hairpin RNA-mediated targeting of the catalytic PP2A subunit (Ppp2ca) had no effect on dexamethasone responses on plasma and liver triglyceride levels. Overall, our results indicate that chronic dexamethasone treatment induces an ANGPTL4-ceramide-PKCζ axis that activates hepatic de novo lipogenesis and triglyceride synthesis, resulting in lipid disorders

eScholarship - University of California