
    A Convex Formulation for Mixed Regression with Two Components: Minimax Optimal Rates

    We consider the mixed regression problem with two components, under adversarial and stochastic noise. We give a convex optimization formulation that provably recovers the true solution, and provide upper bounds on the recovery errors for both arbitrary noise and stochastic noise settings. We also give matching minimax lower bounds (up to log factors), showing that under certain assumptions, our algorithm is information-theoretically optimal. Our results represent the first tractable algorithm guaranteeing successful recovery with tight bounds on recovery errors and sample complexity. Comment: Added results on minimax lower bounds, which match our upper bounds on recovery errors up to log factors. Appeared in the Conference on Learning Theory (COLT), 2014. (JMLR W&CP 35:560-604, 2014)
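
    The abstract does not reproduce the convex program itself, so the snippet below is a minimal, illustrative sketch of the two-component mixed regression setup using a simple EM-style alternating scheme; it is a stand-in for demonstration, not the paper's convex formulation, and every name and data-generating choice in it is an assumption.

```python
# A minimal, illustrative sketch (an assumption-laden stand-in, NOT the
# paper's convex program): EM-style alternating minimization for mixed
# linear regression with two components.
import numpy as np

def mixed_regression_em(X, y, n_iters=50, seed=0):
    """Alternate between assigning each sample to the better-fitting
    component and refitting each component by least squares."""
    rng = np.random.default_rng(seed)
    n, p = X.shape
    beta1, beta2 = rng.normal(size=p), rng.normal(size=p)
    for _ in range(n_iters):
        # Assignment step: each sample goes to the component with the
        # smaller squared residual.
        mask = (y - X @ beta1) ** 2 <= (y - X @ beta2) ** 2
        # Refit step: ordinary least squares on each subset.
        if mask.sum() >= p:
            beta1, *_ = np.linalg.lstsq(X[mask], y[mask], rcond=None)
        if (~mask).sum() >= p:
            beta2, *_ = np.linalg.lstsq(X[~mask], y[~mask], rcond=None)
    return beta1, beta2

# Synthetic demo: each response comes from one of two ground-truth
# regressors, plus Gaussian noise.
rng = np.random.default_rng(1)
n, p = 400, 5
X = rng.normal(size=(n, p))
b1, b2 = rng.normal(size=p), rng.normal(size=p)
z = rng.random(n) < 0.5                      # latent component labels
y = np.where(z, X @ b1, X @ b2) + 0.05 * rng.normal(size=n)
est1, est2 = mixed_regression_em(X, y)
```

    Alternating schemes like this are sensitive to initialization and come with no recovery guarantee; that gap is precisely what the paper's convex formulation and matching minimax bounds address.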

    Optimal linear estimation under unknown nonlinear transform

    Linear regression studies the problem of estimating a model parameter $\beta^* \in \mathbb{R}^p$ from $n$ observations $\{(y_i, \mathbf{x}_i)\}_{i=1}^n$ drawn from the linear model $y_i = \langle \mathbf{x}_i, \beta^* \rangle + \epsilon_i$. We consider a significant generalization in which the relationship between $\langle \mathbf{x}_i, \beta^* \rangle$ and $y_i$ is noisy, quantized to a single bit, potentially nonlinear, noninvertible, and unknown. This model is known as the single-index model in statistics, and, among other things, it represents a significant generalization of one-bit compressed sensing. We propose a novel spectral-based estimation procedure and show that we can recover $\beta^*$ in settings (i.e., classes of link function $f$) where previous algorithms fail. In general, our algorithm requires only very mild restrictions on the (unknown) functional relationship between $y_i$ and $\langle \mathbf{x}_i, \beta^* \rangle$. We also consider the high-dimensional setting where $\beta^*$ is sparse, and introduce a two-stage nonconvex framework that addresses estimation challenges in high-dimensional regimes where $p \gg n$. For a broad class of link functions between $\langle \mathbf{x}_i, \beta^* \rangle$ and $y_i$, we establish minimax lower bounds that demonstrate the optimality of our estimators in both the classical and high-dimensional regimes. Comment: 25 pages, 3 figures
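
    To make the spectral idea concrete, below is a hedged sketch of a standard second-moment spectral estimator for a single-index model under Gaussian design; it is in the spirit of the abstract but not necessarily the paper's exact procedure, and the link function in the demo is an arbitrary choice.

```python
# A hedged sketch of a second-moment spectral estimator for the
# single-index model y_i = f(<x_i, beta*>) under Gaussian design. This is
# one standard spectral recipe consistent with the abstract, not
# necessarily the paper's exact procedure.
import numpy as np

def spectral_direction(X, y):
    """Under x_i ~ N(0, I), M = (1/n) sum_i y_i x_i x_i^T - mean(y) * I
    is (in expectation) a rank-one matrix proportional to beta* beta*^T,
    so its top eigenvector estimates the direction of beta*."""
    n, p = X.shape
    M = (X.T * y) @ X / n - y.mean() * np.eye(p)
    vals, vecs = np.linalg.eigh(M)
    return vecs[:, np.argmax(np.abs(vals))]   # largest |eigenvalue|

# Demo with a one-bit, nonlinear, noninvertible link: y = sign(|<x,b>| - 1).
rng = np.random.default_rng(0)
n, p = 5000, 10
beta = rng.normal(size=p)
beta /= np.linalg.norm(beta)
X = rng.normal(size=(n, p))
y = np.sign(np.abs(X @ beta) - 1.0)
b_hat = spectral_direction(X, y)
print(abs(b_hat @ beta))   # close to 1; the sign of beta* is unidentifiable here
```

    For an even-symmetric link like the one in the demo, the first-moment estimator $\frac{1}{n}\sum_i y_i \mathbf{x}_i$ averages to zero, which is exactly the kind of regime where first-order methods fail and a second-moment spectral step is needed.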

    Improving Multi-Task Generalization via Regularizing Spurious Correlation

    Multi-Task Learning (MTL) is a powerful learning paradigm for improving generalization performance via knowledge sharing. However, existing studies find that MTL can sometimes hurt generalization, especially when two tasks are less correlated. One possible cause is spurious correlation: some knowledge is spurious and not causally related to task labels, but the model may mistakenly exploit it and thus fail when such correlations change. In the MTL setup, spurious correlation poses several unique challenges. First, the risk of encoding non-causal knowledge is higher, as the shared MTL model needs to encode all knowledge from different tasks, and causal knowledge for one task could be spurious for another. Second, confounders between task labels introduce a different type of spurious correlation to MTL. We theoretically prove that MTL is more prone than single-task learning to absorbing non-causal knowledge from other tasks, and thus generalizes worse. To solve this problem, we propose the Multi-Task Causal Representation Learning framework, which represents multi-task knowledge via disentangled neural modules and learns which module is causally related to each task via an MTL-specific invariant regularization. Experiments show that it improves MTL model performance by 5.5% on average over Multi-MNIST, MovieLens, Taskonomy, CityScape, and NYUv2 by alleviating the spurious correlation problem. Comment: Published at NeurIPS 2022
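
    As a rough illustration of the modular idea (not the authors' released implementation), the following PyTorch sketch routes shared knowledge modules to tasks through learnable gates; all class and parameter names are assumptions, and a simple L1 sparsity penalty stands in for the paper's MTL-specific invariant regularization.

```python
# A rough PyTorch sketch of the modular idea (assumptions throughout; this
# is not the authors' released code). Shared "knowledge modules" are
# routed to tasks through learnable gates, and an L1 penalty on the gates
# stands in for the paper's MTL-specific invariant regularization.
import torch
import torch.nn as nn

class ModularMTL(nn.Module):
    def __init__(self, in_dim, hidden_dim, num_modules, task_out_dims):
        super().__init__()
        # Disentangled neural modules shared across all tasks.
        self.knowledge_modules = nn.ModuleList(
            nn.Sequential(nn.Linear(in_dim, hidden_dim), nn.ReLU())
            for _ in range(num_modules)
        )
        # Per-task routing weights over the modules.
        self.gates = nn.Parameter(torch.ones(len(task_out_dims), num_modules))
        self.heads = nn.ModuleList(nn.Linear(hidden_dim, d) for d in task_out_dims)

    def forward(self, x, task_id):
        w = torch.softmax(self.gates[task_id], dim=0)         # module weights
        feats = torch.stack([m(x) for m in self.knowledge_modules])
        shared = (w[:, None, None] * feats).sum(dim=0)        # gated mixture
        return self.heads[task_id](shared)

def mtl_loss(model, task_batches, criterion, gate_l1=1e-3):
    """Sum of per-task losses plus a sparsity penalty on the gates, so each
    task keeps only the modules it actually needs."""
    loss = sum(criterion(model(x, t), y) for t, (x, y) in enumerate(task_batches))
    return loss + gate_l1 * model.gates.abs().sum()
```

    A real training loop would backpropagate mtl_loss over mini-batches drawn from every task; gate_l1 is a tuning knob, and the published framework's invariant regularization is more involved than the L1 surrogate shown here.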