Search CORE

243,896 research outputs found

High Dimensional Semiparametric Scale-Invariant Principal Component Analysis

Author: Han Fang
Liu Han
Publication venue
Publication date: 18/02/2014
Field of study

We propose a new high dimensional semiparametric principal component analysis (PCA) method, named Copula Component Analysis (COCA). The semiparametric model assumes that, after unspecified marginally monotone transformations, the distributions are multivariate Gaussian. COCA improves upon PCA and sparse PCA in three aspects: (i) It is robust to modeling assumptions; (ii) It is robust to outliers and data contamination; (iii) It is scale-invariant and yields more interpretable results. We prove that the COCA estimators obtain fast estimation rates and are feature selection consistent when the dimension is nearly exponentially large relative to the sample size. Careful experiments confirm that COCA outperforms sparse PCA on both synthetic and real-world datasets.Comment: Accepted in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPMAI

arXiv.org e-Print Archive

Princeton University Open Access Repository

Crossref

PubMed Central

Sparse Median Graphs Estimation in a High Dimensional Semiparametric Model

Author: Caffo Brian
Han Fang
Liu Han
Publication venue
Publication date: 11/10/2013
Field of study

In this manuscript a unified framework for conducting inference on complex aggregated data in high dimensional settings is proposed. The data are assumed to be a collection of multiple non-Gaussian realizations with underlying undirected graphical structures. Utilizing the concept of median graphs in summarizing the commonality across these graphical structures, a novel semiparametric approach to modeling such complex aggregated data is provided along with robust estimation of the median graph, which is assumed to be sparse. The estimator is proved to be consistent in graph recovery and an upper bound on the rate of convergence is given. Experiments on both synthetic and real datasets are conducted to illustrate the empirical usefulness of the proposed models and methods

arXiv.org e-Print Archive

Collection Of Biostatistics Research Archive

Distribution-Free Tests of Independence in High Dimensions

Author: Chen Shizhe
Han Fang
Liu Han
Publication venue
Publication date: 20/07/2017
Field of study

We consider the testing of mutual independence among all entries in a

d

-dimensional random vector based on

n

independent observations. We study two families of distribution-free test statistics, which include Kendall's tau and Spearman's rho as important examples. We show that under the null hypothesis the test statistics of these two families converge weakly to Gumbel distributions, and propose tests that control the type I error in the high-dimensional setting where

d>n

. We further show that the two tests are rate-optimal in terms of power against sparse alternatives, and outperform competitors in simulations, especially when

d

is large.Comment: to appear in Biometrik

arXiv.org e-Print Archive

Crossref

eScholarship - University of California

On a generalized canonical bundle formula for generically finite morphisms

Author: Han Jingjun
Liu Wenfei
Publication venue
Publication date: 18/11/2020
Field of study

We prove a canonical bundle formula for generically finite morphisms in the setting of generalized pairs (with

\mathbb{R}

-coefficients). This complements Filipazzi's canonical bundle formula for morphisms with connected fibres. It is then applied to obtain a subadjunction formula for log canonical centers of generalized pairs. As another application, we show that the image of an anti-nef log canonical generalized pair has the structure of a numerically trivial log canonical generalized pair. This readily implies a result of Chen--Zhang. Along the way we prove that the Shokurov type convex sets for anti-nef log canonical divisors are indeed rational polyhedral sets.Comment: 29 pages, to appear in Ann. Inst. Fourier (Grenoble

arXiv.org e-Print Archive

Numérisation de Documents Anciens Mathématiques

Annales de l’institut Fourier (AIF)