Speeding up Permutation Testing in Neuroimaging
Multiple hypothesis testing is a significant problem in nearly all
neuroimaging studies. In order to correct for this phenomenon, we require a
reliable estimate of the Family-Wise Error Rate (FWER). The well-known
Bonferroni correction method, while simple to implement, is quite conservative
and can substantially under-power a study because it ignores dependencies
between test statistics. Permutation testing, on the other hand, is an exact,
non-parametric method of estimating the FWER for a given α-threshold,
but for acceptably low thresholds the computational burden can be prohibitive.
In this paper, we show that permutation testing in fact amounts to populating
the columns of a very large matrix. By analyzing the spectrum of this
matrix, under certain conditions, we see that it has a low-rank plus a
low-variance residual decomposition, which makes it suitable for highly
sub-sampled matrix completion methods. Based on this observation, we propose a
novel permutation testing methodology which offers a large speedup without
sacrificing the fidelity of the estimated FWER. Our evaluations on four
different neuroimaging datasets show that a substantial computational speedup
can be achieved while recovering the FWER distribution to very high accuracy.
Further, we show that the estimated threshold is also recovered faithfully and
is stable.
Comment: NIPS 1
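A minimal Python/NumPy sketch of the object this abstract describes: each permutation fills one column of a voxel-by-permutation matrix of test statistics, and the FWER-controlling threshold is a quantile of the column-wise maxima. The data layout, the two-sample t-statistic, and all names are illustrative assumptions, and the paper's matrix-completion speedup is not implemented here.

    import numpy as np

    def permutation_fwer_threshold(data, labels, n_perm=10000, alpha=0.05, seed=0):
        # data: (n_subjects, n_voxels) array; labels: binary group label per subject
        rng = np.random.default_rng(seed)
        n_subjects, n_voxels = data.shape
        T = np.empty((n_voxels, n_perm))              # the large matrix: one column per permutation
        for j in range(n_perm):
            perm = rng.permutation(labels)            # randomly relabel the two groups
            g1, g2 = data[perm == 1], data[perm == 0]
            se = np.sqrt(g1.var(axis=0, ddof=1) / len(g1) +
                         g2.var(axis=0, ddof=1) / len(g2))
            T[:, j] = (g1.mean(axis=0) - g2.mean(axis=0)) / se   # voxel-wise t statistics
        max_stats = np.abs(T).max(axis=0)             # maximum statistic controls the FWER
        return np.quantile(max_stats, 1 - alpha)      # reject voxels whose statistic exceeds this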
Automatic Dimension Selection for a Non-negative Factorization Approach to Clustering Multiple Random Graphs
We consider a problem of grouping multiple graphs into several clusters using
singular value thresholding and non-negative factorization. We derive a model
selection information criterion to estimate the number of clusters. We
demonstrate our approach using the "Swimmer data set" as well as a simulated data
set, and compare its performance with two standard clustering algorithms.
Comment: This paper has been withdrawn by the author due to a newer version
with overlapping content.
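As a rough illustration of the pipeline sketched in this abstract (assuming each graph is given as an adjacency matrix), one can denoise the stacked, vectorized graphs by soft-thresholding their singular values and then cluster them with a non-negative factorization; the information criterion that selects the number of clusters is not reproduced, and the threshold below is arbitrary.

    import numpy as np
    from sklearn.decomposition import NMF

    def cluster_graphs(adjacencies, n_clusters, sv_threshold=1.0):
        # stack the vectorized adjacency matrices into a graphs-by-edges data matrix
        X = np.stack([A.reshape(-1) for A in adjacencies])
        U, s, Vt = np.linalg.svd(X, full_matrices=False)
        s = np.maximum(s - sv_threshold, 0.0)                  # singular value thresholding
        X_denoised = np.clip(U @ np.diag(s) @ Vt, 0.0, None)   # keep entries non-negative for NMF
        W = NMF(n_components=n_clusters, init="nndsvda", max_iter=500).fit_transform(X_denoised)
        return W.argmax(axis=1)                                # cluster label for each graph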
Assessing Information Bias and Food Safety
Imperfect information can lead to market failure and be an external factor impacting managers of agribusiness firms. A matrix-method approach to content analysis was conducted by independent judges based upon established typologies. Food safety articles from consumer publications were examined, and the information received by consumers was found to be biased.
Keywords: food safety, information bias, consumers, media, Food Consumption/Nutrition/Food Safety, Marketing, Q10, Q13, Q16
Sparse Recovery via Differential Inclusions
In this paper, we recover sparse signals from their noisy linear measurements
by solving nonlinear differential inclusions, which is based on the notion of
inverse scale space (ISS) developed in applied mathematics. Our goal here is to
bring this idea to address a challenging problem in statistics, \emph{i.e.}
finding the oracle estimator which is unbiased and sign-consistent using
dynamics. We call our dynamics \emph{Bregman ISS} and \emph{Linearized Bregman
ISS}. A well-known shortcoming of LASSO and any convex regularization
approaches lies in the bias of estimators. However, we show that under proper
conditions, there exists a bias-free and sign-consistent point on the solution
paths of such dynamics, which corresponds to a signal that is the unbiased
estimate of the true signal and whose entries have the same signs as the
true signal, \emph{i.e.} the oracle estimator. Therefore, their solution
paths are better regularization paths than the LASSO path, since
the points on the latter are biased when sign-consistency is reached. We
also show how to efficiently compute their solution paths in both continuous
and discretized settings: the full solution paths can be exactly computed piece
by piece, and a discretization leads to \emph{Linearized Bregman iteration},
which is a simple iterative thresholding rule and easy to parallelize.
Theoretical guarantees such as sign-consistency and minimax optimal $\ell_2$-error
bounds are established in both continuous and discrete settings for specific
points on the paths. Early-stopping rules for identifying these points are
given. The key treatment relies on the development of differential inequalities
for differential inclusions and their discretizations, which extends the
previous results and leads to exponentially fast recovery of sparse signals
before wrong ones are selected.
Comment: In Applied and Computational Harmonic Analysis, 201
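The Linearized Bregman iteration mentioned above is a two-line update, a gradient step on an auxiliary variable followed by soft-thresholding; below is a minimal NumPy sketch under illustrative parameter choices (the variable names and step-size rule are assumptions, not the authors' notation).

    import numpy as np

    def linearized_bregman(A, y, kappa=5.0, n_iter=2000):
        # step size chosen so that tau * kappa * ||A||_2^2 < 2, which keeps the iteration stable
        tau = 1.0 / (kappa * np.linalg.norm(A, 2) ** 2)
        z = np.zeros(A.shape[1])
        x = np.zeros(A.shape[1])
        for _ in range(n_iter):
            z = z - tau * A.T @ (A @ x - y)                            # gradient step on the auxiliary variable
            x = kappa * np.sign(z) * np.maximum(np.abs(z) - 1.0, 0.0)  # soft-thresholding (shrinkage)
        return x

    # usage: y = A @ x_true + noise for a sparse x_true; stopping the iterations early
    # traces out the regularization path discussed in the abstract.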
The Augmented Synthetic Control Method
The synthetic control method (SCM) is a popular approach for estimating the
impact of a treatment on a single unit in panel data settings. The "synthetic
control" is a weighted average of control units that balances the treated
unit's pre-treatment outcomes as closely as possible. A critical feature of the
original proposal is to use SCM only when the fit on pre-treatment outcomes is
excellent. We propose Augmented SCM as an extension of SCM to settings where
such pre-treatment fit is infeasible. Analogous to bias correction for inexact
matching, Augmented SCM uses an outcome model to estimate the bias due to
imperfect pre-treatment fit and then de-biases the original SCM estimate. Our
main proposal, which uses ridge regression as the outcome model, directly
controls pre-treatment fit while minimizing extrapolation from the convex hull.
This estimator can also be expressed as a solution to a modified synthetic
controls problem that allows negative weights on some donor units. We bound the
estimation error of this approach under different data generating processes,
including a linear factor model, and show how regularization helps to avoid
over-fitting to noise. We demonstrate gains from Augmented SCM with extensive
simulation studies and apply this framework to estimate the impact of the 2012
Kansas tax cuts on economic growth. We implement the proposed method in the new
augsynth R package.
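The ridge-augmented estimate itself is a one-line correction on top of ordinary SCM weights: the outcome model, fit on the donor units, maps the treated unit's pre-treatment fit discrepancy into a bias estimate. Below is a minimal Python sketch for a single post-treatment period; the weights w are assumed to come from a standard SCM solver, and all names are illustrative (the authors' reference implementation is the augsynth R package).

    import numpy as np
    from sklearn.linear_model import Ridge

    def augmented_scm_effect(Y0_pre, Y0_post, y1_pre, y1_post, w, alpha=1.0):
        # Y0_pre:  (N0, T0) pre-treatment outcomes of the donor units
        # Y0_post: (N0,)    post-treatment outcomes of the donor units
        # y1_pre:  (T0,)    pre-treatment outcomes of the treated unit
        # y1_post: float    post-treatment outcome of the treated unit
        # w:       (N0,)    synthetic-control weights from a standard SCM solver
        model = Ridge(alpha=alpha).fit(Y0_pre, Y0_post)        # outcome model fit on the donors
        scm = w @ Y0_post                                      # plain SCM counterfactual
        bias = model.predict(y1_pre.reshape(1, -1))[0] - w @ model.predict(Y0_pre)
        return y1_post - (scm + bias)                          # de-biased treatment effect estimate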
On the Power of Adaptivity in Matrix Completion and Approximation
We consider the related tasks of matrix completion and matrix approximation
from missing data and propose adaptive sampling procedures for both problems.
We show that adaptive sampling allows one to eliminate standard incoherence
assumptions on the matrix row space that are necessary for passive sampling
procedures. For exact recovery of a low-rank matrix, our algorithm judiciously
selects a few columns to observe in full and, with few additional measurements,
projects the remaining columns onto their span. This algorithm exactly recovers
a low-rank matrix using a number of observations governed by a coherence
parameter on the column space of the matrix. In
addition to completely eliminating any row space assumptions that have pervaded
the literature, this algorithm enjoys a better sample complexity than any
existing matrix completion algorithm. To certify that this improvement is due
to adaptive sampling, we establish that row space coherence is necessary for
passive sampling algorithms to achieve non-trivial sample complexity bounds.
For constructing a low-rank approximation to a high-rank input matrix, we
propose a simple algorithm that thresholds the singular values of a zero-filled
version of the input matrix. The algorithm computes an approximation that is
nearly as good as the best approximation of the target rank, using a number of
samples governed by a slightly different coherence parameter on the matrix
columns. Again, we eliminate assumptions on the row space.
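A minimal NumPy sketch of the column-space-tracking idea for exact recovery described above, assuming query access to single entries and to full columns (observe_entries and observe_column are hypothetical callables); the per-column sample size and tolerance are illustrative, and the paper's sampling-size guarantees are not reproduced.

    import numpy as np

    def adaptively_complete(observe_entries, observe_column, n_rows, n_cols, m, tol=1e-8, seed=0):
        rng = np.random.default_rng(seed)
        U = np.zeros((n_rows, 0))                  # current orthonormal basis for the column space
        M_hat = np.zeros((n_rows, n_cols))
        for j in range(n_cols):
            rows = rng.choice(n_rows, size=m, replace=False)
            vals = observe_entries(j, rows)        # sample a few entries of column j
            if U.shape[1] == 0:
                residual = np.inf                  # no basis yet: the first column is observed fully
            else:
                coef, *_ = np.linalg.lstsq(U[rows, :], vals, rcond=None)
                residual = np.linalg.norm(vals - U[rows, :] @ coef)
            if residual > tol:                     # new direction detected: observe the column in full
                col = observe_column(j)
                M_hat[:, j] = col
                res = col - U @ (U.T @ col)        # orthogonalize against the current basis
                U = np.column_stack([U, res / np.linalg.norm(res)])
            else:                                  # column lies in the span: reconstruct by projection
                M_hat[:, j] = U @ coef
        return M_hat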
Relevance Singular Vector Machine for low-rank matrix sensing
In this paper we develop a new Bayesian inference method for low rank matrix
reconstruction. We call the new method the Relevance Singular Vector Machine
(RSVM) where appropriate priors are defined on the singular vectors of the
underlying matrix to promote low rank. To accelerate computations, a
numerically efficient approximation is developed. The proposed algorithms are
applied to matrix completion and matrix reconstruction problems and their
performance is studied numerically.
Comment: International Conference on Signal Processing and Communications
(SPCOM), 5 pages