One of the crucial tasks in many inference problems is the extraction of
sparse information from a given number of high-dimensional measurements. In
machine learning, this is frequently achieved using, as a penalty term, the
Lp norm of the model parameters, with p≤1 for efficient dilution. Here
we propose a statistical-mechanics analysis of the problem in the setting of
perceptron memorization and generalization. Using a replica approach, we are
able to evaluate the relative performance of naive dilution (obtained by
learning without dilution, followed by applying a threshold to the model
parameters), L1 dilution (which is frequently used in convex optimization)
and L0 dilution (which is optimal but computationally hard to implement).
Whereas both Lp-diluted approaches clearly outperform the naive approach, we
find a small region where L0 works almost perfectly and strongly outperforms
the simpler-to-implement L1 dilution.
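
As an illustration (the notation below is ours, not taken from the paper), the Lp-diluted learning problem can be sketched as minimizing a penalized cost of the form
\[
E(\mathbf{w}) \;=\; E_{\mathrm{train}}(\mathbf{w}) \;+\; \lambda \sum_{i=1}^{N} |w_i|^{p},
\qquad 0 \le p \le 1,
\]
where \(\mathbf{w}\) denotes the \(N\) perceptron weights, \(E_{\mathrm{train}}\) the memorization or generalization error, and \(\lambda\) the penalty strength. Here \(p=1\) gives the convex L1 penalty, the limit \(p \to 0\) counts the nonzero weights (the L0 penalty), and the naive approach corresponds to learning with \(\lambda = 0\) followed by thresholding the learned weights.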