To go deep or wide in learning?

Dukkipati, Ambedkar; Pandey, Gaurav

research

To go deep or wide in learning?

Authors: Ambedkar Dukkipati
Gaurav Pandey
Publication date: 23 February 2014
Publisher

Abstract

To achieve acceptable performance for AI tasks, one can either use sophisticated feature extraction methods as the first layer in a two-layered supervised learning model, or learn the features directly using a deep (multi-layered) model. While the first approach is very problem-specific, the second approach has computational overheads in learning multiple layers and fine-tuning of the model. In this paper, we propose an approach called wide learning based on arc-cosine kernels, that learns a single layer of infinite width. We propose exact and inexact learning strategies for wide learning and show that wide learning with single layer outperforms single layer as well as deep architectures of finite width for some benchmark datasets.Comment: 9 pages, 1 figure, Accepted for publication in Seventeenth International Conference on Artificial Intelligence and Statistic

Similar works

Full text

Available Versions

CiteSeerX

oai:CiteSeerX.psu:10.1.1.648.9...

Last time updated on 29/10/2017

CiteSeerX

oai:CiteSeerX.psu:10.1.1.703.2...

Last time updated on 29/10/2017

CiteSeerX

oai:CiteSeerX.psu:10.1.1.763.6...

Last time updated on 30/10/2017