Search CORE

977 research outputs found

Bounds on Query Convergence

Author: Pearlmutter Barak A.
Publication venue
Publication date: 01/01/2005
Field of study

The problem of finding an optimum using noisy evaluations of a smooth cost function arises in many contexts, including economics, business, medicine, experiment design, and foraging theory. We derive an asymptotic bound E[ (x_t - x*)^2 ] >= O(1/sqrt(t)) on the rate of convergence of a sequence (x_0, x_1, >...) generated by an unbiased feedback process observing noisy evaluations of an unknown quadratic function maximised at x*. The bound is tight, as the proof leads to a simple algorithm which meets it. We further establish a bound on the total regret, E[ sum_{i=1..t} (x_i - x*)^2 ] >= O(sqrt(t)) These bounds may impose practical limitations on an agent's performance, as O(eps^-4) queries are made before the queries converge to x* with eps accuracy.Comment: 6 pages, 2 figure

arXiv.org e-Print Archive

CiteSeerX

MURAL - Maynooth University Research Archive Library

NUI Maynooth Eprint Archive

Maynooth University ePrints and eTheses Archive

DeLeT: Graduates' Perceptions of the Program and Their Preparedness for Teaching: An Evaluation Report

Author: Eran Tamir
Nili Pearlmutter
Publication venue: Brandeis University's Jack, Joseph and Morton Mandel Center
Publication date: 08/08/2014
Field of study

This report focuses on how DeLeT graduates from both programs perceive their preparedness for day school teaching, as well as how they perceive the DeLeT faculty and the programs' strengths and weaknesses. It also examines similarities and differences between the two programs and offers possible explanations for the handful of differences we identified. Such an in-depth examination of graduates' perspectives provides valuable formative feedback to both programs. In addition, we anticipate that this report will be useful to funders and faculty at other Jewish teacher education programs who may be interested in using the evaluation tools and procedures we have developed to learn about their graduates and identify areas for program improvement

IssueLab

Automatic Differentiation of Algorithms for Machine Learning

Author: Baydin Atilim Gunes
Pearlmutter Barak A.
Publication venue
Publication date: 01/01/2014
Field of study

Automatic differentiation---the mechanical transformation of numeric computer programs to calculate derivatives efficiently and accurately---dates to the origin of the computer age. Reverse mode automatic differentiation both antedates and generalizes the method of backwards propagation of errors used in machine learning. Despite this, practitioners in a variety of fields, including machine learning, have been little influenced by automatic differentiation, and make scant use of available tools. Here we review the technique of automatic differentiation, describe its two main modes, and explain how it can benefit machine learning practitioners. To reach the widest possible audience our treatment assumes only elementary differential calculus, and does not assume any knowledge of linear algebra.Comment: 7 pages, 1 figur

arXiv.org e-Print Archive

MURAL - Maynooth University Research Archive Library

NUI Maynooth Eprint Archive

Maynooth University ePrints and eTheses Archive

An Analysis of Publication Venues for Automatic Differentiation Research

Author: Baydin Atilim Gunes
Pearlmutter Barak A.
Publication venue
Publication date: 01/01/2014
Field of study

We present the results of our analysis of publication venues for papers on automatic differentiation (AD), covering academic journals and conference proceedings. Our data are collected from the AD publications database maintained by the autodiff.org community website. The database is purpose-built for the AD field and is expanding via submissions by AD researchers. Therefore, it provides a relatively noise-free list of publications relating to the field. However, it does include noise in the form of variant spellings of journal and conference names. We handle this by manually correcting and merging these variants under the official names of corresponding venues. We also share the raw data we get after these corrections.Comment: 6 pages, 3 figure

arXiv.org e-Print Archive

MURAL - Maynooth University Research Archive Library

NUI Maynooth Eprint Archive

Maynooth University ePrints and eTheses Archive

AD in Fortran, Part 2: Implementation via Prepreprocessor

Author: Pearlmutter Barak A.
Radul Alexey
Siskind Jeffrey Mark
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 08/03/2012
Field of study

We describe an implementation of the Farfel Fortran AD extensions. These extensions integrate forward and reverse AD directly into the programming model, with attendant benefits to flexibility, modularity, and ease of use. The implementation we describe is a "prepreprocessor" that generates input to existing Fortran-based AD tools. In essence, blocks of code which are targeted for AD by Farfel constructs are put into subprograms which capture their lexical variable context, and these are closure-converted into top-level subprograms and specialized to eliminate EXTERNAL arguments, rendering them amenable to existing AD preprocessors, which are then invoked, possibly repeatedly if the AD is nested

arXiv.org e-Print Archive

Crossref

MURAL - Maynooth University Research Archive Library

NUI Maynooth Eprint Archive

Maynooth University ePrints and eTheses Archive

AD in Fortran, Part 1: Design

Author: Pearlmutter Barak A.
Radul Alexey
Siskind Jeffrey Mark
Publication venue
Publication date: 01/01/2012
Field of study

We propose extensions to Fortran which integrate forward and reverse Automatic Differentiation (AD) directly into the programming model. Irrespective of implementation technology, embedding AD constructs directly into the language extends the reach and convenience of AD while allowing abstraction of concepts of interest to scientific-computing practice, such as root finding, optimization, and finding equilibria of continuous games. Multiple different subprograms for these tasks can share common interfaces, regardless of whether and how they use AD internally. A programmer can maximize a function F by calling a library maximizer, XSTAR=ARGMAX(F,X0), which internally constructs derivatives of F by AD, without having to learn how to use any particular AD tool. We illustrate the utility of these extensions by example: programs become much more concise and closer to traditional mathematical notation. A companion paper describes how these extensions can be implemented by a program that generates input to existing Fortran-based AD tools

arXiv.org e-Print Archive

MURAL - Maynooth University Research Archive Library

NUI Maynooth Eprint Archive

Maynooth University ePrints and eTheses Archive

Soft-LOST: EM on a Mixture of Oriented Lines

Author: Pearlmutter Barak A.
Publication venue: Springer Verlag
Publication date: 01/01/2004
Field of study

Robust clustering of data into overlapping linear subspaces is a common problem. Here we consider one-dimensional subspaces that cross the origin. This problem arises in blind source separation, where the subspaces correspond directly to columns of a mixing matrix. We present an algorithm that identifies these subspaces using an EM procedure, where the E-step calculates posterior probabilities assigning data points to lines and the M-step repositions the lines to match the points assigned to them. This method, combined with a transformation into a sparse domain and an L1-norm optimisation, constitutes a blind source separation algorithm for the under-determined case

MURAL - Maynooth University Research Archive Library

Simplifying Neural Network Soft Weight-sharing Measures by Soft Weight-measure Soft Weight Sharing

Author: Pearlmutter Barak A.
Publication venue: 'Informa UK Limited'
Publication date: 01/01/1994
Field of study

The abstract is included in the text

MURAL - Maynooth University Research Archive Library

Dungeons and Dragons: The Environmental Insurance Game of the 1990s

Author: Pearlmutter Fredi L.
Publication venue: eRepository @ Seton Hall
Publication date: 01/08/1993
Field of study

bepress Legal Repository

Seton Hall University Libraries

Gradient Descent: Second-Order Momentum and Saturating Error

Author: Pearlmutter Barak A.
Publication venue: Massachusetts Institute of Technology Press (MIT Press)
Publication date: 01/01/1991
Field of study

Batch gradient descent, ~w(t) = -7JdE/dw(t) , conver~es to a minimum of quadratic form with a time constant no better than '4Amax/ Amin where Amin and Amax are the minimum and maximum eigenvalues of the Hessian matrix of E with respect to w. It was recently shown that adding a momentum term ~w(t) = -7JdE/dw(t) + Q'~w(t - 1) improves this to ~ VAmax/ Amin, although only in the batch case. Here we show that secondorder momentum, ~w(t) = -7JdE/dw(t) + Q'~w(t -1) + (3~w(t - 2), can lower this no further. We then regard gradient descent with momentum as a dynamic system and explore a non quadratic error surface, showing that saturation of the error accounts for a variety of effects observed in simulations and justifies some popular heuristics

MURAL - Maynooth University Research Archive Library

Maynooth University ePrints and eTheses Archive

NUI Maynooth Eprint Archive