1,471 research outputs found
GPfit: An R package for Gaussian Process Model Fitting using a New Optimization Algorithm
Gaussian process (GP) models are commonly used statistical metamodels for
emulating expensive computer simulators. Fitting a GP model can be numerically
unstable if any pair of design points in the input space are close together.
Ranjan, Haynes, and Karsten (2011) proposed a computationally stable approach
for fitting GP models to deterministic computer simulators. They used a genetic
algorithm based approach that is robust but computationally intensive for
maximizing the likelihood. This paper implements a slightly modified version of
the model proposed by Ranjan et al. (2011), as the new R package GPfit. A novel
parameterization of the spatial correlation function and a new multi-start
gradient based optimization algorithm yield optimization that is robust and
typically faster than the genetic algorithm based approach. We present two
examples with R codes to illustrate the usage of the main functions in GPfit.
Several test functions are used for performance comparison with a popular R
package mlegp. GPfit is a free software and distributed under the general
public license, as part of the R software project (R Development Core Team
2012).Comment: 20 pages, 17 image
Sequential Design for Computer Experiments with a Flexible Bayesian Additive Model
In computer experiments, a mathematical model implemented on a computer is
used to represent complex physical phenomena. These models, known as computer
simulators, enable experimental study of a virtual representation of the
complex phenomena. Simulators can be thought of as complex functions that take
many inputs and provide an output. Often these simulators are themselves
expensive to compute, and may be approximated by "surrogate models" such as
statistical regression models. In this paper we consider a new kind of
surrogate model, a Bayesian ensemble of trees (Chipman et al. 2010), with the
specific goal of learning enough about the simulator that a particular feature
of the simulator can be estimated. We focus on identifying the simulator's
global minimum. Utilizing the Bayesian version of the Expected Improvement
criterion (Jones et al. 1998), we show that this ensemble is particularly
effective when the simulator is ill-behaved, exhibiting nonstationarity or
abrupt changes in the response. A number of illustrations of the approach are
given, including a tidal power application.Comment: 21 page
-SELC: Optimization by sequential elimination of level combinations using genetic algorithms and Gaussian processes
Identifying promising compounds from a vast collection of feasible compounds
is an important and yet challenging problem in the pharmaceutical industry. An
efficient solution to this problem will help reduce the expenditure at the
early stages of drug discovery. In an attempt to solve this problem, Mandal, Wu
and Johnson [Technometrics 48 (2006) 273--283] proposed the SELC algorithm.
Although powerful, it fails to extract substantial information from the data to
guide the search efficiently, as this methodology is not based on any
statistical modeling. The proposed approach uses Gaussian Process (GP) modeling
to improve upon SELC, and hence named -SELC. The performance of
the proposed methodology is illustrated using four and five dimensional test
functions. Finally, we implement the new algorithm on a real pharmaceutical
data set for finding a group of chemical compounds with optimal properties.Comment: Published in at http://dx.doi.org/10.1214/08-AOAS199 the Annals of
Applied Statistics (http://www.imstat.org/aoas/) by the Institute of
Mathematical Statistics (http://www.imstat.org
- …