4,407 research outputs found
An Emulator for the Lyman-alpha Forest
We present methods for interpolating between the 1-D flux power spectrum of
the Lyman- forest, as output by cosmological hydrodynamic simulations.
Interpolation is necessary for cosmological parameter estimation due to the
limited number of simulations possible. We construct an emulator for the
Lyman- forest flux power spectrum from small simulations using
Latin hypercube sampling and Gaussian process interpolation. We show that this
emulator has a typical accuracy of 1.5% and a worst-case accuracy of 4%, which
compares well to the current statistical error of 3 - 5% at from BOSS
DR9. We compare to the previous state of the art, quadratic polynomial
interpolation. The Latin hypercube samples the entire volume of parameter
space, while quadratic polynomial emulation samples only lower-dimensional
subspaces. The Gaussian process provides an estimate of the emulation error and
we show using test simulations that this estimate is reasonable. We construct a
likelihood function and use it to show that the posterior constraints generated
using the emulator are unbiased. We show that our Gaussian process emulator has
lower emulation error than quadratic polynomial interpolation and thus produces
tighter posterior confidence intervals, which will be essential for future
Lyman- surveys such as DESI.Comment: 28 pages, 10 figures, accepted to JCAP with minor change
Estimating the Parameter of Exponential Distribution under Type II Censoring From Fuzzy Data
The problem of estimating the parameter of Exponential distribution on the basis of type II censoring scheme is considered when the available data are in the form of fuzzy numbers. The Bayes estimate of the unknown parameter is obtained by using the approximation forms of Lindley (1980) and Tierney and Kadane (1986) under the assumption of gamma prior. The highest posterior density (HPD) estimate of the parameter of interest is found. A Monte Carlo simulation is used to compare the performances of the different methods. A real data set is investigated to illustrate the applicability of the proposed methods
Small area estimation of general parameters with application to poverty indicators: A hierarchical Bayes approach
Poverty maps are used to aid important political decisions such as allocation
of development funds by governments and international organizations. Those
decisions should be based on the most accurate poverty figures. However, often
reliable poverty figures are not available at fine geographical levels or for
particular risk population subgroups due to the sample size limitation of
current national surveys. These surveys cannot cover adequately all the desired
areas or population subgroups and, therefore, models relating the different
areas are needed to 'borrow strength" from area to area. In particular, the
Spanish Survey on Income and Living Conditions (SILC) produces national poverty
estimates but cannot provide poverty estimates by Spanish provinces due to the
poor precision of direct estimates, which use only the province specific data.
It also raises the ethical question of whether poverty is more severe for women
than for men in a given province. We develop a hierarchical Bayes (HB) approach
for poverty mapping in Spanish provinces by gender that overcomes the small
province sample size problem of the SILC. The proposed approach has a wide
scope of application because it can be used to estimate general nonlinear
parameters. We use a Bayesian version of the nested error regression model in
which Markov chain Monte Carlo procedures and the convergence monitoring
therein are avoided. A simulation study reveals good frequentist properties of
the HB approach. The resulting poverty maps indicate that poverty, both in
frequency and intensity, is localized mostly in the southern and western
provinces and it is more acute for women than for men in most of the provinces.Comment: Published in at http://dx.doi.org/10.1214/13-AOAS702 the Annals of
Applied Statistics (http://www.imstat.org/aoas/) by the Institute of
Mathematical Statistics (http://www.imstat.org
Machine Learning in Wireless Sensor Networks: Algorithms, Strategies, and Applications
Wireless sensor networks monitor dynamic environments that change rapidly
over time. This dynamic behavior is either caused by external factors or
initiated by the system designers themselves. To adapt to such conditions,
sensor networks often adopt machine learning techniques to eliminate the need
for unnecessary redesign. Machine learning also inspires many practical
solutions that maximize resource utilization and prolong the lifespan of the
network. In this paper, we present an extensive literature review over the
period 2002-2013 of machine learning methods that were used to address common
issues in wireless sensor networks (WSNs). The advantages and disadvantages of
each proposed algorithm are evaluated against the corresponding problem. We
also provide a comparative guide to aid WSN designers in developing suitable
machine learning solutions for their specific application challenges.Comment: Accepted for publication in IEEE Communications Surveys and Tutorial
One-Class Classification: Taxonomy of Study and Review of Techniques
One-class classification (OCC) algorithms aim to build classification models
when the negative class is either absent, poorly sampled or not well defined.
This unique situation constrains the learning of efficient classifiers by
defining class boundary just with the knowledge of positive class. The OCC
problem has been considered and applied under many research themes, such as
outlier/novelty detection and concept learning. In this paper we present a
unified view of the general problem of OCC by presenting a taxonomy of study
for OCC problems, which is based on the availability of training data,
algorithms used and the application domains applied. We further delve into each
of the categories of the proposed taxonomy and present a comprehensive
literature review of the OCC algorithms, techniques and methodologies with a
focus on their significance, limitations and applications. We conclude our
paper by discussing some open research problems in the field of OCC and present
our vision for future research.Comment: 24 pages + 11 pages of references, 8 figure
An SMP Soft Classification Algorithm for Remote Sensing
This work introduces a symmetric multiprocessing (SMP) version of the continuous iterative
guided spectral class rejection (CIGSCR) algorithm, a semiautomated classification algorithm for remote
sensing (multispectral) images. The algorithm uses soft data clusters to produce a soft classification
containing inherently more information than a comparable hard classification at an increased computational
cost. Previous work suggests that similar algorithms achieve good parallel scalability, motivating the parallel
algorithm development work here. Experimental results of applying parallel CIGSCR to an image with
approximately 10^8 pixels and six bands demonstrate superlinear speedup. A soft two class classification is
generated in just over four minutes using 32 processors
- …