19,769 research outputs found
On the future of astrostatistics: statistical foundations and statistical practice
This paper summarizes a presentation for a panel discussion on "The Future of
Astrostatistics" held at the Statistical Challenges in Modern Astronomy V
conference at Pennsylvania State University in June 2011. I argue that the
emerging needs of astrostatistics may both motivate and benefit from
fundamental developments in statistics. I highlight some recent work within
statistics on fundamental topics relevant to astrostatistical practice,
including the Bayesian/frequentist debate (and ideas for a synthesis),
multilevel models, and multiple testing. As an important direction for future
work in statistics, I emphasize that astronomers need a statistical framework
that explicitly supports unfolding chains of discovery, with acquisition,
cataloging, and modeling of data not seen as isolated tasks, but rather as
parts of an ongoing, integrated sequence of analyses, with information and
uncertainty propagating forward and backward through the chain. A prototypical
example is surveying of astronomical populations, where source detection,
demographic modeling, and the design of survey instruments and strategies all
interact.Comment: 8 pp, 2 figures. To appear in "Statistical Challenges in Modern
Astronomy V," (Lecture Notes in Statistics, Vol. 209), ed. Eric D. Feigelson
and G. Jogesh Babu; publication planned for Sep 2012; see
http://www.springer.com/statistics/book/978-1-4614-3519-
Multi modal multi-semantic image retrieval
PhDThe rapid growth in the volume of visual information, e.g. image, and video can
overwhelm users’ ability to find and access the specific visual information of interest
to them. In recent years, ontology knowledge-based (KB) image information retrieval
techniques have been adopted into in order to attempt to extract knowledge from these
images, enhancing the retrieval performance. A KB framework is presented to
promote semi-automatic annotation and semantic image retrieval using multimodal
cues (visual features and text captions). In addition, a hierarchical structure for the KB
allows metadata to be shared that supports multi-semantics (polysemy) for concepts.
The framework builds up an effective knowledge base pertaining to a domain specific
image collection, e.g. sports, and is able to disambiguate and assign high level
semantics to ‘unannotated’ images.
Local feature analysis of visual content, namely using Scale Invariant Feature
Transform (SIFT) descriptors, have been deployed in the ‘Bag of Visual Words’
model (BVW) as an effective method to represent visual content information and to
enhance its classification and retrieval. Local features are more useful than global
features, e.g. colour, shape or texture, as they are invariant to image scale, orientation
and camera angle. An innovative approach is proposed for the representation,
annotation and retrieval of visual content using a hybrid technique based upon the use
of an unstructured visual word and upon a (structured) hierarchical ontology KB
model. The structural model facilitates the disambiguation of unstructured visual
words and a more effective classification of visual content, compared to a vector
space model, through exploiting local conceptual structures and their relationships.
The key contributions of this framework in using local features for image
representation include: first, a method to generate visual words using the semantic
local adaptive clustering (SLAC) algorithm which takes term weight and spatial
locations of keypoints into account. Consequently, the semantic information is
preserved. Second a technique is used to detect the domain specific ‘non-informative
visual words’ which are ineffective at representing the content of visual data and
degrade its categorisation ability. Third, a method to combine an ontology model with
xi
a visual word model to resolve synonym (visual heterogeneity) and polysemy
problems, is proposed. The experimental results show that this approach can discover
semantically meaningful visual content descriptions and recognise specific events,
e.g., sports events, depicted in images efficiently.
Since discovering the semantics of an image is an extremely challenging problem, one
promising approach to enhance visual content interpretation is to use any associated
textual information that accompanies an image, as a cue to predict the meaning of an
image, by transforming this textual information into a structured annotation for an
image e.g. using XML, RDF, OWL or MPEG-7. Although, text and image are distinct
types of information representation and modality, there are some strong, invariant,
implicit, connections between images and any accompanying text information.
Semantic analysis of image captions can be used by image retrieval systems to
retrieve selected images more precisely. To do this, a Natural Language Processing
(NLP) is exploited firstly in order to extract concepts from image captions. Next, an
ontology-based knowledge model is deployed in order to resolve natural language
ambiguities. To deal with the accompanying text information, two methods to extract
knowledge from textual information have been proposed. First, metadata can be
extracted automatically from text captions and restructured with respect to a semantic
model. Second, the use of LSI in relation to a domain-specific ontology-based
knowledge model enables the combined framework to tolerate ambiguities and
variations (incompleteness) of metadata. The use of the ontology-based knowledge
model allows the system to find indirectly relevant concepts in image captions and
thus leverage these to represent the semantics of images at a higher level.
Experimental results show that the proposed framework significantly enhances image
retrieval and leads to narrowing of the semantic gap between lower level machinederived
and higher level human-understandable conceptualisation
Methodological Issues in Spatial Microsimulation Modelling for Small Area Estimation
In this paper, some vital methodological issues of spatial microsimulation modelling for small area estimation have been addressed, with a particular emphasis given to the reweighting techniques. Most of the review articles in small area estimation have highlighted methodologies based on various statistical models and theories. However, spatial microsimulation modelling is emerging as a very useful alternative means of small area estimation. Our findings demonstrate that spatial microsimulation models are robust and have advantages over other type of models used for small area estimation. The technique uses different methodologies typically based on geographic models and various economic theories. In contrast to statistical model-based approaches, the spatial microsimulation model-based approaches can operate through reweighting techniques such as GREGWT and combinatorial optimization. A comparison between reweighting techniques reveals that they are using quite different iterative algorithms and that their properties also vary. The study also points out a new method for spatial microsimulation modellingBayesian prediction approach; combinatorial optimisation; GREGWT; microdata; small area estimation; spatial microsimulation
Learning to count with deep object features
Learning to count is a learning strategy that has been recently proposed in
the literature for dealing with problems where estimating the number of object
instances in a scene is the final objective. In this framework, the task of
learning to detect and localize individual object instances is seen as a harder
task that can be evaded by casting the problem as that of computing a
regression value from hand-crafted image features. In this paper we explore the
features that are learned when training a counting convolutional neural network
in order to understand their underlying representation. To this end we define a
counting problem for MNIST data and show that the internal representation of
the network is able to classify digits in spite of the fact that no direct
supervision was provided for them during training. We also present preliminary
results about a deep network that is able to count the number of pedestrians in
a scene.Comment: This paper has been accepted at Deep Vision Workshop at CVPR 201
A Systematic Review of Strong Gravitational Lens Modeling Software
Despite expanding research activity in gravitational lens modeling, there is
no particular software which is considered a standard. Much of the
gravitational lens modeling software is written by individual investigators for
their own use. Some gravitational lens modeling software is freely available
for download but is widely variable with regard to ease of use and quality of
documentation. This review of 13 software packages was undertaken to provide a
single source of information. Gravitational lens models are classified as
parametric models or non-parametric models, and can be further divided into
research and educational software. Software used in research includes the
GRAVLENS package (with both gravlens and lensmodel), Lenstool, LensPerfect,
glafic, PixeLens, SimpLens, Lensview, and GRALE. In this review, GravLensHD,
G-Lens, Gravitational Lensing, lens and MOWGLI are categorized as educational
programs that are useful for demonstrating various aspects of lensing. Each of
the 13 software packages is reviewed with regard to software features
(installation, documentation, files provided, etc.) and lensing features (type
of model, input data, output data, etc.) as well as a brief review of studies
where they have been used. Recent studies have demonstrated the utility of
strong gravitational lensing data for mass mapping, and suggest increased use
of these techniques in the future. Coupled with the advent of greatly improved
imaging, new approaches to modeling of strong gravitational lens systems are
needed. This is the first systematic review of strong gravitational lens
modeling software, providing investigators with a starting point for future
software development to further advance gravitational lens modeling research
Bayesian Modelling of Direct and Indirect Effects of Marine Reserves on Fishes : A thesis presented in partial fulfilment of the requirements for the degree of Doctor of Philosophy in Statistics at Massey University, Albany, New Zealand.
This thesis reviews and develops modern advanced statistical methodology for
sampling and modelling count data from marine ecological studies, with specific applications
to quantifying potential direct and indirect effects of marine reserves on fishes in north
eastern New Zealand. Counts of snapper (Pagrus auratus: Sparidae) from baited underwater
video surveys from an unbalanced, multi-year, hierarchical sampling programme were
analysed using a Bayesian Generalised Linear Mixed Model (GLMM) approach, which
allowed the integer counts to be explicitly modelled while incorporating multiple fixed and
random effects. Overdispersion was modelled using a zero-inflated negative-binomial error
distribution. A parsimonious method for zero inflation was developed, where the mean of the
count distribution is explicitly linked to the probability of an excess zero. Comparisons of
variance components identified marine reserve status as the greatest source of variation in
counts of snapper above the legal size limit. Relative densities inside reserves were, on
average, 13-times greater than outside reserves.
Small benthic reef fishes inside and outside the same three reserves were surveyed to
evaluate evidence for potential indirect effects of marine reserves via restored populations of
fishery-targeted predators such as snapper. Sites for sampling were obtained randomly from
populations of interest using spatial data and geo-referencing tools in R—a rarely used
approach that is recommended here more generally to improve field-based ecological
surveys. Resultant multispecies count data were analysed with multivariate GLMMs
implemented in the R package MCMCglmm, based on a multivariate Poisson lognormal error
distribution. Posterior distributions for hypothesised effects of interest were calculated
directly for each species. While reserves did not appear to affect densities of small fishes,
reserve-habitat interactions indicated that some endemic species of triplefin (Tripterygiidae)
had different associations with small-scale habitat gradients inside vs outside reserves. These patterns were consistent with a behavioural risk effect, where small fishes may be more
strongly attracted to refuge habitats to avoid predators inside vs outside reserves.
The approaches developed and implemented in this thesis respond to some of the
major current statistical and logistic challenges inherent in the analysis of counts of
organisms. This work provides useful exemplar pathways for rigorous study design,
modelling and inference in ecological systems
The Design and Implementation of a Bayesian CAD Modeler for Robotic Applications
We present a Bayesian CAD modeler for robotic applications. We address the problem of taking into account the propagation of geometric uncertainties when solving inverse geometric problems. The proposed method may be seen as a generalization of constraint-based approaches in which we explicitly model geometric uncertainties. Using our methodology, a geometric constraint is expressed as a probability distribution on the system parameters and the sensor measurements, instead of a simple equality or inequality. To solve geometric problems in this framework, we propose an original resolution method able to adapt to problem complexity.
Using two examples, we show how to apply our approach by providing simulation results using our modeler
Disentangling causal webs in the brain using functional Magnetic Resonance Imaging: A review of current approaches
In the past two decades, functional Magnetic Resonance Imaging has been used
to relate neuronal network activity to cognitive processing and behaviour.
Recently this approach has been augmented by algorithms that allow us to infer
causal links between component populations of neuronal networks. Multiple
inference procedures have been proposed to approach this research question but
so far, each method has limitations when it comes to establishing whole-brain
connectivity patterns. In this work, we discuss eight ways to infer causality
in fMRI research: Bayesian Nets, Dynamical Causal Modelling, Granger Causality,
Likelihood Ratios, LiNGAM, Patel's Tau, Structural Equation Modelling, and
Transfer Entropy. We finish with formulating some recommendations for the
future directions in this area
Can Ecological Interactions be Inferred from Spatial Data?
The characterisation and quantication of ecological interactions, and the construction
of species distributions and their associated ecological niches, is of fundamental
theoretical and practical importance. In this paper we give an overview of a Bayesian
inference framework, developed over the last 10 years, which, using spatial data, offers
a general formalism within which ecological interactions may be characterised and
quantied. Interactions are identied through deviations of the spatial distribution
of co-occurrences of spatial variables relative to a benchmark for the non-interacting
system, and based on a statistical ensemble of spatial cells. The formalism allows for
the integration of both biotic and abiotic factors of arbitrary resolution. We concentrate
on the conceptual and mathematical underpinnings of the formalism, showing
how, using the Naive Bayes approximation, it can be used to not only compare and
contrast the relative contribution from each variable, but also to construct species
distributions and niches based on arbitrary variable type. We show how the formalism
can be used to quantify confounding and therefore help disentangle the complex
causal chains that are present in ecosystems. We also show species distributions and
their associated niches can be used to infer standard "micro" ecological interactions,
such as predation and parasitism. We present several representative use cases that
validate our framework, both in terms of being consistent with present knowledge of
a set of known interactions, as well as making and validating predictions about new,
previously unknown interactions in the case of zoonoses
- …