19,769 research outputs found

    On the future of astrostatistics: statistical foundations and statistical practice

    Full text link
    This paper summarizes a presentation for a panel discussion on "The Future of Astrostatistics" held at the Statistical Challenges in Modern Astronomy V conference at Pennsylvania State University in June 2011. I argue that the emerging needs of astrostatistics may both motivate and benefit from fundamental developments in statistics. I highlight some recent work within statistics on fundamental topics relevant to astrostatistical practice, including the Bayesian/frequentist debate (and ideas for a synthesis), multilevel models, and multiple testing. As an important direction for future work in statistics, I emphasize that astronomers need a statistical framework that explicitly supports unfolding chains of discovery, with acquisition, cataloging, and modeling of data not seen as isolated tasks, but rather as parts of an ongoing, integrated sequence of analyses, with information and uncertainty propagating forward and backward through the chain. A prototypical example is surveying of astronomical populations, where source detection, demographic modeling, and the design of survey instruments and strategies all interact.Comment: 8 pp, 2 figures. To appear in "Statistical Challenges in Modern Astronomy V," (Lecture Notes in Statistics, Vol. 209), ed. Eric D. Feigelson and G. Jogesh Babu; publication planned for Sep 2012; see http://www.springer.com/statistics/book/978-1-4614-3519-

    Multi modal multi-semantic image retrieval

    Get PDF
    PhDThe rapid growth in the volume of visual information, e.g. image, and video can overwhelm users’ ability to find and access the specific visual information of interest to them. In recent years, ontology knowledge-based (KB) image information retrieval techniques have been adopted into in order to attempt to extract knowledge from these images, enhancing the retrieval performance. A KB framework is presented to promote semi-automatic annotation and semantic image retrieval using multimodal cues (visual features and text captions). In addition, a hierarchical structure for the KB allows metadata to be shared that supports multi-semantics (polysemy) for concepts. The framework builds up an effective knowledge base pertaining to a domain specific image collection, e.g. sports, and is able to disambiguate and assign high level semantics to ‘unannotated’ images. Local feature analysis of visual content, namely using Scale Invariant Feature Transform (SIFT) descriptors, have been deployed in the ‘Bag of Visual Words’ model (BVW) as an effective method to represent visual content information and to enhance its classification and retrieval. Local features are more useful than global features, e.g. colour, shape or texture, as they are invariant to image scale, orientation and camera angle. An innovative approach is proposed for the representation, annotation and retrieval of visual content using a hybrid technique based upon the use of an unstructured visual word and upon a (structured) hierarchical ontology KB model. The structural model facilitates the disambiguation of unstructured visual words and a more effective classification of visual content, compared to a vector space model, through exploiting local conceptual structures and their relationships. The key contributions of this framework in using local features for image representation include: first, a method to generate visual words using the semantic local adaptive clustering (SLAC) algorithm which takes term weight and spatial locations of keypoints into account. Consequently, the semantic information is preserved. Second a technique is used to detect the domain specific ‘non-informative visual words’ which are ineffective at representing the content of visual data and degrade its categorisation ability. Third, a method to combine an ontology model with xi a visual word model to resolve synonym (visual heterogeneity) and polysemy problems, is proposed. The experimental results show that this approach can discover semantically meaningful visual content descriptions and recognise specific events, e.g., sports events, depicted in images efficiently. Since discovering the semantics of an image is an extremely challenging problem, one promising approach to enhance visual content interpretation is to use any associated textual information that accompanies an image, as a cue to predict the meaning of an image, by transforming this textual information into a structured annotation for an image e.g. using XML, RDF, OWL or MPEG-7. Although, text and image are distinct types of information representation and modality, there are some strong, invariant, implicit, connections between images and any accompanying text information. Semantic analysis of image captions can be used by image retrieval systems to retrieve selected images more precisely. To do this, a Natural Language Processing (NLP) is exploited firstly in order to extract concepts from image captions. Next, an ontology-based knowledge model is deployed in order to resolve natural language ambiguities. To deal with the accompanying text information, two methods to extract knowledge from textual information have been proposed. First, metadata can be extracted automatically from text captions and restructured with respect to a semantic model. Second, the use of LSI in relation to a domain-specific ontology-based knowledge model enables the combined framework to tolerate ambiguities and variations (incompleteness) of metadata. The use of the ontology-based knowledge model allows the system to find indirectly relevant concepts in image captions and thus leverage these to represent the semantics of images at a higher level. Experimental results show that the proposed framework significantly enhances image retrieval and leads to narrowing of the semantic gap between lower level machinederived and higher level human-understandable conceptualisation

    Methodological Issues in Spatial Microsimulation Modelling for Small Area Estimation

    Get PDF
    In this paper, some vital methodological issues of spatial microsimulation modelling for small area estimation have been addressed, with a particular emphasis given to the reweighting techniques. Most of the review articles in small area estimation have highlighted methodologies based on various statistical models and theories. However, spatial microsimulation modelling is emerging as a very useful alternative means of small area estimation. Our findings demonstrate that spatial microsimulation models are robust and have advantages over other type of models used for small area estimation. The technique uses different methodologies typically based on geographic models and various economic theories. In contrast to statistical model-based approaches, the spatial microsimulation model-based approaches can operate through reweighting techniques such as GREGWT and combinatorial optimization. A comparison between reweighting techniques reveals that they are using quite different iterative algorithms and that their properties also vary. The study also points out a new method for spatial microsimulation modellingBayesian prediction approach; combinatorial optimisation; GREGWT; microdata; small area estimation; spatial microsimulation

    Learning to count with deep object features

    Full text link
    Learning to count is a learning strategy that has been recently proposed in the literature for dealing with problems where estimating the number of object instances in a scene is the final objective. In this framework, the task of learning to detect and localize individual object instances is seen as a harder task that can be evaded by casting the problem as that of computing a regression value from hand-crafted image features. In this paper we explore the features that are learned when training a counting convolutional neural network in order to understand their underlying representation. To this end we define a counting problem for MNIST data and show that the internal representation of the network is able to classify digits in spite of the fact that no direct supervision was provided for them during training. We also present preliminary results about a deep network that is able to count the number of pedestrians in a scene.Comment: This paper has been accepted at Deep Vision Workshop at CVPR 201

    A Systematic Review of Strong Gravitational Lens Modeling Software

    Full text link
    Despite expanding research activity in gravitational lens modeling, there is no particular software which is considered a standard. Much of the gravitational lens modeling software is written by individual investigators for their own use. Some gravitational lens modeling software is freely available for download but is widely variable with regard to ease of use and quality of documentation. This review of 13 software packages was undertaken to provide a single source of information. Gravitational lens models are classified as parametric models or non-parametric models, and can be further divided into research and educational software. Software used in research includes the GRAVLENS package (with both gravlens and lensmodel), Lenstool, LensPerfect, glafic, PixeLens, SimpLens, Lensview, and GRALE. In this review, GravLensHD, G-Lens, Gravitational Lensing, lens and MOWGLI are categorized as educational programs that are useful for demonstrating various aspects of lensing. Each of the 13 software packages is reviewed with regard to software features (installation, documentation, files provided, etc.) and lensing features (type of model, input data, output data, etc.) as well as a brief review of studies where they have been used. Recent studies have demonstrated the utility of strong gravitational lensing data for mass mapping, and suggest increased use of these techniques in the future. Coupled with the advent of greatly improved imaging, new approaches to modeling of strong gravitational lens systems are needed. This is the first systematic review of strong gravitational lens modeling software, providing investigators with a starting point for future software development to further advance gravitational lens modeling research

    Bayesian Modelling of Direct and Indirect Effects of Marine Reserves on Fishes : A thesis presented in partial fulfilment of the requirements for the degree of Doctor of Philosophy in Statistics at Massey University, Albany, New Zealand.

    Get PDF
    This thesis reviews and develops modern advanced statistical methodology for sampling and modelling count data from marine ecological studies, with specific applications to quantifying potential direct and indirect effects of marine reserves on fishes in north eastern New Zealand. Counts of snapper (Pagrus auratus: Sparidae) from baited underwater video surveys from an unbalanced, multi-year, hierarchical sampling programme were analysed using a Bayesian Generalised Linear Mixed Model (GLMM) approach, which allowed the integer counts to be explicitly modelled while incorporating multiple fixed and random effects. Overdispersion was modelled using a zero-inflated negative-binomial error distribution. A parsimonious method for zero inflation was developed, where the mean of the count distribution is explicitly linked to the probability of an excess zero. Comparisons of variance components identified marine reserve status as the greatest source of variation in counts of snapper above the legal size limit. Relative densities inside reserves were, on average, 13-times greater than outside reserves. Small benthic reef fishes inside and outside the same three reserves were surveyed to evaluate evidence for potential indirect effects of marine reserves via restored populations of fishery-targeted predators such as snapper. Sites for sampling were obtained randomly from populations of interest using spatial data and geo-referencing tools in R—a rarely used approach that is recommended here more generally to improve field-based ecological surveys. Resultant multispecies count data were analysed with multivariate GLMMs implemented in the R package MCMCglmm, based on a multivariate Poisson lognormal error distribution. Posterior distributions for hypothesised effects of interest were calculated directly for each species. While reserves did not appear to affect densities of small fishes, reserve-habitat interactions indicated that some endemic species of triplefin (Tripterygiidae) had different associations with small-scale habitat gradients inside vs outside reserves. These patterns were consistent with a behavioural risk effect, where small fishes may be more strongly attracted to refuge habitats to avoid predators inside vs outside reserves. The approaches developed and implemented in this thesis respond to some of the major current statistical and logistic challenges inherent in the analysis of counts of organisms. This work provides useful exemplar pathways for rigorous study design, modelling and inference in ecological systems

    The Design and Implementation of a Bayesian CAD Modeler for Robotic Applications

    Get PDF
    We present a Bayesian CAD modeler for robotic applications. We address the problem of taking into account the propagation of geometric uncertainties when solving inverse geometric problems. The proposed method may be seen as a generalization of constraint-based approaches in which we explicitly model geometric uncertainties. Using our methodology, a geometric constraint is expressed as a probability distribution on the system parameters and the sensor measurements, instead of a simple equality or inequality. To solve geometric problems in this framework, we propose an original resolution method able to adapt to problem complexity. Using two examples, we show how to apply our approach by providing simulation results using our modeler

    Disentangling causal webs in the brain using functional Magnetic Resonance Imaging: A review of current approaches

    Get PDF
    In the past two decades, functional Magnetic Resonance Imaging has been used to relate neuronal network activity to cognitive processing and behaviour. Recently this approach has been augmented by algorithms that allow us to infer causal links between component populations of neuronal networks. Multiple inference procedures have been proposed to approach this research question but so far, each method has limitations when it comes to establishing whole-brain connectivity patterns. In this work, we discuss eight ways to infer causality in fMRI research: Bayesian Nets, Dynamical Causal Modelling, Granger Causality, Likelihood Ratios, LiNGAM, Patel's Tau, Structural Equation Modelling, and Transfer Entropy. We finish with formulating some recommendations for the future directions in this area

    Can Ecological Interactions be Inferred from Spatial Data?

    Get PDF
    The characterisation and quantication of ecological interactions, and the construction of species distributions and their associated ecological niches, is of fundamental theoretical and practical importance. In this paper we give an overview of a Bayesian inference framework, developed over the last 10 years, which, using spatial data, offers a general formalism within which ecological interactions may be characterised and quantied. Interactions are identied through deviations of the spatial distribution of co-occurrences of spatial variables relative to a benchmark for the non-interacting system, and based on a statistical ensemble of spatial cells. The formalism allows for the integration of both biotic and abiotic factors of arbitrary resolution. We concentrate on the conceptual and mathematical underpinnings of the formalism, showing how, using the Naive Bayes approximation, it can be used to not only compare and contrast the relative contribution from each variable, but also to construct species distributions and niches based on arbitrary variable type. We show how the formalism can be used to quantify confounding and therefore help disentangle the complex causal chains that are present in ecosystems. We also show species distributions and their associated niches can be used to infer standard "micro" ecological interactions, such as predation and parasitism. We present several representative use cases that validate our framework, both in terms of being consistent with present knowledge of a set of known interactions, as well as making and validating predictions about new, previously unknown interactions in the case of zoonoses
    • …
    corecore