3,333 research outputs found
Non-stationary patterns of isolation-by-distance: inferring measures of local genetic differentiation with Bayesian kriging
Patterns of isolation-by-distance arise when population differentiation
increases with increasing geographic distances. Patterns of
isolation-by-distance are usually caused by local spatial dispersal, which
explains why differences of allele frequencies between populations accumulate
with distance. However, spatial variations of demographic parameters such as
migration rate or population density can generate non-stationary patterns of
isolation-by-distance where the rate at which genetic differentiation
accumulates varies across space. To characterize non-stationary patterns of
isolation-by-distance, we infer local genetic differentiation based on Bayesian
kriging. Local genetic differentiation for a sampled population is defined as
the average genetic differentiation between the sampled population and fictive
neighboring populations. To avoid defining populations in advance, the method
can also be applied at the scale of individuals making it relevant for
landscape genetics. Inference of local genetic differentiation relies on a
matrix of pairwise similarity or dissimilarity between populations or
individuals such as matrices of FST between pairs of populations. Simulation
studies show that maps of local genetic differentiation can reveal barriers to
gene flow but also other patterns such as continuous variations of gene flow
across habitat. The potential of the method is illustrated with 2 data sets:
genome-wide SNP data for human Swedish populations and AFLP markers for alpine
plant species. The software LocalDiff implementing the method is available at
http://membres-timc.imag.fr/Michael.Blum/LocalDiff.htmlComment: In press, Evolution 201
Recommended from our members
Appropriate, accessible and appealing probabilistic graphical models
Appropriate - Many multivariate probabilistic models either use independent distributions or dependent Gaussian distributions. Yet, many real-world datasets contain count-valued or non-negative skewed data, e.g. bag-of-words text data and biological sequencing data. Thus, we develop novel probabilistic graphical models for use on count-valued and non-negative data including Poisson graphical models and multinomial graphical models. We develop one generalization that allows for triple-wise or k-wise graphical models going beyond the normal pairwise formulation. Furthermore, we also explore Gaussian-copula graphical models and derive closed-form solutions for the conditional distributions and marginal distributions (both before and after conditioning). Finally, we derive mixture and admixture, or topic model, generalizations of these graphical models to introduce more power and interpretability.
Accessible - Previous multivariate models, especially related to text data, often have complex dependencies without a closed form and require complex inference algorithms that have limited theoretical justification. For example, hierarchical Bayesian models often require marginalizing over many latent variables. We show that our novel graphical models (even the k-wise interaction models) have simple and intuitive estimation procedures based on node-wise regressions that likely have similar theoretical guarantees as previous work in graphical models. For the copula-based graphical models, we show that simple approximations could still provide useful models; these copula models also come with closed-form conditional and marginal distributions, which make them amenable to exploratory inspection and manipulation. The parameters of these models are easy to interpret and thus may be accessible to a wide audience.
Appealing - High-level visualization and interpretation of graphical models with even 100 variables has often been difficult even for a graphical model expert---despite visualization being one of the original motivators for graphical models. This difficulty is likely due to the lack of collaboration between graphical model experts and visualization experts. To begin bridging this gap, we develop a novel "what if?" interaction that manipulates and leverages the probabilistic power of graphical models. Our approach defines: the probabilistic mechanism via conditional probability; the query language to map text input to a conditional probability query; and the formal underlying probabilistic model. We then propose to visualize these query-specific probabilistic graphical models by combining the intuitiveness of force-directed layouts with the beauty and readability of word clouds, which pack many words into valuable screen space while ensuring words do not overlap via pixel-level collision detection. Although both the force-directed layout and the pixel-level packing problems are challenging in their own right, we approximate both simultaneously via adaptive simulated annealing starting from careful initialization. For visualizing mixture distributions, we also design a meaningful mapping from the properties of the mixture distribution to a color in the perceptually uniform CIELUV color space. Finally, we demonstrate our approach via illustrative visualizations of several real-world datasets.Computer Science
On multidimensional scaling with Euclidean and city block metrics
Experimental sciences collect large amounts of data. Different techniques are available for information elicitation from data. Frequently statistical analysis should be combined with the experience and intuition of researchers. Human heuristic abilities are developed and oriented to patterns in space of dimensionality up to 3. Multidimensional scaling (MDS) addresses the problem how objects represented by proximity data can be represented by points in low dimensional space. MDS methods are implemented as the optimization of a stress function measuring fit of the proximity data by the distances between the respective points. Since the optimization problem is multimodal, a global optimization method should be used. In the present paper a combination of an evolutionary metaheuristic algorithm with a local search algorithm is used. The experimental results show the influence of metrics defining distances in the considered spaces on the results of multidimensional scaling. Data sets with known and unknown structure and different dimensionality (up to 512 variables) have been visualized.
Daugiamačių skalių su Euklido ir Manheteno metrikomis sudarymo metodai
Santrauka
Eksperimentiniai mokslai kaupia didelius duomenų kiekius. Sukurta daug metodų informacijai iš duomenų išgauti. Daûnai statistiniai metodai yra derinami su euristine analize pagrįsta tyrinėtojų intuicija. Tačiau euristiniai žmonių sugebėjimai gerai tinka analizuoti duomenis, kurių matavimų skaičius neviršija 3. Daugiamačių skalių metodas skirtas vaizduoti objektams mažo matavimų skaičiaus erdvėje, kai objektai apibrėžti panašumais/nepanašumais, o atstumai vaizdų erdvėje vaizduoja nepanašumus. Daugiamačių skalių metodai sudaromi kaip vaizdavimo tikslumo kriterijaus, paprastai vadinamo stresu, minimizavimo procedūros. Kadangi optimizavimo uždaviniai daugiaekstremalūs, jiems spręsti reikia globalios optimizacijos metodų. Šiame darbe pasiūlytas algoritmas, jungiantis metaeuristinę globalią paiešką ir lokalios minimizacijos metodą. Eksperimentais ištirta metrikos vaizdų erdvėje įtaka vaizdavimo tikslumui ir algoritmo efektyvumui. Eksperimentuose naudotos duomenų aibės su žinoma ir nežinoma struktūra; optimizacijos uždavinio kintamųjų yra iki 512.
First Published Online: 21 Oct 2010
Reikšminiai žodžiai: daugiadimensės skalės, globalioji optimizacija, metaeuristiniai metodai, Manheteno metrika, daugiamačių duomenų vizualizacija
Characterisation of the transmissivity field of a fractured and karstic aquifer, Southern France
International audienceGeological and hydrological data collected at the Terrieu experimental site north of Montpellier, in a confined carbonate aquifer indicates that both fracture clusters and a major bedding plane form the main flow paths of this highly heterogeneous karst aquifer. However, characterising the geometry and spatial location of the main flow channels and estimating their flow properties remain difficult. These challenges can be addressed by solving an inverse problem using the available hydraulic head data recorded during a set of interference pumping tests.We first constructed a 2D equivalent porous medium model to represent the test site domain and then employed regular zoning parameterisation, on which the inverse modelling was performed. Because we aim to resolve the fine-scale characteristics of the transmissivity field, the problem undertaken is essentially a large-scale inverse model, i.e. the dimension of the unknown parameters is high. In order to deal with the high computational demands in such a large-scale inverse problem, a gradient-based, non-linear algorithm (SNOPT) was used to estimate the transmissivity field on the experimental site scale through the inversion of steady-state, hydraulic head measurements recorded at 22 boreholes during 8 sequential cross-hole pumping tests. We used the data from outcrops, borehole fracture measurements and interpretations of inter-well connectivities from interference test responses as initial models to trigger the inversion. Constraints for hydraulic conductivities, based on analytical interpretations of pumping tests, were also added to the inversion models. In addition, the efficiency of the adopted inverse algorithm enables us to increase dramatically the number of unknown parameters to investigate the influence of elementary discretisation on the reconstruction of the transmissivity fields in both synthetic and field studies.By following the above approach, transmissivity fields that produce similar hydrodynamic behaviours to the real head measurements were obtained. The inverted transmissivity fields show complex, spatial heterogeneities with highly conductive channels embedded in a low transmissivity matrix region. The spatial trend of the main flow channels is in a good agreement with that of the main fracture sets mapped on outcrops in the vicinity of the Terrieu site suggesting that the hydraulic anisotropy is consistent with the structural anisotropy. These results from the inverse modelling enable the main flow paths to be located and their hydrodynamic properties to be estimated
State-of-the-art in aerodynamic shape optimisation methods
Aerodynamic optimisation has become an indispensable component for any aerodynamic design over the past 60 years, with applications to aircraft, cars, trains, bridges, wind turbines, internal pipe flows, and cavities, among others, and is thus relevant in many facets of technology. With advancements in computational power, automated design optimisation procedures have become more competent, however, there is an ambiguity and bias throughout the literature with regards to relative performance of optimisation architectures and employed algorithms. This paper provides a well-balanced critical review of the dominant optimisation approaches that have been integrated with aerodynamic theory for the purpose of shape optimisation. A total of 229 papers, published in more than 120 journals and conference proceedings, have been classified into 6 different optimisation algorithm approaches. The material cited includes some of the most well-established authors and publications in the field of aerodynamic optimisation. This paper aims to eliminate bias toward certain algorithms by analysing the limitations, drawbacks, and the benefits of the most utilised optimisation approaches. This review provides comprehensive but straightforward insight for non-specialists and reference detailing the current state for specialist practitioners
Development and demonstration of an on-board mission planner for helicopters
Mission management tasks can be distributed within a planning hierarchy, where each level of the hierarchy addresses a scope of action, and associated time scale or planning horizon, and requirements for plan generation response time. The current work is focused on the far-field planning subproblem, with a scope and planning horizon encompassing the entire mission and with a response time required to be about two minutes. The far-feld planning problem is posed as a constrained optimization problem and algorithms and structural organizations are proposed for the solution. Algorithms are implemented in a developmental environment, and performance is assessed with respect to optimality and feasibility for the intended application and in comparison with alternative algorithms. This is done for the three major components of far-field planning: goal planning, waypoint path planning, and timeline management. It appears feasible to meet performance requirements on a 10 Mips flyable processor (dedicated to far-field planning) using a heuristically-guided simulated annealing technique for the goal planner, a modified A* search for the waypoint path planner, and a speed scheduling technique developed for this project
A fuzzy clustering approach for determination of ideal points of new products
Prior to manufacture a new products, consumers with similar purchasing attitudes are grouped into clusters of which their central points are used as ideal points for new product development. However, many clustering methods ignore the fuzziness of consumers in purchasing products or conducing survey. This paper presents a new method which integrates a fuzzy data processing technique for dimension reduction of customer attributes and a fuzzy clustering technique for grouping consumers with similar purchasing attributes. Hence, the central points of each group are treated as the ideal points for new product development. The effectiveness of the proposed method is demonstrated based on a new product design problem for new digital cameras
Current Approaches for Image Fusion of Histological Data with Computed Tomography and Magnetic Resonance Imaging
Classical analysis of biological samples requires the destruction of the tissue’s integrity by cutting or grinding it down to thin slices for (Immuno)-histochemical staining and microscopic analysis. Despite high specificity, encoded in the stained 2D section of the whole tissue, the structural information, especially 3D information, is limited. Computed tomography (CT) or magnetic resonance imaging (MRI) scans performed prior to sectioning in combination with image registration algorithms provide an opportunity to regain access to morphological characteristics as well as to relate histological findings to the 3D structure of the local tissue environment. This review provides a summary of prevalent literature addressing the problem of multimodal coregistration of hard- and soft-tissue in microscopy and tomography. Grouped according to the complexity of the dimensions, including image-to-volume (2D ⟶ 3D), image-to-image (2D ⟶ 2D), and volume-to-volume (3D ⟶ 3D), selected currently applied approaches are investigated by comparing the method accuracy with respect to the limiting resolution of the tomography. Correlation of multimodal imaging could position itself as a useful tool allowing for precise histological diagnostic and allow the a priori planning of tissue extraction like biopsies
- …