Structural bias in population-based algorithms
Challenging optimisation problems are abundant in all areas of science and industry. Since the 1950s, scientists have responded to this by developing ever-diversifying families of 'black box' optimisation algorithms. The latter are designed to be able to address any optimisation problem, requiring only that the quality of any candidate solution can be calculated via a 'fitness function' specific to the problem. For such algorithms to be successful, at least three properties are required: (i) an effective informed sampling strategy, that guides the generation of new candidates on the basis of the fitnesses and locations of previously visited candidates; (ii) mechanisms to ensure efficiency, so that (for example) the same candidates are not repeatedly visited; and (iii) the absence of structural bias, which, if present, would predispose the algorithm towards limiting its search to specific regions of the solution space. The first two of these properties have been extensively investigated; however, the third is little understood and rarely explored. In this article we provide theoretical and empirical analyses that contribute to the understanding of structural bias. In particular, we state and prove a theorem concerning the dynamics of population variance in the case of real-valued search spaces and a 'flat' fitness landscape. This reveals how structural bias can arise and manifest as non-uniform clustering of the population over time. Critically, theory predicts that structural bias is exacerbated by (independently) increasing population size and increasing problem difficulty. These predictions, supported by our empirical analyses, reveal two previously unrecognised aspects of structural bias that would seem vital for algorithm designers and practitioners.
Respectively, (i) increasing the population size, though ostensibly promoting diversity, will magnify any inherent structural bias, and (ii) the effects of structural bias are more apparent when faced with (many classes of) 'difficult' problems. Our theoretical result also contributes to the 'exploitation/exploration' conundrum in optimisation algorithm design, by suggesting that two commonly used approaches to enhancing exploration - increasing the population size, and increasing the disruptiveness of search operators - have quite distinct implications in terms of structural bias.
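The flat-landscape setting analysed in this abstract can be illustrated with a minimal sketch. The toy model below is an assumption for illustration, not the paper's algorithm: on a flat fitness landscape every candidate is equally fit, so selection degenerates to resampling parents uniformly at random, and genetic drift alone collapses population variance over generations (structural bias proper additionally concerns *where* the population clusters, which this sketch does not measure).

```python
import random
import statistics

def flat_landscape_variance(pop_size, generations, mutation_sd=0.001, seed=0):
    """Track population variance of a toy real-valued population on a
    flat fitness landscape, where 'selection' is uniform resampling."""
    rng = random.Random(seed)
    pop = [rng.random() for _ in range(pop_size)]
    variances = [statistics.pvariance(pop)]
    for _ in range(generations):
        # All fitnesses are equal, so each offspring copies a uniformly
        # chosen parent, plus a small Gaussian mutation.
        pop = [rng.choice(pop) + rng.gauss(0.0, mutation_sd)
               for _ in range(pop_size)]
        variances.append(statistics.pvariance(pop))
    return variances

var = flat_landscape_variance(pop_size=20, generations=200)
print(var[0], var[-1])  # drift collapses variance far below its initial value
```

Even with no selective pressure at all, the variance settles near the mutation scale rather than staying at its initial value, which is the baseline dynamic against which structural bias (non-uniform clustering) is detected.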
Data-Adaptive Estimation for Double-Robust Methods in Population-Based Cancer Epidemiology: Risk Differences for Lung Cancer Mortality by Emergency Presentation.
In this paper, we propose a structural framework for population-based cancer epidemiology and evaluate the performance of double-robust estimators for a binary exposure in cancer mortality. We conduct numerical analyses to study the bias and efficiency of these estimators. Furthermore, we compare 2 different model selection strategies based on 1) Akaike's Information Criterion and the Bayesian Information Criterion and 2) machine learning algorithms, and we illustrate double-robust estimators' performance in a real-world setting. In simulations with correctly specified models and near-positivity violations, all but the naive estimators had relatively good performance. However, the augmented inverse-probability-of-treatment weighting estimator showed the largest relative bias. Under dual model misspecification and near-positivity violations, all double-robust estimators were biased. Nevertheless, the targeted maximum likelihood estimator showed the best bias-variance trade-off, more precise estimates, and appropriate 95% confidence interval coverage, supporting the use of the data-adaptive model selection strategies based on machine learning algorithms. We applied these methods to estimate adjusted 1-year mortality risk differences in 183,426 lung cancer patients diagnosed after admission to an emergency department versus persons with a nonemergency cancer diagnosis in England (2006-2013). The adjusted mortality risk (for patients diagnosed with lung cancer after admission to an emergency department) was 16% higher in men and 18% higher in women, suggesting the importance of interventions targeting early detection of lung cancer signs and symptoms.
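The augmented inverse-probability-of-treatment weighting (AIPW) estimator evaluated above can be sketched in a deliberately simplified form. The sketch below is an assumption for illustration, not the paper's implementation: it uses a single binary confounder W with saturated (stratified-mean) nuisance models, and simulated data whose true risk difference is 0.30 by construction.

```python
import random

def simulate(n, rng):
    """Simulated data: binary confounder W, exposure A, outcome Y.
    True risk difference P(Y=1|do(A=1)) - P(Y=1|do(A=0)) = 0.30."""
    data = []
    for _ in range(n):
        w = rng.random() < 0.5
        a = rng.random() < (0.3 + 0.4 * w)          # W confounds A
        y = rng.random() < (0.2 + 0.3 * a + 0.2 * w)  # W confounds Y
        data.append((int(w), int(a), int(y)))
    return data

def aipw_risk_difference(data):
    """Double-robust AIPW estimate of the risk difference, with
    saturated nuisance estimates for one binary confounder:
    g(w) = P(A=1|W=w), q[(a,w)] = E[Y|A=a, W=w]."""
    def mean(vals):
        return sum(vals) / len(vals)
    g = {w: mean([a for (wi, a, _) in data if wi == w]) for w in (0, 1)}
    q = {(a, w): mean([y for (wi, ai, y) in data if wi == w and ai == a])
         for a in (0, 1) for w in (0, 1)}
    n = len(data)
    # Outcome-model prediction plus inverse-probability-weighted residual.
    psi1 = sum(a / g[w] * (y - q[(1, w)]) + q[(1, w)]
               for (w, a, y) in data) / n
    psi0 = sum((1 - a) / (1 - g[w]) * (y - q[(0, w)]) + q[(0, w)]
               for (w, a, y) in data) / n
    return psi1 - psi0

rng = random.Random(1)
rd = aipw_risk_difference(simulate(50_000, rng))
print(round(rd, 3))
```

With real covariates the nuisance functions would be fitted with the data-adaptive learners the abstract compares, rather than stratum means; the double-robust structure of the estimating equation is the same.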
Robust model-based analysis of single-particle tracking experiments with Spot-On.
Single-particle tracking (SPT) has become an important method to bridge biochemistry and cell biology since it allows direct observation of protein binding and diffusion dynamics in live cells. However, accurately inferring information from SPT studies is challenging due to biases in both data analysis and experimental design. To address analysis bias, we introduce 'Spot-On', an intuitive web interface. Spot-On implements a kinetic modeling framework that accounts for known biases, including molecules moving out-of-focus, and robustly infers diffusion constants and subpopulations from pooled single-molecule trajectories. To minimize inherent experimental biases, we implement and validate stroboscopic photo-activation SPT (spaSPT), which minimizes motion-blur bias and tracking errors. We validate Spot-On using experimentally realistic simulations and show that Spot-On outperforms other methods. We then apply Spot-On to spaSPT data from live mammalian cells spanning a wide range of nuclear dynamics and demonstrate that Spot-On consistently and robustly infers subpopulation fractions and diffusion constants.
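The core of the kinetic modeling idea can be sketched with a stripped-down two-state (bound/free) jump-length fit. All numbers and simplifications below are illustrative assumptions, not Spot-On's actual implementation: the diffusion coefficients are held fixed and only the bound fraction is fitted, and the out-of-focus and motion-blur corrections that Spot-On applies are omitted.

```python
import math
import random

DT = 0.01                     # frame interval in seconds (illustrative)
D_BOUND, D_FREE = 0.05, 2.0   # diffusion coefficients in um^2/s (assumed known)

def simulate_jumps(n, f_bound, rng):
    """Draw 2D jump lengths from a two-state (bound/free) Brownian mixture."""
    jumps = []
    for _ in range(n):
        d = D_BOUND if rng.random() < f_bound else D_FREE
        s = math.sqrt(2 * d * DT)          # per-axis displacement std dev
        jumps.append(math.hypot(rng.gauss(0, s), rng.gauss(0, s)))
    return jumps

def jump_pdf(r, d):
    """2D Brownian jump-length density: p(r) = r/(2 D dt) exp(-r^2/(4 D dt))."""
    return r / (2 * d * DT) * math.exp(-r * r / (4 * d * DT))

def fit_bound_fraction(jumps):
    """Grid-search maximum likelihood for the bound fraction, D's fixed."""
    best_f, best_ll = 0.0, -math.inf
    for i in range(51):
        f = i / 50
        ll = sum(math.log(max(f * jump_pdf(r, D_BOUND)
                              + (1 - f) * jump_pdf(r, D_FREE), 1e-300))
                 for r in jumps)
        if ll > best_ll:
            best_f, best_ll = f, ll
    return best_f

rng = random.Random(7)
jumps = simulate_jumps(10_000, f_bound=0.3, rng=rng)
print(fit_bound_fraction(jumps))
```

Because the two diffusion coefficients differ by a factor of 40 here, the mixture is nearly separable and the pooled jump-length histogram alone recovers the subpopulation fraction; the harder, bias-corrected joint fit is what the full framework provides.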
On The Stability of Interpretable Models
Interpretable classification models are built with the purpose of providing a comprehensible description of the decision logic to an external oversight agent. When considered in isolation, a decision tree, a set of classification rules, or a linear model is widely recognized as human-interpretable. However, such models are generated as part of a larger analytical process. Bias in data collection and preparation, or in a model's construction, may severely affect the accountability of the design process. We conduct an experimental study of the stability of interpretable models with respect to feature selection, instance selection, and model selection. Our conclusions should raise the scientific community's awareness of the need for a stability impact assessment of interpretable models.
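One of the stability notions studied here, stability with respect to instance selection, can be sketched with a simple bootstrap protocol. The score and data below are illustrative assumptions, not the paper's experimental setup: features are selected by absolute correlation with the label on each bootstrap resample, and stability is the mean pairwise Jaccard similarity of the selected feature sets.

```python
import math
import random

def top_k_by_correlation(xs, ys, k):
    """Rank features by |Pearson correlation| with the label; keep top k."""
    n, d = len(xs), len(xs[0])
    def corr(j):
        col = [row[j] for row in xs]
        mx, my = sum(col) / n, sum(ys) / n
        cov = sum((col[i] - mx) * (ys[i] - my) for i in range(n))
        vx = sum((v - mx) ** 2 for v in col)
        vy = sum((y - my) ** 2 for y in ys)
        return abs(cov) / math.sqrt(vx * vy) if vx > 0 and vy > 0 else 0.0
    return set(sorted(range(d), key=corr, reverse=True)[:k])

def jaccard(a, b):
    return len(a & b) / len(a | b)

def selection_stability(xs, ys, k=3, n_boot=20, seed=0):
    """Mean pairwise Jaccard similarity of feature sets selected on
    bootstrap resamples: 1.0 = perfectly stable selection."""
    rng = random.Random(seed)
    n = len(xs)
    sets = []
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]
        sets.append(top_k_by_correlation([xs[i] for i in idx],
                                         [ys[i] for i in idx], k))
    pairs = [(i, j) for i in range(n_boot) for j in range(i + 1, n_boot)]
    return sum(jaccard(sets[i], sets[j]) for i, j in pairs) / len(pairs)

# Toy data: 10 features, of which only the first 3 drive the label.
rng = random.Random(1)
xs = [[rng.gauss(0, 1) for _ in range(10)] for _ in range(200)]
ys = [row[0] + row[1] + row[2] + rng.gauss(0, 0.5) for row in xs]
stability = selection_stability(xs, ys, k=3)
print(round(stability, 2))
```

With a strong signal the selected set barely changes across resamples and the score sits near 1.0; with correlated or weak features the same protocol exposes instability, which is precisely what an oversight-facing explanation should be audited for.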
Selection of earthquake ground motions for multiple objectives using genetic algorithms
Existing earthquake ground motion (GM) selection methods for the seismic assessment of structural systems focus on spectral compatibility in terms of either only central values or both central values and variability. In this way, important selection criteria related to the seismology of the region, local soil conditions, strong GM intensity and duration as well as the magnitude of scale factors are considered only indirectly by setting them as constraints in the pre-processing phase in the form of permissible ranges. In this study, a novel framework for the optimum selection of earthquake GMs is presented, where the aforementioned criteria are treated explicitly as selection objectives. The framework is based on the principles of multi-objective optimization, addressed with the aid of the Weighted Sum Method, which supports decision making both in the pre-processing and post-processing phase of the GM selection procedure. The solution of the derived equivalent single-objective optimization problem is performed by the application of a mixed-integer Genetic Algorithm, and the effects of its parameters on the efficiency of the selection procedure are investigated. Application of the proposed framework shows that it can identify GM sets that not only provide excellent spectral matching but also explicitly satisfy a set of additional criteria simultaneously.
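The two building blocks named above, weighted-sum scalarization and an integer-coded genetic algorithm over record indices, can be sketched together. Everything below is an illustrative assumption rather than the paper's framework: the record pool, the two objective values per record (a spectral-misfit measure and a scale-factor penalty, pre-normalised to [0, 1]), the weights, and the GA operators are all hypothetical.

```python
import random

rng = random.Random(3)
# Hypothetical pool of 40 candidate records, each with two objective values.
POOL = [(rng.random(), rng.random()) for _ in range(40)]
K = 7                # records per GM set
W = (0.7, 0.3)       # illustrative objective weights

def score(subset):
    """Weighted Sum Method: collapse the set-level objectives (mean misfit,
    mean scale penalty) into one scalar to minimise with a single-objective GA."""
    o1 = sum(POOL[i][0] for i in subset) / K
    o2 = sum(POOL[i][1] for i in subset) / K
    return W[0] * o1 + W[1] * o2

def ga_select(pop_size=30, generations=60):
    """Integer-coded GA: a chromosome is a list of K distinct record indices."""
    pop = [rng.sample(range(len(POOL)), K) for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=score)                   # elitist truncation selection
        survivors = pop[:pop_size // 2]
        children = []
        while len(survivors) + len(children) < pop_size:
            a, b = rng.sample(survivors, 2)
            # Union crossover keeps indices distinct by construction.
            child = rng.sample(sorted(set(a) | set(b)), K)
            if rng.random() < 0.3:            # integer mutation
                repl = rng.randrange(len(POOL))
                if repl not in child:
                    child[rng.randrange(K)] = repl
            children.append(child)
        pop = survivors + children
    return min(pop, key=score)

best = ga_select()
print(sorted(best), round(score(best), 3))
```

Because the objectives here are simple means over the selected records, the scalarized problem has a known optimum (the K records with the smallest weighted individual values), which makes this toy a convenient check that the GA machinery converges before swapping in real spectral-matching objectives.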