200 research outputs found
On the limitations of the univariate marginal distribution algorithm to deception and where bivariate EDAs might help
We introduce a new benchmark problem called Deceptive Leading Blocks (DLB) to
rigorously study the runtime of the Univariate Marginal Distribution Algorithm
(UMDA) in the presence of epistasis and deception. We show that simple
Evolutionary Algorithms (EAs) outperform the UMDA unless the selective pressure
$\mu/\lambda$ is extremely high, where $\mu$ and $\lambda$ are the parent and
offspring population sizes, respectively. More precisely, we show that the UMDA
with a parent population size of $\mu = \Omega(\log n)$ has an expected runtime
of $e^{\Omega(\mu)}$ on the DLB problem assuming any selective pressure
$\mu/\lambda \geq \frac{14}{1000}$, as opposed to the expected runtime
of $\mathcal{O}(n\lambda \log \lambda + n^3)$ for the non-elitist
$(\mu,\lambda)$ EA with $\mu/\lambda \leq 1/e$. These results illustrate
inherent limitations of univariate EDAs against deception and epistasis, which
are common characteristics of real-world problems. In contrast, empirical
evidence reveals the efficiency of the bi-variate MIMIC algorithm on the DLB
problem. Our results suggest that one should consider EDAs with more complex
probabilistic models when optimising problems with some degree of epistasis and
deception.

Comment: To appear in the 15th ACM/SIGEVO Workshop on Foundations of Genetic
Algorithms (FOGA XV), Potsdam, Germany.
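The UMDA maintains a product distribution over bit positions and updates each marginal from the selected parents. As a minimal illustrative sketch (run here on the classic OneMax problem rather than the paper's DLB problem, with hypothetical parameter choices), a Python version might look like:

```python
import random

def umda_onemax(n=20, mu=20, lam=100, max_gens=200, seed=0):
    """Minimal UMDA maximising OneMax (number of 1-bits in a bitstring)."""
    rng = random.Random(seed)
    p = [0.5] * n                      # marginal probability of a 1 per position
    lo, hi = 1.0 / n, 1.0 - 1.0 / n    # margins keep marginals away from 0 and 1
    best = None
    for _ in range(max_gens):
        # Sample lambda offspring from the univariate product distribution.
        pop = [[1 if rng.random() < p[i] else 0 for i in range(n)]
               for _ in range(lam)]
        pop.sort(key=sum, reverse=True)
        parents = pop[:mu]             # truncation selection: keep the mu fittest
        best = parents[0]
        if sum(best) == n:
            break
        # Update each marginal to the 1-frequency among the parents, clamped.
        p = [min(hi, max(lo, sum(x[i] for x in parents) / mu))
             for i in range(n)]
    return best
```

Because the model is fully factorised, no dependency between positions can be represented, which is exactly the limitation the abstract exposes on problems with epistasis and deception.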
Distributed Bayesian Computation and Self-Organized Learning in Sheets of Spiking Neurons with Local Lateral Inhibition
During the last decade, Bayesian probability theory has emerged as a framework in cognitive science and neuroscience for describing perception, reasoning and learning in mammals. However, our understanding of how probabilistic computations could be organized in the brain, and of how the observed connectivity structure of cortical microcircuits supports these calculations, is rudimentary at best. In this study, we investigate statistical inference and self-organized learning in a spatially extended spiking network model that accommodates both local competitive and large-scale associative aspects of neural information processing, under a unified Bayesian account. Specifically, we show how the spiking dynamics of a recurrent network with lateral excitation and local inhibition, in response to distributed spiking input, can be understood as sampling from a variational posterior distribution of a well-defined implicit probabilistic model. This interpretation further permits a rigorous analytical treatment of experience-dependent plasticity at the network level. Using machine learning theory, we derive update rules for neuron and synapse parameters which equate with Hebbian synaptic and homeostatic intrinsic plasticity rules in a neural implementation. In computer simulations, we demonstrate that the interplay of these plasticity rules leads to the emergence of probabilistic local experts that form distributed assemblies of similarly tuned cells communicating through lateral excitatory connections. The resulting sparse distributed spike code of a well-adapted network carries compressed information on salient input features combined with prior experience on correlations among them. Our theory predicts that the emergence of such efficient representations benefits from network architectures in which the range of local inhibition matches the spatial extent of pyramidal cells that share common afferent input.
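As a loose caricature of the competitive mechanism described above (not the paper's full spiking model), a pool with local lateral inhibition can be sketched as a soft winner-take-all layer: per input, one unit "spikes", sampled from a softmax over membrane potentials, and a Hebbian rule moves the winner's weights toward the input so that local experts emerge. All names and parameters below are hypothetical:

```python
import math
import random

def softmax(u):
    m = max(u)
    e = [math.exp(v - m) for v in u]
    s = sum(e)
    return [v / s for v in e]

def wta_learn(patterns, k=2, eta=0.1, steps=500, seed=1):
    """Soft winner-take-all layer with Hebbian learning: k units compete via
    lateral inhibition; the sampled winner adapts toward the current input."""
    rng = random.Random(seed)
    d = len(patterns[0])
    w = [[rng.random() for _ in range(d)] for _ in range(k)]
    for _ in range(steps):
        x = rng.choice(patterns)
        # Membrane potentials: feedforward drive of each unit.
        u = [sum(wi * xi for wi, xi in zip(w[j], x)) for j in range(k)]
        # Lateral inhibition: exactly one unit spikes, sampled from softmax(u),
        # which can be read as sampling a hidden cause from a posterior.
        probs = softmax(u)
        r, acc, win = rng.random(), 0.0, k - 1
        for j, pj in enumerate(probs):
            acc += pj
            if r < acc:
                win = j
                break
        # Hebbian update: move the winner's weights toward the input.
        w[win] = [wi + eta * (xi - wi) for wi, xi in zip(w[win], x)]
    return w
```

With two disjoint input patterns, the two units typically specialise into complementary experts, a miniature analogue of the distributed assemblies described in the abstract.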
Massively-Parallel Feature Selection for Big Data
We present the Parallel, Forward-Backward with Pruning (PFBP) algorithm for
feature selection (FS) in Big Data settings (high dimensionality and/or sample
size). To tackle the challenges of Big Data FS, PFBP partitions the data matrix
both in terms of rows (samples, training examples) as well as columns
(features). By employing the concepts of p-values of conditional independence
tests and meta-analysis techniques, PFBP manages to rely only on computations
local to a partition while minimizing communication costs. Then, it employs
powerful and safe (asymptotically sound) heuristics to make early, approximate
decisions, such as Early Dropping of features from consideration in subsequent
iterations, Early Stopping of consideration of features within the same
iteration, or Early Return of the winner in each iteration. PFBP provides
asymptotic guarantees of optimality for data distributions faithfully
representable by a causal network (Bayesian network or maximal ancestral
graph). Our empirical analysis confirms a super-linear speedup of the algorithm
with increasing sample size, linear scalability with respect to the number of
features and processing cores, while dominating other competitive algorithms in
its class.
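The Early Dropping heuristic can be sketched in a few lines: any feature whose independence-test p-value exceeds a threshold is removed from all subsequent iterations. The toy below uses unconditional Pearson tests plus residualisation as a stand-in for PFBP's conditional independence tests and meta-analysis combining; function names and the alpha parameter are illustrative, not from the paper:

```python
import math
import random

def corr_pvalue(x, y):
    """Two-sided p-value for a Pearson correlation (Fisher z approximation)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sx = math.sqrt(sum((v - mx) ** 2 for v in x))
    sy = math.sqrt(sum((v - my) ** 2 for v in y))
    if sx == 0.0 or sy == 0.0:
        return 1.0
    r = sum((a - mx) * (b - my) for a, b in zip(x, y)) / (sx * sy)
    r = max(-0.999999, min(0.999999, r))
    z = 0.5 * math.log((1 + r) / (1 - r)) * math.sqrt(n - 3)
    return math.erfc(abs(z) / math.sqrt(2))

def forward_select_early_drop(X, y, alpha=0.05):
    """Forward selection with Early Dropping: a feature whose p-value exceeds
    alpha is removed from consideration in all subsequent iterations."""
    n = len(y)
    alive = set(range(len(X)))
    selected, y = [], list(y)
    while alive:
        pvals = {j: corr_pvalue(X[j], y) for j in alive}
        alive = {j for j in alive if pvals[j] <= alpha}   # Early Dropping
        if not alive:
            break
        best = min(alive, key=lambda j: pvals[j])
        selected.append(best)
        alive.discard(best)
        # Residualise y on the chosen feature so later tests approximate
        # conditional independence given the selected set.
        bx = X[best]
        mb, my = sum(bx) / n, sum(y) / n
        beta = (sum((a - mb) * (b - my) for a, b in zip(bx, y))
                / sum((v - mb) ** 2 for v in bx))
        y = [(b - my) - beta * (a - mb) for a, b in zip(bx, y)]
    return selected
```

Dropped features are never re-tested, which is what makes the heuristic cheap on partitioned data; PFBP additionally combines per-partition p-values with meta-analysis, which this sketch omits.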
Genetic Information in Agricultural Productivity and Product Development
A prominent facet of recent changes in agriculture has been the advent of precision breeding techniques. Another has been an increase in the level of information inputs and outputs associated with agricultural production. This paper identifies ways in which these features may complement one another in expanding the variety of processed products, the level of productivity, and the rate of change in productivity. Using a martingale concept of "more information," we identify conditions under which more information increases the incentives to invest and engage in product differentiation. A theory on how genetic uniformity can enhance the rate of learning through process experimentation, and so the rate of technical change, is also developed.

Keywords: experimentation, genetics, information, martingale, sorting, uniformity, value added.
Simulation-based Bayesian inference for multi-fingered robotic grasping
Multi-fingered robotic grasping is an undeniable stepping stone to universal picking and dexterous manipulation. Yet, multi-fingered grippers remain challenging to control because of their rich nonsmooth contact dynamics or because of sensor noise. In this work, we aim to plan hand configurations by performing Bayesian posterior inference through the full stochastic forward simulation of the robot in its environment, hence robustly accounting for many of the uncertainties in the system. While previous methods either relied on simplified surrogates of the likelihood function or attempted to learn to directly predict maximum likelihood estimates, we bring a novel simulation-based approach for full Bayesian inference based on a deep neural network surrogate of the likelihood-to-evidence ratio. Hand configurations are found by directly optimizing through the resulting amortized and differentiable expression for the posterior. The geometry of the configuration space is accounted for by proposing a Riemannian manifold optimization procedure through the neural posterior. Simulation and physical benchmarks demonstrate the high success rate of the procedure.
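The likelihood-to-evidence ratio trick mentioned above trains a classifier to distinguish true joint pairs (theta, x) from pairs with the pairing broken; the classifier's logit then estimates log p(x | theta) - log p(x), which turns the posterior into prior times an amortised, differentiable ratio. A toy one-dimensional sketch (hypothetical simulator and features, plain logistic regression standing in for the paper's deep network) might be:

```python
import numpy as np

def train_ratio_estimator(n=4000, iters=5000, lr=0.1, seed=0):
    """Classifier-based likelihood-to-evidence ratio estimation on a toy
    problem: prior theta ~ N(0, 1), simulator x = theta + N(0, 0.5) noise.
    Label 1 = true (theta, x) pairs, label 0 = pairs with theta shuffled."""
    rng = np.random.default_rng(seed)
    theta = rng.normal(0.0, 1.0, n)
    x = theta + rng.normal(0.0, 0.5, n)
    theta_m = rng.permutation(theta)          # break the pairing
    T = np.concatenate([theta, theta_m])
    Xs = np.concatenate([x, x])
    labels = np.concatenate([np.ones(n), np.zeros(n)])
    # Quadratic features: for Gaussians the optimal logit is quadratic in
    # (theta, x), so logistic regression on these features can represent it.
    F = np.stack([T, Xs, T * Xs, T ** 2, Xs ** 2, np.ones(2 * n)], axis=1)
    w = np.zeros(F.shape[1])
    for _ in range(iters):                    # full-batch gradient descent
        p = 1.0 / (1.0 + np.exp(-F @ w))
        w -= lr * F.T @ (p - labels) / (2 * n)
    return w

def log_ratio(w, theta, x):
    """Classifier logit = estimated log [p(x | theta) / p(x)]."""
    f = np.array([theta, x, theta * x, theta ** 2, x ** 2, 1.0])
    return float(f @ w)
```

For an observation x, the unnormalised log-posterior is log-prior plus `log_ratio(w, theta, x)` and can be maximised by grid search or gradient ascent; the paper instead optimises through a deep-network surrogate over a Riemannian manifold of hand configurations.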
A multiple-phenotype imputation method for genetic studies
Genetic association studies have yielded a wealth of biologic discoveries. However, these have mostly analyzed one trait and one SNP at a time, thus failing to capture the underlying complexity of these datasets. Joint genotype-phenotype analyses of complex, high-dimensional datasets represent an important way to move beyond simple GWAS with great potential. The move to high-dimensional phenotypes will raise many new statistical problems. In this paper we address the central issue of missing phenotypes in studies with any level of relatedness between samples. We propose a multiple-phenotype mixed model and use a computationally efficient variational Bayesian algorithm to fit the model. On a variety of simulated and real datasets from a range of organisms and trait types, we show that our method outperforms existing state-of-the-art methods from the statistics and machine learning literature and can boost signals of association.
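The paper fits a multiple-phenotype mixed model with variational Bayes; as a much smaller illustration of the underlying idea of borrowing strength across correlated phenotypes, the sketch below imputes a missing phenotype by the conditional mean of a multivariate Gaussian fitted to the complete rows. This is an assumption-laden stand-in, not the authors' method (it ignores relatedness between samples entirely):

```python
import numpy as np

def impute_phenotype(Y, target_col):
    """Impute missing values of one phenotype from correlated phenotypes via
    the conditional mean of a multivariate Gaussian fitted to complete rows.
    Y: (n_samples, n_phenotypes) array with np.nan marking missing entries;
    rows with a missing target are assumed observed on the other columns."""
    obs = ~np.isnan(Y[:, target_col])
    complete = Y[obs]                       # rows where the target is observed
    mu = complete.mean(axis=0)
    cov = np.cov(complete, rowvar=False)
    other = [j for j in range(Y.shape[1]) if j != target_col]
    S_oo = cov[np.ix_(other, other)]
    S_to = cov[target_col, other]
    w = np.linalg.solve(S_oo, S_to)         # regression weights S_oo^{-1} S_ot
    out = Y.copy()
    for i in np.where(~obs)[0]:
        # Conditional mean: mu_t + S_to S_oo^{-1} (y_o - mu_o)
        out[i, target_col] = mu[target_col] + w @ (Y[i, other] - mu[other])
    return out
```

The more strongly the target phenotype correlates with the observed ones, the more accurate this conditional-mean imputation becomes, which is the same leverage the mixed model exploits at scale.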