Geometric Semantic Grammatical Evolution

Author: A Moraglio
C Ryan
K Krawiec
L Beadle
M Castelli
M O’Neill
NQ Uy
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 12/09/2018

This is the author accepted manuscript. The final version is available from Springer via the DOI in this record. Geometric Semantic Genetic Programming (GSGP) is a novel form of Genetic Programming (GP), based on a geometric theory of evolutionary algorithms, which directly searches the semantic space of programs. In this chapter, we extend this framework to Grammatical Evolution (GE) and refer to the new method as Geometric Semantic Grammatical Evolution (GSGE). We formally derive new mutation and crossover operators for GE which are guaranteed to see a simple unimodal fitness landscape. This surprising result shows that the GE genotype-phenotype mapping does not necessarily imply low genotype-fitness locality. To complement the theory, we present extensive experimental results on three standard domains (Boolean, Arithmetic and Classifier).
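As an illustration of what "searching the semantic space" means, the following is a minimal sketch of the Boolean geometric semantic crossover known from the GSGP literature, not the GE operators derived in the chapter. The names `gs_crossover` and `semantics`, and the fixed `mask` function (normally a randomly generated program), are assumptions made for this example.

```python
import itertools

def gs_crossover(t1, t2, mask):
    """Boolean geometric semantic crossover: (t1 AND mask) OR (NOT mask AND t2).

    For every input, the child copies the output of t1 where mask is true
    and the output of t2 where mask is false.
    """
    return lambda *x: (t1(*x) and mask(*x)) or ((not mask(*x)) and t2(*x))

def semantics(prog, n_inputs):
    """Output vector of a program over all 2^n Boolean input combinations."""
    return [bool(prog(*bits))
            for bits in itertools.product([False, True], repeat=n_inputs)]

def hamming(u, v):
    """Number of positions where two output vectors differ."""
    return sum(a != b for a, b in zip(u, v))

# Hypothetical parents and mask, chosen only for illustration.
parent1 = lambda a, b: a and b
parent2 = lambda a, b: a or b
mask = lambda a, b: a != b

child = gs_crossover(parent1, parent2, mask)
s1, s2, sc = (semantics(p, 2) for p in (parent1, parent2, child))
```

Because each output bit of the child comes from one of the parents, the child's output vector lies on a Hamming segment between the parents' output vectors, which is the property that makes the fitness landscape seen by such operators simple.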

Crossref

Open Research Exeter

Analysing Symbolic Regression Benchmarks under a Meta-Learning Approach

Author: Martins Joao Francisco Barreto da Silva
Miranda Luis Fernando
Oliveira Luiz Otavio Vilas Boas
Pappa Gisele Lobo
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 25/05/2018

The definition of a concise and effective testbed for Genetic Programming (GP) is a recurrent matter in the research community. This paper takes a new step in this direction, proposing a different approach to measure the quality of symbolic regression benchmarks quantitatively. The proposed approach is based on meta-learning and uses a set of dataset meta-features (such as the number of examples or output skewness) to describe the datasets. Our idea is to correlate these meta-features with the errors obtained by a GP method. These meta-features define a space of benchmarks that should, ideally, have datasets (points) covering different regions of the space. An initial analysis of 63 datasets showed that current benchmarks are concentrated in a small region of this benchmark space. We also found that the number of instances and output skewness are the meta-features most relevant to GP output error. Both conclusions can help define which datasets should compose an effective testbed for symbolic regression methods. Comment: 8 pages, 3 Figures, Proceedings of Genetic and Evolutionary Computation Conference Companion, Kyoto, Japan
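The idea of correlating meta-features with GP errors can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's pipeline: the function names are hypothetical, skewness is the population (moment-based) definition, and Pearson correlation stands in for whatever association measure the authors used.

```python
import math

def skewness(values):
    """Population skewness: third central moment divided by variance^1.5."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return 0.0 if m2 == 0 else m3 / m2 ** 1.5

def meta_features(outputs):
    """Two of the meta-features named in the abstract, for one dataset."""
    return {"n_examples": len(outputs), "output_skewness": skewness(outputs)}

def pearson(xs, ys):
    """Pearson correlation; assumes neither sequence is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def correlate(datasets, errors, feature):
    """Correlate one meta-feature across datasets with per-dataset GP error.

    `datasets` is a list of output vectors; `errors` is a list of error
    values measured elsewhere (placeholders supplied by the caller).
    """
    feats = [meta_features(d)[feature] for d in datasets]
    return pearson(feats, errors)
```

A call such as `correlate(list_of_output_vectors, list_of_gp_errors, "output_skewness")` would then give one number per meta-feature, which is the kind of quantity the abstract uses to rank meta-features by relevance.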

arXiv.org e-Print Archive

Crossref

Automated Problem Decomposition for the Boolean Domain with Genetic Programming

Author: A. Moraglio
D. Jackson
E. Hemberg
J. Walker
M. Keijzer
M. O’Neill
S.C. Roberts
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013

Researchers have been interested in exploring the regularities and modularity of the problem space in genetic programming (GP) with the aim of decomposing the original problem into several smaller subproblems. The main motivation is to allow GP to deal with more complex problems. Most previous works on modularity in GP emphasise the structure of modules used to encapsulate code and/or promote code reuse, rather than the decomposition of the original problem. In this paper we propose a problem decomposition strategy that allows the use of a GP search to find solutions for subproblems and combine the individual solutions into the complete solution to the problem.

CiteSeerX

Crossref

Kent Academic Repository

From holism to compositionality: memes and the evolution of segmentation, syntax, and signification in music and language

Author: Adkins
Allanbrook
Blackmore
Boas
Byros
Calvin
Chantler
Chomsky
Cooke
Cross
Crystal
Darwin
Dawkins
Dawkins
Dawkins
Deliège
Dennett
Dennett
Dissanayake
Distin
Durham
Durrell
Fodor
Gjerdingen
Goldberg
Hebb
Jan
Jan
Kirby
Kramer
Leman
Leng
Lerdahl
Lynch
Matyja
Meyer
Meyer
Miller
Mithen
Monelle
Mountcastle
Narmour
Nattiez
Patel
Pinker
Ratner
Rice
Schenker
Scruton
Shapiro
Snyder
Snyder
Steven Jan
Strunk
Temperley
Truss
Whorf
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 01/03/2015

Steven Mithen argues that language evolved from an antecedent he terms “Hmmmmm, [meaning it was] Holistic, manipulative, multi-modal, musical and mimetic”. Owing to certain innate and learned factors, a capacity for segmentation and cross-stream mapping in early Homo sapiens broke the continuous line of Hmmmmm, creating discrete replicated units which, with the initial support of Hmmmmm, eventually became the semantically freighted words of modern language. That which remained after what was a bifurcation of Hmmmmm arguably survived as music, existing as a sound stream segmented into discrete units, although one without the explicit and relatively fixed semantic content of language. All three types of utterance – the parent Hmmmmm, language, and music – are amenable to a memetic interpretation which applies Universal Darwinism to what are understood as language and musical memes. On the basis of Peter Carruthers’ distinction between ‘cognitivism’ and ‘communicativism’ in language, and William Calvin’s theories of cortical information encoding, a framework is hypothesized for the semantic and syntactic associations between, on the one hand, the sonic patterns of language memes (‘lexemes’) and of musical memes (‘musemes’) and, on the other hand, ‘mentalese’ conceptual structures, in Chomsky’s ‘Logical Form’ (LF).

Crossref

University of Huddersfield Repository

Huddersfield Research Portal

Thematic Annotation: extracting concepts out of documents

Author: Andrews Pierre
Rajman Martin
Publication venue
Publication date: 29/12/2004

Contrary to standard approaches to topic annotation, the technique used in this work does not centrally rely on some sort of -- possibly statistical -- keyword extraction. In fact, the proposed annotation algorithm uses a large-scale semantic database -- the EDR Electronic Dictionary -- that provides a concept hierarchy based on hyponym and hypernym relations. This concept hierarchy is used to generate a synthetic representation of the document by aggregating the words present in topically homogeneous document segments into a set of concepts best preserving the document's content. This new extraction technique uses an unexplored approach to topic selection. Instead of using semantic similarity measures based on a semantic resource, the latter is processed to extract the part of the conceptual hierarchy relevant to the document content. Then this conceptual hierarchy is searched to extract the most relevant set of concepts to represent the topics discussed in the document. Notice that this algorithm is able to extract generic concepts that are not directly present in the document. Comment: Technical report EPFL/LIA. 81 pages, 16 figures
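To make concrete how a hypernym hierarchy can yield a generic concept that never appears in the text, here is a minimal sketch. It is not the report's algorithm and does not use the EDR dictionary: the toy `HYPERNYM` table and the function names are invented for illustration, and the aggregation is reduced to finding the most specific shared ancestor of a set of words.

```python
# Toy child -> parent (hypernym) table; a stand-in for a real concept hierarchy.
HYPERNYM = {
    "dog": "mammal",
    "cat": "mammal",
    "mammal": "animal",
    "sparrow": "bird",
    "bird": "animal",
    "animal": "entity",
}

def ancestors(concept):
    """Chain from a concept up to the root of the hierarchy, inclusive."""
    chain = [concept]
    while chain[-1] in HYPERNYM:
        chain.append(HYPERNYM[chain[-1]])
    return chain

def lowest_common_hypernym(words):
    """Most specific concept that subsumes every word, or None if there is none."""
    common = None
    for word in words:
        chain = ancestors(word)
        if common is None:
            common = chain
        else:
            members = set(chain)
            # Keep the order of the first chain, so index 0 stays most specific.
            common = [c for c in common if c in members]
    return common[0] if common else None

segment_words = ["dog", "cat"]  # words found in one hypothetical document segment
topic = lowest_common_hypernym(segment_words)
```

Here the segment mentions only "dog" and "cat", yet the selected concept is their shared hypernym, which is the behaviour the abstract highlights. A fuller treatment would also have to trade off generality against how well the concepts preserve the document's content.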

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne