Search CORE

2,605 research outputs found

Repeated sequences in linear genetic programming genomes

Author: Banzhaf W.
Langdon W.B.
Publication venue
Publication date: 01/01/2005
Field of study

Biological chromosomes are replete with repetitive sequences, micro satellites, SSR tracts, ALU, etc. in their DNA base sequences. We started looking for similar phenomena in evolutionary computation. First studies find copious repeated sequences, which can be hierarchically decomposed into shorter sequences, in programs evolved using both homologous and two point crossover but not with headless chicken crossover or other mutations. In bloated programs the small number of effective or expressed instructions appear in both repeated and nonrepeated code. Hinting that building-blocks or code reuse may evolve in unplanned ways. Mackey-Glass chaotic time series prediction and eukaryotic protein localisation (both previously used as artificial intelligence machine learning benchmarks) demonstrate evolution of Shannon information (entropy) and lead to models capable of lossy Kolmogorov compression. Our findings with diverse benchmarks and GP systems suggest this emergent phenomenon may be widespread in genetic systems

UCL Discovery

Backstreaming from oil diffusion pumps Quarterly progress report, 1 Jan. - 31 Mar. 1966

Author: Langdon W. M.
Publication venue
Publication date
Field of study

Backstreaming from oil diffusion and turbo-molecular pump

NASA Technical Reports Server

More Mouldy Data: Another mycoplasma gene jumps the silicon barrier into the human genome

Author: Arno M. J.
Langdon W. B.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 14/06/2011
Field of study

The human genome sequence database contains DNA sequences very like those of mycoplasma molds. It appears such moulds infect not only molecular Biology laboratories but were picked up by experimenters from contaminated samples and inserted into GenBank as if they were human. At least one mouldy EST (Expressed Sequence Tag) has transferred from public databases to commercial tools (Affymetrix HG-U133 plus 2.0 microarrays). We report a second example (DA466599) and suggest there is a need to clean up genomic databases but fear current tools will be inadequate to catch genes which have jumped the silicon barrier.Comment: data directory contains results of AF241217 and DA466599 blast runs by EBI in Cambridg

arXiv.org e-Print Archive

Crossref

UCL Discovery

Repeated patterns in tree genetic programming

Author: Banzhaf W.
Langdon W.B.
Publication venue: Springer-Verlag GmbH
Publication date: 01/01/2005
Field of study

We extend our analysis of repetitive patterns found in genetic programming genomes to tree based GP. As in linear GP, repetitive patterns are present in large numbers. Size fair crossover limits bloat in automatic programming, preventing the evolution of recurring motifs. We examine these complex properties in detail: e.g. using depth v. size Catalan binary tree shape plots, subgraph and subtree matching, information entropy, syntactic and semantic fitness correlations and diffuse introns. We relate this emergent phenomenon to considerations about building blocks in GP and how GP works

UCL Discovery

Backstreaming from oil diffusion pumps Final report, Dec. 1, 1963 - May 30, 1966

Author: Ivanuski V. R.
Langdon W. M.
Publication venue
Publication date
Field of study

Backstreaming from oil diffusion and turbomolecular pump

NASA Technical Reports Server

Convergence rates for the distribution of program outputs

Author: W. B. Langdon
Publication venue: Morgan Kaufmann Publishers
Publication date: 01/01/2002
Field of study

Fitness distributions (landscapes) of programs tend to a limit as they get bigger. Markov chain convergence theorems give general upper bounds on the linear program sizes needed for convergence. Tight bounds (exponential in N, N log N and smaller) are given for five computer models (any, average, cyclic, bit flip and Boolean). Mutation randomizes a genetic algorithm population in 1 4 (l + 1)(log(l) + 4) generations. Results for a genetic programming (GP) like model are confirmed by experiment.

CiteSeerX