7 research outputs found

    A New Genetic Programming Algorithm for Building Decision Tree

    Get PDF
    AbstractGenetic programming (GP) is a flexible and powerful evolutionary technique with some special features that are suitable for building a classifier of tree representation. However, unsuitable step size of editing operator will destroy the continuity of the evolution. In this paper, we propose a multiage genetic programming (MGP) algorithm to build a classifier on a given training set. Individuals are grouped into different groups according to their ages (tree size). The competitions between individuals are limited in the same groups. That prevents the structure editing operators from destroying the continuity of the evolution. The experimental results showed that the MGP algorithm is superior to the traditional genetic programming algorithm (GP) in building decision tree

    A Neat Approach To Genetic Programming

    Get PDF
    The evolution of explicitly represented topologies such as graphs involves devising methods for mutating, comparing and combining structures in meaningful ways and identifying and maintaining the necessary topological diversity. Research has been conducted in the area of the evolution of trees in genetic programming and of neural networks and some of these problems have been addressed independently by the different research communities. In the domain of neural networks, NEAT (Neuroevolution of Augmenting Topologies) has shown to be a successful method for evolving increasingly complex networks. This system\u27s success is based on three interrelated elements: speciation, marking of historical information in topologies, and initializing search in a small structures search space. This provides the dynamics necessary for the exploration of diverse solution spaces at once and a way to discriminate between different structures. Although different representations have emerged in the area of genetic programming, the study of the tree representation has remained of interest in great part because of its mapping to programming languages and also because of the observed phenomenon of unnecessary code growth or bloat which hinders performance. The structural similarity between trees and neural networks poses an interesting question: Is it possible to apply the techniques from NEAT to the evolution of trees and if so, how does it affect performance and the dynamics of code growth? In this work we address these questions and present analogous techniques to those in NEAT for genetic programming

    Visualizing Tree Structures in Genetic Programming

    Full text link
    This paper presents methods to visualize the structure of trees that occur in genetic programming. These methods allow for the inspection of structure of entire trees even though several thousands of nodes may be involved. The methods also scale to allow for the inspection of structure for entire populations and for complete trials even though millions of nodes may be involved. Examples are given that demonstrate how this new way of โ€œseeingโ€ can afford a potentially rich way of understanding dynamics that underpin genetic programming. The examples indicate further studies that might be enabled by visualizing structure at these scales.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/45620/1/10710_2005_Article_7621.pd

    Adaptive Operator Mechanism for Genetic Programming

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (๋ฐ•์‚ฌ)-- ์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› : ์ „๊ธฐยท์ปดํ“จํ„ฐ๊ณตํ•™๋ถ€, 2013. 8. Robert Ian McKay.their performances are competitive with systems without an adaptive operator mechanism. However they showed some drawbacks, which we discuss. To overcome them, we suggest three variants on operator selection, which performed somewhat better. We have investigated evaluation of operator impact in adaptive operator mechanism, which measures the impact of operator applications on improvement of solution. Hence the impact guides operator rates, evaluation of operator impact is very important in adaptive operator mechanism. There are two issues in evaluation of operator impact: the resource and the method. Basically all history information of run are able to be used as resources for the operator impact, but fitness value which is directly related with the improvement of solution, is usually used as a resource. By using a variety of problems, we used two kinds of resources: accuracy and structure in this thesis. On the other hand, although we used same resources, the evaluated impacts are different by methods. We suggested several methods of the evaluation of operator impact. Although they require only small change, they have a large effect on performance. Finally, we verified adaptive operator mechanism by applying it to a real-world applicationa modeling of algal blooms in the Nakdong River. The objective of this application is a model that describes and predicts the ecosystem of the Nakdong River. We verified it with two researches: fitting the parameters of an expert-derived model for the Nakdong River with a GA, and modeling by extending the expert-derived model with TAG3P.์œ ์ „ ํ”„๋กœ๊ทธ๋ž˜๋ฐ์€ ๋ชจ๋ธ ํ•™์Šต์— ํšจ๊ณผ์ ์ธ ์ง„ํ™” ์—ฐ์‚ฐ ์•Œ๊ณ ๋ฆฌ์ฆ˜์ด๋‹ค. ์œ ์ „ ํ”„๋กœ๊ทธ๋ž˜๋ฐ์€ ๋‹ค์–‘ํ•œ ํŒŒ๋ผ๋ฏธํ„ฐ๋ฅผ ๊ฐ€์ง€๊ณ  ์žˆ๋Š”๋ฐ, ์ด๋“ค ํŒŒ๋ผ๋ฏธํ„ฐ์˜ ๊ฐ’์€ ๋Œ€์ฒด๋กœ ์ฃผ์–ด์ง„ ๋ฌธ์ œ์— ๋งž์ถฐ ์‚ฌ์šฉ์ž๊ฐ€ ์ง์ ‘ ์กฐ์ •ํ•œ๋‹ค. ์œ ์ „ ํ”„๋กœ๊ทธ๋ž˜๋ฐ์˜ ์„ฑ๋Šฅ์€ ํŒŒ๋ผ๋ฏธํ„ฐ์˜ ๊ฐ’์— ๋”ฐ๋ผ ํฌ๊ฒŒ ์ขŒ์šฐ๋˜๊ธฐ ๋•Œ๋ฌธ์— ํŒŒ๋ผ๋ฏธํ„ฐ ์„ค์ •์— ๋Œ€ํ•œ ์—ฐ๊ตฌ๋Š” ์ง„ํ™” ์—ฐ์‚ฐ์—์„œ ๋งŽ์€ ์ฃผ๋ชฉ์„ ๋ฐ›๊ณ  ์žˆ๋‹ค. ํ•˜์ง€๋งŒ ์•„์ง๊นŒ์ง€ ํšจ๊ณผ์ ์œผ๋กœ ํŒŒ๋ผ๋ฏธํ„ฐ๋ฅผ ์„ค์ •ํ•˜๋Š” ๋ฐฉ๋ฒ•์— ๋Œ€ํ•œ ๋ณดํŽธ์ ์ธ ์ง€์นจ์ด ์—†์œผ๋ฉฐ, ๋งŽ์€ ์‹คํ—˜์„ ํ†ตํ•œ ์‹œํ–‰์ฐฉ์˜ค๋ฅผ ๊ฑฐ์น˜๋ฉด์„œ ์ ์ ˆํ•œ ํŒŒ๋ผ๋ฏธํ„ฐ ๊ฐ’์„ ์ฐพ๋Š” ๋ฐฉ๋ฒ•์ด ์ผ๋ฐ˜์ ์œผ๋กœ ์“ฐ์ด๊ณ  ์žˆ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์—์„œ ์ œ์‹œํ•˜๋Š” ์ ์‘ ์—ฐ์‚ฐ์ž ๋ฉ”์ปค๋‹ˆ์ฆ˜์€ ์—ฌ๋Ÿฌ ํŒŒ๋ผ๋ฏธํ„ฐ ์ค‘ ์œ ์ „ ์—ฐ์‚ฐ์ž์˜ ์ ์šฉ๋ฅ ์„ ์„ค์ •ํ•ด ์ฃผ๋Š” ๋ฐฉ๋ฒ•์œผ๋กœ, ํ•™์Šต ์ค‘๊ฐ„์ค‘๊ฐ„์˜ ์ƒํ™ฉ์— ๋งž์ถฐ ์—ฐ์‚ฐ์ž ์ ์šฉ๋ฅ ์„ ์ž๋™์ ์œผ๋กœ ์กฐ์ •ํ•œ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š”, ๊ธฐ์กด์˜ ์ ์‘ ์—ฐ์‚ฐ์ž ๋ฐฉ๋ฒ•์„ ๋‹ค์–‘ํ•œ ์œ ์ „ ์—ฐ์‚ฐ์ž๋ฅผ ๊ฐ€์ง„ ๋ฌธ๋ฒ• ๊ธฐ๋ฐ˜์˜ ์œ ์ „ ํ”„๋กœ๊ทธ๋ž˜๋ฐ์ธ TAG3P์— ์ ์šฉํ•˜๊ณ  ์ƒˆ๋กœ์šด ์ ์‘ ์—ฐ์‚ฐ์ž ๋ฐฉ๋ฒ•์„ ๊ฐœ๋ฐœํ•จ์œผ๋กœ์จ, ์ ์‘ ์—ฐ์‚ฐ์ž ๋ฉ”์ปค๋‹ˆ์ฆ˜์˜ ์ ์šฉ ๋ฒ”์œ„๋ฅผ ์œ ์ „ ํ”„๋กœ๊ทธ๋ž˜๋ฐ ์˜์—ญ๊นŒ์ง€ ํ™•์žฅํ•˜์˜€๋‹ค. ๊ธฐ์กด์˜ ์ ์‘ ์—ฐ์‚ฐ์ž ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ TAG3P์— ์ ์šฉ์‹œํ‚ค๋Š” ์—ฐ๊ตฌ๋Š” ์„ฑ๊ณต์ ์œผ๋กœ ์ด๋ฃจ์–ด์กŒ์œผ๋‚˜ ๋ช‡ ๊ฐ€์ง€ ๋ฌธ์ œ์ ์„ ๋“œ๋Ÿฌ๋‚ด์—ˆ๋‹ค. ์ด ๋ฌธ์ œ์ ์€ ๋ณธ๋ฌธ์—์„œ ํ›„์ˆ ํ•œ๋‹ค. ์ด ๋ฌธ์ œ์ ์„ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด ์œ ์ „์ž ์„ ํƒ์— ๋Œ€ํ•œ ์ƒˆ๋กœ์šด ๋ณ€ํ˜• ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ œ์‹œํ•˜์˜€๊ณ , ์ด๋Š” ๊ธฐ์กด ์•Œ๊ณ ๋ฆฌ์ฆ˜๊ณผ ๋น„๊ตํ•˜์—ฌ ๋” ์ข‹์€ ์„ฑ๋Šฅ์„ ๋ณด์—ฌ์ฃผ์—ˆ๋‹ค. ํ•œํŽธ์œผ๋กœ ์œ ์ „ ์—ฐ์‚ฐ์ž๊ฐ€ ํ•ด์˜ ํ–ฅ์ƒ์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ์„ ์ธก์ •ํ•˜๋Š” ์—ฐ์‚ฐ์ž ์˜ํ–ฅ๋ ฅ ํ‰๊ฐ€์— ๋Œ€ํ•œ ์—ฐ๊ตฌ๋„ ์ง„ํ–‰ํ•˜์˜€๋‹ค. ์ ์‘ ์—ฐ์‚ฐ์ž ๋ฉ”์ปค๋‹ˆ์ฆ˜์—์„œ๋Š” ์ธก์ •๋œ ์˜ํ–ฅ๋ ฅ์„ ๋ฐ”ํƒ•์œผ๋กœ ์—ฐ์‚ฐ์ž์˜ ์ ์šฉ๋ฅ ์„ ๋ณ€ํ™”์‹œํ‚ค๊ธฐ ๋•Œ๋ฌธ์— ์˜ํ–ฅ๋ ฅ ํ‰๊ฐ€๋Š” ์ ์‘ ์—ฐ์‚ฐ์ž ๋ฉ”์ปค๋‹ˆ์ฆ˜์—์„œ ๋งค์šฐ ์ค‘์š”ํ•˜๋‹ค. ์ด ์—ฐ๊ตฌ์—์„œ๋Š” ์–ด๋–ค ์ •๋ณด๋ฅผ ์ด์šฉํ•˜์—ฌ ์˜ํ–ฅ๋ ฅ์„ ์ธก์ •ํ•  ๊ฒƒ์ธ์ง€, ๊ทธ๋ฆฌ๊ณ  ์–ด๋–ค ๋ฐฉ๋ฒ•์„ ์ด์šฉํ•˜์—ฌ ์˜ํ–ฅ๋ ฅ์„ ์ธก์ •ํ•  ๊ฒƒ์ธ์ง€์˜ ๋‘ ๊ฐ€์ง€ ์ฃผ์š” ์Ÿ์ ์„ ๋‹ค๋ฃฌ๋‹ค. ์—ฐ์‚ฐ์ž ์˜ํ–ฅ๋ ฅ ํ‰๊ฐ€์—๋Š” ํ•™์Šต ๊ณผ์ •์˜ ๋ชจ๋“  ์ •๋ณด๊ฐ€ ์‚ฌ์šฉ๋  ์ˆ˜ ์žˆ์œผ๋ฉฐ, ๋Œ€์ฒด๋กœ ํ•ด์˜ ํ–ฅ์ƒ๊ณผ ์ง์ ‘์ ์ธ ๊ด€๋ จ์ด ์žˆ๋Š” ์ ํ•ฉ๋„๋ฅผ ์ด์šฉํ•œ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ๋‹ค์–‘ํ•œ ๋ฌธ์ œ๋ฅผ ์ด์šฉํ•˜์—ฌ ์ •ํ™•๋„์™€ ๊ตฌ์กฐ์— ๊ด€๋ จ๋œ ๋‘ ์ง€ํ‘œ๋ฅผ ์˜ํ–ฅ๋ ฅ ํ‰๊ฐ€์— ์ด์šฉํ•ด๋ณด์•˜๋‹ค. ํ•œํŽธ์œผ๋กœ ๊ฐ™์€ ์ •๋ณด๋ฅผ ์ด์šฉํ•˜๋”๋ผ๋„ ๊ทธ๊ฒƒ์„ ํ™œ์šฉํ•˜๋Š” ๋ฐฉ๋ฒ•์— ๋”ฐ๋ผ ์ธก์ •๋˜๋Š” ์˜ํ–ฅ๋ ฅ์ด ๋‹ฌ๋ผ์ง€๋Š”๋ฐ, ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ์ž‘์€ ๋ณ€ํ™”๋ฅผ ํ†ตํ•ด์„œ๋„ ํฐ ์„ฑ๋Šฅ ๋ณ€ํ™”๋ฅผ ์•ผ๊ธฐ์‹œํ‚ฌ ์ˆ˜ ์žˆ๋Š” ์˜ํ–ฅ๋ ฅ ํ‰๊ฐ€ ๋ฐฉ๋ฒ•์„ ๋ช‡๊ฐ€์ง€ ์†Œ๊ฐœํ•œ๋‹ค. ๋งˆ์ง€๋ง‰์œผ๋กœ ์ ์‘ ์—ฐ์‚ฐ์ž ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ์‹ค์ œ ๋ฌธ์ œ์— ์ ์šฉํ•จ์œผ๋กœ์จ ์œ ์šฉ์„ฑ์„ ํ™•์ธํ•˜์˜€๋‹ค. ์ด๋ฅผ ์œ„ํ•ด ์‚ฌ์šฉ๋œ ์‹ค์ œ ๋ฌธ์ œ๋Š” ๋‚™๋™๊ฐ•์˜ ๋…น์กฐ ํ˜„์ƒ์— ๋Œ€ํ•œ ์˜ˆ์ธก์œผ๋กœ, ๋‚™๋™๊ฐ•์˜ ์ƒํƒœ ์‹œ์Šคํ…œ์„ ๋ฌ˜์‚ฌํ•˜๊ณ  ์˜ˆ์ธกํ•˜๋Š” ๋ชจ๋ธ์„ ๊ฐœ๋ฐœํ•˜๋Š” ๊ฒƒ์„ ๋ชฉ์ ์œผ๋กœ ํ•œ๋‹ค. 2๊ฐ€์ง€ ์—ฐ๊ตฌ๋ฅผ ํ†ตํ•ด ์œ ์šฉ์„ฑ์„ ํ™•์ธํ•˜์˜€๋‹ค. ์šฐ์„  ์ „๋ฌธ๊ฐ€์— ์˜ํ•ด ๋งŒ๋“ค์–ด์ง„ ๊ธฐ๋ณธ ๋ชจ๋ธ์„ ๋ฐ”ํƒ•์œผ๋กœ, ์œ ์ „ ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ด์šฉํ•˜์—ฌ ๋ชจ๋ธ์˜ ํŒŒ๋ผ๋ฏธํ„ฐ๋ฅผ ์ตœ์ ํ™” ํ•˜์˜€๊ณ , ๊ทธ๋ฆฌ๊ณ  TAG3P๋ฅผ ์ด์šฉํ•˜์—ฌ ๊ธฐ๋ณธ ๋ชจ๋ธ์˜ ํ™•์žฅํ•˜๊ณ  ์ด๋ฅผ ํ†ตํ•ด ์ƒˆ๋กœ์šด ๋ชจ๋ธ์„ ๋งŒ๋“ค์–ด ๋ณด์•˜๋‹ค.Genetic programming (GP) is an effective evolutionary algorithm for many problems, especially suited to model learning. GP has many parameters, usually defined by the user according to the problem. The performance of GP is sensitive to their values. Parameter setting has been a major focus of study in evolutionary computation. However there is still no general guideline for choosing efficient settings. The usual method for parameter setting is trial and error. The method used in this thesis, adaptive operator mechanism, replaces the user's action in setting rates of application of genetic operators. adaptive operator mechanism autonomously controls the genetic operators during a run. This thesis extends adaptive operator mechanism to genetic programming, applying existing adaptive operator algorithms and developing them for TAG3P, a grammar-guided GP which supports a wide variety of useful genetic operators. Existing adaptive operator selection algorithms are successfully applied to TAG3P1 Introduction 1 1.1 Background and Motivation 1 1.2 Our Approach and Its Contributions 2 1.3 Outline 4 2 Related Works 5 2.1 Evolutionary Algorithms 5 2.1.1 Genetic Algorithm 5 2.1.2 Genetic Programming 8 2.1.3 Tree Adjoining Grammar based Genetic Programming 9 3 Adaptive Mechanism and Adaptive Operator Selection 16 3.1 Adaptive Mechanism 16 3.2 Adaptive Operator Selection 18 3.2.1 Operator Selection 18 3.2.2 Evaluation of Operator Impact 19 3.3 Algorithms of Adaptive Operator Selection 20 3.3.1 Probability Matching 21 3.3.2 Adaptive Pursuit 22 3.3.3 Multi-Armed Bandits 25 4 Preliminary Experiment for Adaptive Operator Mechanism 28 4.1 Test Problems 28 4.2 Experimental Design 30 4.2.1 Search Space 31 4.2.2 General Parameter Settings 32 4.3 Results and Discussion 34 5 Operator Selection 39 5.1 Operator Selection Algorithms for GP 39 5.1.1 Powered Probability Matching 39 5.1.2 Adaptive Probability Matching 41 5.1.3 Recursive Adaptive Pursuit 41 5.2 Experiments and Results 43 5.2.1 Test Problems 43 5.2.2 Experimental Design 44 5.2.3 Results and Discussion 46 6 Evaluation of Operator Impact 56 6.1 Rates for the Amount of Individual Usage 57 6.1.1 Denition of Rates for the Amount of Individual Usage 57 6.1.2 Results and Discussion 58 6.2 Ratio for the Improvement of Fitness 63 6.2.1 Pairs and Group 64 6.2.2 Ratio and Children Fitness 65 6.2.3 Experimental Design 65 6.2.4 Result and Discussion 66 6.3 Ranking Point 73 6.3.1 Denition of Ranking Point 73 6.3.2 Experimental Design 74 6.3.3 Result and Discussion 74 6.4 Pre-Search Structure 76 6.4.1 Denition of Pre-Search Structure 76 6.4.2 Preliminary Experiment for Sampling 78 6.4.3 Experimental Design 82 6.4.4 Result and Discussion 83 7 Application: Nakdong River Modeling 85 7.1 Problem Description 85 7.1.1 Outline 85 7.1.2 Data Description 86 7.1.3 Model Description 88 7.1.4 Methods 93 7.2 Results 97 7.2.1 Parameter Optimization 97 7.2.2 Modeling 101 7.3 Summary 103 8 Conclusion 104 8.1 Summary 104 8.2 Future Works 108Docto

    A Study of the Impact of Interaction Mechanisms and Population Diversity in Evolutionary Multiagent Systems

    Full text link
    In the Evolutionary Computation (EC) research community, a major concern is maintaining optimal levels of population diversity. In the Multiagent Systems (MAS) research community, a major concern is implementing effective agent coordination through various interaction mechanisms. These two concerns coincide when one is faced with Evolutionary Multiagent Systems (EMAS). This thesis demonstrates a methodology to study the relationship between interaction mechanisms, population diversity, and performance of an evolving multiagent system in a dynamic, real-time, and asynchronous environment. An open sourced extensible experimentation platform is developed that allows plug-ins for evolutionary models, interaction mechanisms, and genotypical encoding schemes beyond the one used to run experiments. Moreover, the platform is designed to scale arbitrarily large number of parallel experiments in multi-core clustered environments. The main contribution of this thesis is better understanding of the role played by population diversity and interaction mechanisms in the evolution of multiagent systems. First, it is shown, through carefully planned experiments in three different evolutionary models, that both interaction mechanisms and population diversity have a statistically significant impact on performance in a system of evolutionary agents coordinating to achieve a shared goal of completing problems in sequential task domains. Second, it is experimentally verified that, in the sequential task domain, a larger heterogeneous population of limited-capability agents will evolve to perform better than a smaller homogeneous population of full-capability agents, and performance is influenced by the ways in which the agents interact. Finally, two novel trait-based population diversity levels are described and are shown to be effective in their applicability

    An Analysis of Diversity in Genetic Programming

    Get PDF
    Genetic programming is a metaheuristic search method that uses a population of variable-length computer programs and a search strategy based on biological evolution. The idea of automatic programming has long been a goal of artificial intelligence, and genetic programming presents an intuitive method for automatically evolving programs. However, this method is not without some potential drawbacks. Search using procedural representations can be complex and inefficient. In addition, variable sized solutions can become unnecessarily large and difficult to interpret. The goal of this thesis is to understand the dynamics of genetic programming that encourages efficient and effective search. Toward this goal, the research focuses on an important property of genetic programming search: the population. The population is related to many key aspects of the genetic programming algorithm. In this programme of research, diversity is used to describe and analyse populations and their effect on search. A series of empirical investigations are carried out to better understand the genetic programming algorithm. The research begins by studying the relationship between diversity and search. The effect of increased population diversity and a metaphor of search are then examined. This is followed by an investigation into the phenomenon of increased solution size and problem difficulty. The research concludes by examining the role of diverse individuals, particularly the ability of diverse individuals to affect the search process and ways of improving the genetic programming algorithm. This thesis makes the following contributions: (1) An analysis shows the complexity of the issues of diversity and the relationship between diversity and fitness, (2) The genetic programming search process is characterised by using the concept of genetic lineages and the sampling of structures and behaviours, (3) A causal model of the varied rates of solution size increase is presented, (4) A new, tunable problem demonstrates the contribution of different population members during search, and (5) An island model is proposed to improve the search by speciating dissimilar individuals into better-suited environments. Currently, genetic programming is applied to a wide range of problems under many varied contexts. From artificial intelligence to operations research, the results presented in this thesis will benefit population-based search methods, methods based on the concepts of evolution and search methods using variable-length representations
    corecore