2,240 research outputs found

    Embodied Evolution in Collective Robotics: A Review

    Full text link
    This paper provides an overview of evolutionary robotics techniques applied to on-line distributed evolution for robot collectives -- namely, embodied evolution. It provides a definition of embodied evolution as well as a thorough description of the underlying concepts and mechanisms. The paper also presents a comprehensive summary of research published in the field since its inception (1999-2017), providing various perspectives to identify the major trends. In particular, we identify a shift from considering embodied evolution as a parallel search method within small robot collectives (fewer than 10 robots) to embodied evolution as an on-line distributed learning method for designing collective behaviours in swarm-like collectives. The paper concludes with a discussion of applications and open questions, providing a milestone for past and an inspiration for future research.Comment: 23 pages, 1 figure, 1 tabl

    Information driven self-organization of complex robotic behaviors

    Get PDF
    Information theory is a powerful tool to express principles to drive autonomous systems because it is domain invariant and allows for an intuitive interpretation. This paper studies the use of the predictive information (PI), also called excess entropy or effective measure complexity, of the sensorimotor process as a driving force to generate behavior. We study nonlinear and nonstationary systems and introduce the time-local predicting information (TiPI) which allows us to derive exact results together with explicit update rules for the parameters of the controller in the dynamical systems framework. In this way the information principle, formulated at the level of behavior, is translated to the dynamics of the synapses. We underpin our results with a number of case studies with high-dimensional robotic systems. We show the spontaneous cooperativity in a complex physical system with decentralized control. Moreover, a jointly controlled humanoid robot develops a high behavioral variety depending on its physics and the environment it is dynamically embedded into. The behavior can be decomposed into a succession of low-dimensional modes that increasingly explore the behavior space. This is a promising way to avoid the curse of dimensionality which hinders learning systems to scale well.Comment: 29 pages, 12 figure

    Evolutionary Learning of Fuzzy Rules for Regression

    Get PDF
    The objective of this PhD Thesis is to design Genetic Fuzzy Systems (GFS) that learn Fuzzy Rule Based Systems to solve regression problems in a general manner. Particularly, the aim is to obtain models with low complexity while maintaining high precision without using expert-knowledge about the problem to be solved. This means that the GFSs have to work with raw data, that is, without any preprocessing that help the learning process to solve a particular problem. This is of particular interest, when no knowledge about the input data is available or for a first approximation to the problem. Moreover, within this objective, GFSs have to cope with large scale problems, thus the algorithms have to scale with the data

    POWERPLAY: Training an Increasingly General Problem Solver by Continually Searching for the Simplest Still Unsolvable Problem

    Get PDF
    Most of computer science focuses on automatically solving given computational problems. I focus on automatically inventing or discovering problems in a way inspired by the playful behavior of animals and humans, to train a more and more general problem solver from scratch in an unsupervised fashion. Consider the infinite set of all computable descriptions of tasks with possibly computable solutions. The novel algorithmic framework POWERPLAY (2011) continually searches the space of possible pairs of new tasks and modifications of the current problem solver, until it finds a more powerful problem solver that provably solves all previously learned tasks plus the new one, while the unmodified predecessor does not. Wow-effects are achieved by continually making previously learned skills more efficient such that they require less time and space. New skills may (partially) re-use previously learned skills. POWERPLAY's search orders candidate pairs of tasks and solver modifications by their conditional computational (time & space) complexity, given the stored experience so far. The new task and its corresponding task-solving skill are those first found and validated. The computational costs of validating new tasks need not grow with task repertoire size. POWERPLAY's ongoing search for novelty keeps breaking the generalization abilities of its present solver. This is related to Goedel's sequence of increasingly powerful formal theories based on adding formerly unprovable statements to the axioms without affecting previously provable theorems. The continually increasing repertoire of problem solving procedures can be exploited by a parallel search for solutions to additional externally posed tasks. POWERPLAY may be viewed as a greedy but practical implementation of basic principles of creativity. A first experimental analysis can be found in separate papers [53,54].Comment: 21 pages, additional connections to previous work, references to first experiments with POWERPLA

    Data-driven System Identification and Optimal Control Framework for Grand-Prix Style Autonomous Racing

    Get PDF
    For the past 30 years, autonomous driving has witnessed a tremendous improvements thanks to the surge of computing power. Not only did we witness the autonomous vehicle navigate itself safely in the urban area, stories about more diverse autonomous driving applications, such as off-road rally-style navigation, are also commonly mentioned. Just until recently, the exponential increase in GPU and high-performance computing technology has motivated the research on autonomous driving under extreme situations such as autonomous racing or drifting.[25] The motivation for this thesis is to offer a brief overview about the main challenge of autonomous driving control and planning in racing scenario along with the potential solutions. The first contribution is using koopmam operator and deep neural network to perform data-driven system identification. We then design optimal model-based control which is based on the learned dynamics alone. Based on our new system identification algorithm, we can approximate an accurate, explainable, and linearized system representation in a high-dimensional latent space, without any prior knowledge of the system. In this case, the learned vehicle dynamic automatically involves the information that is normally difficult to obtain, including cornering stiffness, tire slip, transmission parameters, etc. Our result shows that our koopman data-driven optimal control approach is able to deliver better tracking accuracy at high speed compared to the state-of-art vehicle controllers. The second contribution is an iterative learning and sampling algorithm that can perform minimum-time optimization of the global racing trajectory(aka racing line) within the limit of tire friction. This trajectory optimization algorithm is not only proven to be computationally efficient, but also safe enough for the onboard RC vehicle’s test. The research achievements we made for the last two years not only enables the F1TENTH racing team of Clemson University Mechanical Engineering Department to finish top 5 in both virtual autonomous racing hosted by IFAC and IROS congress, but also offer us the opportunity to join ICRA 2021 Autonomous racing workshop to present our work and being awarded the joint best paper. More importantly, these contributions proved to be functional and effective in the on-board testing of the real F1TENTH robot’s autonomous navigation in the Flour Danial basement. Finally, this thesis will also include discussions of the potential research directions that can help improve the our current method so that it can better contribute to the autonomous driving industry

    SwarMAV: A Swarm of Miniature Aerial Vehicles

    Get PDF
    As the MAV (Micro or Miniature Aerial Vehicles) field matures, we expect to see that the platform's degree of autonomy, the information exchange, and the coordination with other manned and unmanned actors, will become at least as crucial as its aerodynamic design. The project described in this paper explores some aspects of a particularly exciting possible avenue of development: an autonomous swarm of MAVs which exploits its inherent reliability (through redundancy), and its ability to exchange information among the members, in order to cope with a dynamically changing environment and achieve its mission. We describe the successful realization of a prototype experimental platform weighing only 75g, and outline a strategy for the automatic design of a suitable controller

    Data-Driven Passivity-Based Control of Underactuated Robotic Systems

    Get PDF
    Classical control strategies for robotic systems are based on the idea that feedback control can be used to override the natural dynamics of the machines. Passivity-based control (Pbc) is a branch of nonlinear control theory that follows a similar approach, where the natural dynamics is modified based on the overall energy of the system. This method involves transforming a nonlinear control system, through a suitable control input, into another fictitious system that has desirable stability characteristics. The majority of Pbc techniques require the discovery of a reasonable storage function, which acts as a Lyapunov function candidate that can be used to certify stability. There are several challenges in the design of a suitable storage function, including: 1) what a reasonable choice for the function is for a given control system, and 2) the control synthesis requires a closed-form solution to a set of nonlinear partial differential equations. The latter is in general difficult to overcome, especially for systems with high degrees of freedom, limiting the applicability of Pbc techniques. A machine learning framework that automatically determines the storage function for underactuated robotic systems is introduced in this dissertation. This framework combines the expressive power of neural networks with the systematic methods of the Pbc paradigm, bridging the gap between controllers derived from learning algorithms and nonlinear control theory. A series of experiments demonstrates the efficacy and applicability of this framework for a family of underactuated robots
    • …
    corecore