15 research outputs found

    Evaluation of Word Representations in Grounding Natural Language Instructions through Computational Human-Robot Interaction

    Get PDF
    International audienceIn order to interact with people in a natural way, a robot must be able to link words to objects and actions. Although previous studies in the literature have investigated grounding, they did not consider grounding of unknown synonyms. In this paper, we introduce a probabilistic model for grounding unknown synonymous object and action names using cross-situational learning. The proposed Bayesian learning model uses four different word representations to determine synonymous words. Afterwards, they are grounded through geometric characteristics of objects and kinematic features of the robot joints during action execution. The proposed model is evaluated through an interaction experiment between a human tutor and HSR robot. The results show that semantic and syntactic information enable grounding of unknown synonyms and that the combination of both achieves the best grounding

    The Ecology of Open-Ended Skill Acquisition: Computational framework and experiments on the interactions between environmental, adaptive, multi-agent and cultural dynamics

    Get PDF
    An intriguing feature of the human species is our ability to continuously invent new problems and to proactively acquiring new skills in order to solve them: what is called open-ended skill acquisition (OESA). Understanding the mechanisms underlying OESA is an important scientific challenge in both cognitive science (e.g. by studying infant cognitive development) and in artificial intelligence (aiming at computational architectures capable of open-ended learning). Both fields, however, mostly focus on cognitive and social mechanisms at the scale of an individual’s life. It is rarely acknowledged that OESA, an ability that is fundamentally related to the characteristics of human intelligence, has been necessarily shaped by ecological, evolutionary and cultural mechanisms interacting at multiple spatiotemporal scales. In this thesis, I present a research program aiming at understanding, modelingand simulating the dynamics of OESA in artificial systems, grounded in theories studying its eco-evolutionary bases in the human species. It relies on a conceptual framework expressing the complex interactions between environmental, adaptive, multi-agent and cultural dynamics. Three main research questions are developed and I present a selection of my contributions for each of them.- What are the ecological conditions favoring the evolution of skill acquisition?- How to bootstrap the formation of a cultural repertoire in populations of adaptive agents?- What is the role of cultural evolution in the open-ended dynamics of human skill acquisition?By developing these topics, we will reveal interesting relationships between theories in human evolution and recent approaches in artificial intelligence. This will lead to the proposition of a humanist perspective on AI: using it as a family of computational tools that can help us to explore and study the mechanisms driving open-ended skill acquisition in both artificial and biological systems, as a way to better understand the dynamics of our own species within its whole ecological context. This document presents an overview of my scientific trajectory since the start of my PhD thesis in 2007, the detail of my current research program, a selection of my contributions as well as perspectives for future work

    Self-Organization of Early Vocal Development in Infants and Machines: The Role of Intrinsic Motivation

    Get PDF
    International audienceWe bridge the gap between two issues in infant development: vocal development and intrinsic motivation. We propose and experimentally test the hypothesis that general mechanisms of intrinsically motivated spontaneous exploration, also called curiosity-driven learning, can self-organize developmental stages during early vocal learning. We introduce a computational model of intrinsically motivated vocal exploration, which allows the learner to autonomously structure its own vocal experiments, and thus its own learning schedule, through a drive to maximize competence progress. This model relies on a physical model of the vocal tract, the auditory system and the agent's motor control as well as vocalizations of social peers. We present computational experiments that show how such a mechanism can explain the adaptive transition from vocal self-exploration with little influence from the speech environment, to a later stage where vocal exploration becomes influenced by vocalizations of peers. Within the initial self-exploration phase, we show that a sequence of vocal production stages self-organizes, and shares properties with data from infant developmental psychology: the vocal learner first discovers how to control phonation, then focuses on vocal variations of unarticulated sounds, and finally automatically discovers and focuses on babbling with articulated proto-syllables. As the vocal learner becomes more proficient at producing complex sounds, imitating vocalizations of peers starts to provide high learning progress explaining an automatic shift from self-exploration to vocal imitation

    Control strategies for cleaning robots in domestic applications: A comprehensive review:

    Get PDF
    Service robots are built and developed for various applications to support humans as companion, caretaker, or domestic support. As the number of elderly people grows, service robots will be in increasing demand. Particularly, one of the main tasks performed by elderly people, and others, is the complex task of cleaning. Therefore, cleaning tasks, such as sweeping floors, washing dishes, and wiping windows, have been developed for the domestic environment using service robots or robot manipulators with several control approaches. This article is primarily focused on control methodology used for cleaning tasks. Specifically, this work mainly discusses classical control and learning-based controlled methods. The classical control approaches, which consist of position control, force control, and impedance control , are commonly used for cleaning purposes in a highly controlled environment. However, classical control methods cannot be generalized for cluttered environment so that learning-based control methods could be an alternative solution. Learning-based control methods for cleaning tasks can encompass three approaches: learning from demonstration (LfD), supervised learning (SL), and reinforcement learning (RL). These control approaches have their own capabilities to generalize the cleaning tasks in the new environment. For example, LfD, which many research groups have used for cleaning tasks, can generate complex cleaning trajectories based on human demonstration. Also, SL can support the prediction of dirt areas and cleaning motion using large number of data set. Finally, RL can learn cleaning actions and interact with the new environment by the robot itself. In this context, this article aims to provide a general overview of robotic cleaning tasks based on different types of control methods using manipulator. It also suggest a description of the future directions of cleaning tasks based on the evaluation of the control approaches

    Sensorimotor input as a language generalisation tool: a neurorobotics model for generation and generalisation of noun-verb combinations with sensorimotor inputs

    Get PDF
    The paper presents a neurorobotics cognitive model explaining the understanding and generalisation of nouns and verbs combinations when a vocal command consisting of a verb-noun sentence is provided to a humanoid robot. The dataset used for training was obtained from object manipulation tasks with a humanoid robot platform; it includes 9 motor actions and 9 objects placing placed in 6 different locations), which enables the robot to learn to handle real-world objects and actions. Based on the multiple time-scale recurrent neural networks, this study demonstrates its generalisation capability using a large data-set, with which the robot was able to generalise semantic representation of novel combinations of noun-verb sentences, and therefore produce the corresponding motor behaviours. This generalisation process is done via the grounding process: different objects are being interacted, and associated, with different motor behaviours, following a learning approach inspired by developmental language acquisition in infants. Further analyses of the learned network dynamics and representations also demonstrate how the generalisation is possible via the exploitation of this functional hierarchical recurrent network

    A Curious Robot Learner for Interactive Goal-Babbling (Strategically Choosing What, How, When and from Whom to Learn)

    Get PDF
    Les dĂ© s pour voir des robots opĂ©rant dans l environnement de tous les jours des humains et sur unelongue durĂ©e soulignent l importance de leur adaptation aux changements qui peuvent ĂȘtre imprĂ©visiblesau moment de leur construction. Ils doivent ĂȘtre capable de savoir quelles parties Ă©chantillonner, et quelstypes de compĂ©tences il a intĂ©rĂȘt Ă  acquĂ©rir. Une maniĂšre de collecter des donnĂ©es est de dĂ©cider par soi-mĂȘme oĂč explorer. Une autre maniĂšre est de se rĂ©fĂ©rer Ă  un mentor. Nous appelons ces deux maniĂšresde collecter des donnĂ©es des modes d Ă©chantillonnage. Le premier mode d Ă©chantillonnage correspondĂ  des algorithmes dĂ©veloppĂ©s dans la littĂ©rature pour automatiquement pousser l agent vers des partiesintĂ©ressantes de l environnement ou vers des types de compĂ©tences utiles. De tels algorithmes sont appelĂ©sdes algorithmes de curiositĂ© arti cielle ou motivation intrinsĂšque. Le deuxiĂšme mode correspond au guidagesocial ou l imitation, oĂč un partenaire humain indique oĂč explorer et oĂč ne pas explorer.Nous avons construit une architecture algorithmique intrinsĂšquement motivĂ©e pour apprendre commentproduire par ses actions des e ets et consĂ©quences variĂ©es. Il apprend de maniĂšre active et en ligne encollectant des donnĂ©es qu il choisit en utilisant plusieurs modes d Ă©chantillonnage. Au niveau du metaapprentissage, il apprend de maniĂšre active quelle stratĂ©gie d Ă©chantillonnage est plus e cace pour amĂ©liorersa compĂ©tence et gĂ©nĂ©raliser Ă  partir de son expĂ©rience Ă  un grand Ă©ventail d e ets. Par apprentissage parinteraction, il acquiert de multiples compĂ©tences de maniĂšre structurĂ©e, en dĂ©couvrant par lui-mĂȘme lessĂ©quences dĂ©veloppementale.The challenges posed by robots operating in human environments on a daily basis and in the long-termpoint out the importance of adaptivity to changes which can be unforeseen at design time. The robot mustlearn continuously in an open-ended, non-stationary and high dimensional space. It must be able to knowwhich parts to sample and what kind of skills are interesting to learn. One way is to decide what to exploreby oneself. Another way is to refer to a mentor. We name these two ways of collecting data sampling modes.The rst sampling mode correspond to algorithms developed in the literature in order to autonomously drivethe robot in interesting parts of the environment or useful kinds of skills. Such algorithms are called arti cialcuriosity or intrinsic motivation algorithms. The second sampling mode correspond to social guidance orimitation where the teacher indicates where to explore as well as where not to explore. Starting fromthe study of the relationships between these two concurrent methods, we ended up building an algorithmicarchitecture with a hierarchical learning structure, called Socially Guided Intrinsic Motivation (SGIM).We have built an intrinsically motivated active learner which learns how its actions can produce variedconsequences or outcomes. It actively learns online by sampling data which it chooses by using severalsampling modes. On the meta-level, it actively learns which data collection strategy is most e cient forimproving its competence and generalising from its experience to a wide variety of outcomes. The interactivelearner thus learns multiple tasks in a structured manner, discovering by itself developmental sequences.BORDEAUX1-Bib.electronique (335229901) / SudocSudocFranceF

    Development of New Cotton Defoliation Sprayer Using Unmanned Ground Vehicle and Pulse Width Modulation Technology

    Get PDF
    Chemical spraying is one of the most important and frequently performed intercultural agriculture operations. It is imperative to utilize appropriate spraying technology as a selection of ineffective one leads to waste of agrochemicals to the non‐target area. Several precision technologies have been developed in the past few decades, such as image processing based on real‐time variable‐rate chemical spraying systems, autonomous chemical sprayers using machine vision and nozzle control, and use of unmanned aerial and ground vehicles. Cotton (Gossypium hirsutum L.) is an important industrial crop. It is a perennial crop with indeterminate growth habit; however, in most parts of the United States, it is grown as an annual crop and managed using growth regulators. Cotton defoliation is a natural physiological phenomenon, but untimely and/or inadequate defoliation by natural processes necessitates the application of chemical defoliants for efficient harvest. Defoliation is a major production practice influencing harvester efficiency, fiber trash content, cotton yield, and fiber quality. Currently, defoliant spraying is done by conventional ground driven boom sprayer or aerial applicator and both systems spray chemical vertically downwards into the canopy, which results in less chemical reaching the bottom of the canopy. Thus, a new autonomous ground sprayer was developed using robotics and pulse width modulation, which travels between two rows covering the whole canopy of the plant. Field research was conducted to evaluate the (i) effect of duty cycles (20%,40%, and 60%) on droplet characteristic (droplet distribution, deposition, and drift potential), defoliation cotton fiber and (ii) effect of duty cycles on cotton yield and II fiber quality. Droplet characteristics (droplet distribution, density, and potential droplet drift) were non-significant across the treatments and results from the water‐sensitive paper field test showed adequate penetration with low flow rates. Therefore, a 20% duty cycle was sufficient to defoliate based on the result of the field experiment. Likewise, the defoliants could be applied safely at the duty cycles tested without influencing fiber quality except for nep/gm, length (Ln), L (5%), short fiber content (SFCn), trash content in field 1 and micronaire, nep size, length (Ln), span length (5%), SFC, and fiber fineness in field 2 which were significant. However, the 20% duty cycle significantly reduced the amount of defoliant and would be a good choice for the autonomous cotton defoliation. This is a significant development as there is a huge potential to save on the cost of applying defoliant chemicals and the environment

    Vers un modÚle biologiquement plausible de la sélection de l'action pour un robot mobile

    Get PDF
    This thesis aims at studying the different mechanisms involved in action selection and de-cision making processes, according to animal experiments and neurobiological recordings. Forthat matter, we propose several biologically plausible models for action selection. The goal is toachieve a better understanding of the animal’s brain functions. This gives us the opportunity todevelop bioinspired control architectures for robots that are more robust and adaptative to a realenvironement. These models are based on Artificial Neural Networks, allowing us to test ourhypotheses on simulations of different brain regions and function, implemented on robots andvirtual agents.Action selection for mobile robots can be approached from different angles. This processcan be seen as the selection between two possibilities, e.g. go left or go right. Those mechanismsinvolve the ability to learn and categorize specific events, encoding contexts where a change inthe perception is perceived, a change in the behavior is noticed or the decision is made. There-fore, this thesis studies those capacities of acquisition, categorisation and coding of differentevents that can be relevant for action selection.We also, approach the action selection as a strategy selection. The different behaviors areguided consciously or through automated behavior learned as habits. We investigate differentpossibilities allowing a robot to develop those capacities. Also, we aim at studying interactionsthat can emerge between those mechanisms during navigational behaviors.The work presented in this these is based on the modelisation of the hippocampo-cotico-basal loops involved in the navigational behaviors, the action selection and the multimodal cat-egorisation of events. We base our models on a previous model of the hippocampus for thelearning of spatio-temporal associations and for multimodal conditionning of perceptive events.It is based on sensorimotor associations between place cells and actions to achieve navigationalbehaviors. The model involves also a specific type of hippocampic cells, named transition cells,for temporal prediction of future events. This capacity allows the model to learn spatio-temporalsequences, and it represents the neural substrate for the learning of a cognitive map, hypoth-esised to be localized in prefrontal and/or parietal areas. This kind of topological map allowsto plan the behavior of the robot according to its motivations, which is used in goal orientedexperiments to achieve goals and capture rewardsCette thĂšse Ă©tudie les mĂ©canismes de sĂ©lection de l’action et de choix de stratĂ©gie tels qu’ilsapparaissent Ă  travers des expĂ©riences animales et des enregistrements neurobiologiques. Nousproposons ensuite des modĂšles biologiquement plausibles de la sĂ©lection de l’action. L’objectifest de mieux comprendre le fonctionnement du cerveau chez les ĂȘtres vivants et de pouvoir endĂ©duire des architectures de contrĂŽle bio-inspirĂ©es, plus robustes et adaptĂ©es Ă  l’environnement.Les modĂšles Ă©tudiĂ©s sont rĂ©alisĂ©s avec des rĂ©seaux de neurones artificiels, permettant de mod-Ă©liser des rĂ©gions cĂ©rĂ©brales et ainsi pouvoir simuler le fonctionnement du cerveau, ce qui permetde tester nos hypothĂšses sur des robots et des agents virtuels.L’étude de la sĂ©lection de l’action pour des robots mobiles implique plusieurs approches. LasĂ©lection de l’action peut ĂȘtre Ă©tudiĂ©e du point de vue du choix entre plusieurs actions basiques,e.g. un choix binaire aller Ă  gauche ou Ă  droite.Ceci passe forcĂ©ment par l’acquisition et la catĂ©gorisation d’instants et d’évĂ©nements spĂ©ciaux,perçus ou effectuĂ©s, qui reprĂ©sentent des contextes dans lesquels la perception change, le com-portement est modifiĂ© ou bien la sĂ©lection est rĂ©alisĂ©e. Ainsi, la thĂšse traite aussi de l’acquisition,la catĂ©gorisation et l’encodage de ces Ă©vĂ©nements importants dans la sĂ©lection de l’action.Enfin, on s’intĂ©ressera Ă  la sĂ©lection de l’action du point de vue de la sĂ©lection de stratĂ©gie.Les diffĂ©rents comportements peuvent ĂȘtre dirigĂ©s consciemment ou bien ĂȘtre des automatismesacquis avec l’habitude. Le but ici est d’explorer diffĂ©rentes approches pour que le robot puissedĂ©velopper ces deux capacitĂ©s, mais aussi d’étudier les interactions entre ces types de mĂ©can-ismes dans la cadre de tĂąches de navigation.Les travaux de cette thĂšse se basent sur la modĂ©lisation du fonctionnement de diffĂ©rentesboucles hippocampo-cortico-basales impliquĂ©es dans des tĂąches de navigation, de sĂ©lection del’action et de catĂ©gorisations multimodales. En particulier, nous avons un modĂšle de l’hippocampepermettant d’apprendre des associations spatio-temporelles et des conditionnements multimodauxentre des Ă©vĂ©nements perceptifs. Il se base sur des associations sensorimotrices entre des cellulesappelĂ©es cellules de lieu qui sont associĂ©es avec des actions pour dĂ©finir des comportements co-hĂ©rents. Le modĂšle fait aussi intervenir des cellules de transition hippocampiques, permettant defaire des prĂ©dictions temporelles sur les Ă©vĂ©nements futurs. Celles-ci permettent l’apprentissagede sĂ©quences spatio-temporelles, notamment du fait qu’elles reprĂ©sentent le substrat neuronal Ă l’apprentissage d’une carte cognitive, situĂ©e elle au niveau du cortex prĂ©frontal et/ou pariĂ©tal.Ce type de carte permet de planifier des chemins Ă  suivre en fonction des motivations du robot,ce qui permet de rejoindre diffĂ©rents buts prĂ©cĂ©demment dĂ©couverts dans l’environnement
    corecore