3,804 research outputs found

    Learning and Composing Primitive Skills for Dual-Arm Manipulation

    Get PDF
    In an attempt to confer robots with complex manipulation capabilities, dual-arm anthropomorphic systems have become an important research topic in the robotics community. Most approaches in the literature rely upon a great understanding of the dynamics underlying the system's behaviour and yet offer limited autonomous generalisation capabilities. To address these limitations, this work proposes a modelisation for dual-arm manipulators based on dynamic movement primitives laying in two orthogonal spaces. The modularity and learning capabilities of this model are leveraged to formulate a novel end-to-end learning-based framework which (i) learns a library of primitive skills from human demonstrations, and (ii) composes such knowledge simultaneously and sequentially to confront novel scenarios. The feasibility of the proposal is evaluated by teaching the iCub humanoid the basic skills to succeed on simulated dual-arm pick-and-place tasks. The results suggest the learning and generalisation capabilities of the proposed framework extend to autonomously conduct undemonstrated dual-arm manipulation tasks.Comment: Annual Conference Towards Autonomous Robotic Systems (TAROS19

    ๊ณ„์ธต์  ๊ฐ•ํ™”ํ•™์Šต์„ ํ†ตํ•œ ์–‘ํŒ”๋กœ๋ด‡ ๋งค๋‹ˆํ“ฐ๋ ˆ์ด์…˜

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ(์„์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ๊ธฐ๊ณ„ํ•ญ๊ณต๊ณตํ•™๋ถ€, 2021.8. ๋ฐ•๋ชฉ์ธ.๊ฐ•ํ™”ํ•™์Šต์€ ๋ณต์žกํ•œ ์›€์ง์ž„์„ ๋งŒ๋“ค์–ด๋‚ผ ์ˆ˜ ์žˆ๋Š” ๊ฐ•๋ ฅํ•œ ๋„๊ตฌ์ด๋‹ค. ํ•˜์ง€๋งŒ ์–‘ํŒ”๋กœ๋ด‡์ด ๊ฐ„๋‹จํ•œ ํ˜•์‹์˜ ๋ณด์ƒ ๊ธฐ๋ฐ˜ ๊ฐ•ํ™”ํ•™์Šต ๊ธฐ๋ฒ•์œผ๋กœ ์ˆœ์ฐจ์ ์ธ ๋งค๋‹ˆํ“ฐ๋ ˆ์ด์…˜ ์ž‘์—…์„ ์ˆ˜ํ–‰ํ•˜๊ธฐ์œ„ํ•œ ๊ธฐ์ˆ ๋“ค์„ ๋ฐฐ์šฐ๋Š”๋ฐ์—๋Š” ์—ฌ์ „ํžˆ ๋งŽ์€ ์–ด๋ ค์›€์ด ์กด์žฌํ•œ๋‹ค. ํŠนํžˆ๋‚˜, ๋กœ๋ด‡์€ ์ž‘์—… ์ˆœ์„œ์˜ ์กฐํ•ฉ๋ฐฉ์‹์— ๋Œ€ํ•ด ์•Œ์•„์•ผ ํ•  ๋ฟ๋งŒ ์•„๋‹ˆ๋ผ ํ™˜๊ฒฝ๋‚ด์˜ ์žฅ์• ๋ฌผ๊ณผ ๊ฐ™์€ ๋ฐฉํ•ด์š”์†Œ๋“ค์„ ํ”ผํ•˜๋Š” ๋ฐฉ๋ฒ•๋„ ์•Œ์•„์•ผํ•œ๋‹ค. ์ด๋Ÿฌํ•œ ๋ฌธ์ œ๋ฅผ ๋‹ค๋ฃจ๊ธฐ ์œ„ํ•ด, ์‹ค์ œ ๋„๋‹ฌ๊ฐ€๋Šฅํ•œ ์ƒํƒœ์™€ ์ถ”์ •๋œ ๋„๋‹ฌ๊ฐ€๋Šฅํ•œ ์ƒํƒœ๊ฐ„์˜ ์ฐจ์ด๋ฅผ ์ธก์ •ํ•˜๋Š” ๊ฒฉ์ฐจํ•จ์ˆ˜๋ฅผ ์ œ์•ˆํ•˜์˜€์œผ๋ฉฐ, ๊ฒฉ์ฐจํ•จ์ˆ˜๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ์ฃผ์–ด์ง„ ๋ชฉํ‘œ์ง€์ ์˜ ๋„๋‹ฌ๊ฐ€๋Šฅ์„ฑ ์—ฌ๋ถ€์— ๋Œ€ํ•œ ๋ณด์ˆ˜์ ์ธ ์ •๋ณด๋ฅผ ์ œ๊ณตํ•ด์ฃผ๋Š” ๊ฒฝ๋กœ๊ณ„ํš๋งต์„ ๊ตฌ์„ฑํ•˜์˜€๋‹ค. ์ด๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ๋ณต์žกํ•œ ํ™˜๊ฒฝ์—์„œ์˜ ์ˆœ์ฐจ์  ๋งค๋‹ˆํ“ฐ๋ ˆ์ด์…˜ ๋ฌธ์ œ์— ์ ์šฉ๊ฐ€๋Šฅํ•œ ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ž‘์—…์ˆœ์„œํ•™์Šต๊ธฐ๋ฒ•๊ณผ ํ•จ๊ป˜ ๊ตฌ์„ฑํ•˜์˜€์œผ๋ฉฐ, ์‹ ๋ขฐ์„ฑ์žˆ๊ณ  ์•ˆ์ „ํ•˜๊ฒŒ ๋„๋‹ฌ๊ฐ€๋Šฅํ•œ ์ƒํƒœ๋“ค์„ ์ƒ˜ํ”Œ๋งํ•˜๋Š” ์ธก๋ฉด์—์„œ ์ œ์•ˆํ•œ ๋ฐฉ๋ฒ•์ด ๊ฐœ์„ ์„ ๊ฐ€์ ธ์˜ด์„ ํ™•์ธํ–ˆ๊ณ , ํ˜„์‹ค์ ์ธ ์ƒํ™ฉ์— ์ ์šฉ๊ฐ€๋Šฅํ•จ์„ ๋ณด์˜€๋‹ค.Reinforcement Learning (RL) is a powerful tool for acquiring complex skills. However, it is still difficult for a dual-arm robot to acquire skills for sequential manipulation tasks with a simple reward-based RL approach. Specifically, the robot needs to know not only how to compose sequences of tasks but also how to avoid the interrupting elements in the environment. To address this problem, we propose a discrepancy function that estimates the discrepancy between estimated reachable state and truly reachable state. Then, a planning map that provides conservative information about whether a given goal state is reachable or not is constructed by the discrepancy function. Combining these with task sequence learning, we develop an algorithm that is applicable to complex sequential manipulation problems in a cluttered environment. We find that our method provides an improvement in sampling reliable and safe reachable states in various environments and show that it is applicable to the realistic setting.1 Introduction 1 1.1 Thesis contribution 3 1.2 Thesis outline 4 2 Related works 5 3 Preliminary 7 3.1 Temporal di erence model (TDM-SAC) 7 3.2 Task sequence learning 10 4 Method 12 4.1 Discrepancy function 12 4.2 Planning map 13 4.3 Reduced function 14 5 Experiments 16 5.1 Simulation and training setup 16 5.2 TDM-SAC pre-training results 18 5.2.1 Value function 18 5.2.2 Discrepancy function 18 5.2.3 Planning map 18 5.2.4 Sampling quality 20 5.3 Training results 21 6 Conclusion and future works 26์„

    Multi-expert synthesis for versatile locomotion and manipulation skills

    Get PDF
    This work focuses on generating multiple coordinated motor skills for intelligent systems and studies a Multi-Expert Synthesis (MES) approach to achieve versatile robotic skills for locomotion and manipulation. MES embeds and uses expert skills to solve new composite tasks, and is able to synthesise and coordinate different and multiple skills smoothly. We proposed essential and effective design guidelines for training successful MES policies in simulation, which were deployed on both floating- and fixed-base robots. We formulated new algorithms to systematically determine task-relevant state variables for each individual experts which improved robustness and learning efficiency, and an explicit enforcement objective to diversify skills among different experts. The capabilities of MES policies were validated in both simulation and real experiments for locomotion and bi-manual manipulation. We demonstrated that the MES policies achieved robust locomotion on the quadruped ANYmal by fusing the gait recovery and trotting skills. For object manipulation, the MES policies learned to first reconfigure an object in an ungraspable pose and then grasp it through cooperative dual-arm manipulation

    Behaviour-driven motion synthesis

    Get PDF
    Heightened demand for alternatives to human exposure to strenuous and repetitive labour, as well as to hazardous environments, has led to an increased interest in real-world deployment of robotic agents. Targeted applications require robots to be adept at synthesising complex motions rapidly across a wide range of tasks and environments. To this end, this thesis proposes leveraging abstractions of the problem at hand to ease and speed up the solving. We formalise abstractions to hint relevant robotic behaviour to a family of planning problems, and integrate them tightly into the motion synthesis process to make real-world deployment in complex environments practical. We investigate three principal challenges of this proposition. Firstly, we argue that behavioural samples in form of trajectories are of particular interest to guide robotic motion synthesis. We formalise a framework with behavioural semantic annotation that enables the storage and bootstrap of sets of problem-relevant trajectories. Secondly, in the core of this thesis, we study strategies to exploit behavioural samples in task instantiations that differ significantly from those stored in the framework. We present two novel strategies to efficiently leverage offline-computed problem behavioural samples: (i) online modulation based on geometry-tuned potential fields, and (ii) experience-guided exploration based on trajectory segmentation and malleability. Thirdly, we demonstrate that behavioural hints can be extracted on-the-fly to tackle highlyconstrained, ever-changing complex problems, from which there is no prior knowledge. We propose a multi-layer planner that first solves a simplified version of the problem at hand, to then inform the search for a solution in the constrained space. Our contributions on efficient motion synthesis via behaviour guidance augment the robotsโ€™ capabilities to deal with more complex planning problems, and do so more effectively than related approaches in the literature by computing better quality paths in lower response time. We demonstrate our contributions, in both laboratory experiments and field trials, on a spectrum of planning problems and robotic platforms ranging from high-dimensional humanoids and robotic arms with a focus on autonomous manipulation in resembling environments, to high-dimensional kinematic motion planning with a focus on autonomous safe navigation in unknown environments. While this thesis was motivated by challenges on motion synthesis, we have explored the applicability of our findings on disparate robotic fields, such as grasp and task planning. We have made some of our contributions open-source hoping they will be of use to the robotics community at large.The CDT in Robotics and Autonomous Systems at Heriot-Watt University and The University of EdinburghThe ORCA Hub EPSRC project (EP/R026173/1)The Scottish Informatics and Computer Science Alliance (SICSA

    Concurrent Skill Composition using Ensemble of Primitive Skills

    Get PDF
    One of the key characteristics of an open-ended cumulative learning agent is that it should use the knowledge gained from prior learning to solve future tasks. That characteristic is especially essential in robotics, as learning every perception-action skill from scratch is not only time consuming but may not always be feasible. In the case of reinforcement learning, this learned knowledge is called a policy. The lifelong learning agent should treat the policies of learned tasks as building blocks to solve those future tasks. One of the categorizations of tasks is based on its composition, ranging from primitive tasks to compound tasks that are either a sequential or concurrent combination of primitive tasks. Thus, the agent needs to be able to combine the policies of the primitive tasks to solve compound tasks, which are then added to its knowledge base. Inspired by modular neural networks, we propose an approach to compose policies for compound tasks that are concurrent combinations of disjoint tasks. Furthermore, we hypothesize that learning in a specialized environment leads to more efficient learning; hence, we create scaffolded environments for the robot to learn primitive skills for our mobile robot-based experiments. We then show how the agent can combine those primitive skills to learn solutions for compound tasks. That reduces the overall training time of multiple skills and creates a versatile agent that can mix and match the skills.</p

    Adaptive Robot Framework: Providing Versatility and Autonomy to Manufacturing Robots Through FSM, Skills and Agents

    Get PDF
    207 p.The main conclusions that can be extracted from an analysis of the current situation and future trends of the industry,in particular manufacturing plants, are the following: there is a growing need to provide customization of products, ahigh variation of production volumes and a downward trend in the availability of skilled operators due to the ageingof the population. Adapting to this new scenario is a challenge for companies, especially small and medium-sizedenterprises (SMEs) that are suffering first-hand how their specialization is turning against them.The objective of this work is to provide a tool that can serve as a basis to face these challenges in an effective way.Therefore the presented framework, thanks to its modular architecture, allows focusing on the different needs of eachparticular company and offers the possibility of scaling the system for future requirements. The presented platform isdivided into three layers, namely: interface with robot systems, the execution engine and the application developmentlayer.Taking advantage of the provided ecosystem by this framework, different modules have been developed in order toface the mentioned challenges of the industry. On the one hand, to address the need of product customization, theintegration of tools that increase the versatility of the cell are proposed. An example of such tools is skill basedprogramming. By applying this technique a process can be intuitively adapted to the variations or customizations thateach product requires. The use of skills favours the reuse and generalization of developed robot programs.Regarding the variation of the production volumes, a system which permits a greater mobility and a faster reconfigurationis necessary. If in a certain situation a line has a production peak, mechanisms for balancing the loadwith a reasonable cost are required. In this respect, the architecture allows an easy integration of different roboticsystems, actuators, sensors, etc. In addition, thanks to the developed calibration and set-up techniques, the system canbe adapted to new workspaces at an effective time/cost.With respect to the third mentioned topic, an agent-based monitoring system is proposed. This module opens up amultitude of possibilities for the integration of auxiliary modules of protection and security for collaboration andinteraction between people and robots, something that will be necessary in the not so distant future.For demonstrating the advantages and adaptability improvement of the developed framework, a series of real usecases have been presented. In each of them different problematic has been resolved using developed skills,demonstrating how are adapted easily to the different casuistic

    Reasoning and understanding grasp affordances for robot manipulation

    Get PDF
    This doctoral research focuses on developing new methods that enable an artificial agent to grasp and manipulate objects autonomously. More specifically, we are using the concept of affordances to learn and generalise robot grasping and manipulation techniques. [75] defined affordances as the ability of an agent to perform a certain action with an object in a given environment. In robotics, affordances defines the possibility of an agent to perform actions with an object. Therefore, by understanding the relation between actions, objects and the effect of these actions, the agent understands the task at hand, providing the robot with the potential to bridge perception to action. The significance of affordances in robotics has been studied from varied perspectives, such as psychology and cognitive sciences. Many efforts have been made to pragmatically employ the concept of affordances as it provides the potential for an artificial agent to perform tasks autonomously. We start by reviewing and finding common ground amongst different strategies that use affordances for robotic tasks. We build on the identified grounds to provide guidance on including the concept of affordances as a medium to boost autonomy for an artificial agent. To this end, we outline common design choices to build an affordance relation; and their implications on the generalisation capabilities of the agent when facing previously unseen scenarios. Based on our exhaustive review, we conclude that prior research on object affordance detection is effective, however, among others, it has the following technical gaps: (i) the methods are limited to a single object โ†” affordance hypothesis, and (ii) they cannot guarantee task completion or any level of performance for the manipulation task alone nor (iii) in collaboration with other agents. In this research thesis, we propose solutions to these technical challenges. In an incremental fashion, we start by addressing the limited generalisation capabilities of, at the time state-of-the-art methods, by strengthening the perception to action connection through the construction of an Knowledge Base (KB). We then leverage the information encapsulated in the KB to design and implement a reasoning and understanding method based on statistical relational leaner (SRL) that allows us to cope with uncertainty in testing environments, and thus, improve generalisation capabilities in affordance-aware manipulation tasks. The KB in conjunctions with our SRL are the base for our designed solutions that guarantee task completion when the robot is performing a task alone as well as when in collaboration with other agents. We finally expose and discuss a range of interesting avenues that have the potential to thrive the capabilities of a robotic agent through the use of the concept of affordances for manipulation tasks. A summary of the contributions of this thesis can be found at: https://bit.ly/grasp_affordance_reasonin
    • โ€ฆ
    corecore