Search CORE

3 research outputs found

Generative Adversarial Network for Imitation Learning from Single Demonstration

Author: Duc Tho Nguyen
Kamioka Eiji
Tan Phan Xuan
Tran Chanh Minh
Publication venue: College of Science for Women - University of Baghdad
Publication date: 20/12/2021
Field of study

التعلم التقليد هو طريقة فعالة لتدريب وكيل مستقل لإنجاز المهمة عن طريق تقليد سلوكيات الخبراء في مظاهراتهم. ومع ذلك، تتطلب طرق التعلم التقليدية التقليدية عددا كبيرا من مظاهرات الخبراء من أجل تعلم سلوك معقد. حدد هذا العيب محدودا إمكانية التعلم التقليد في المهام المعقدة حيث لا تكون مظاهرات الخبراء كافية. من أجل معالجة المشكلة، يقترح النموذج المستند إلى الشبكة المصنوعة من الشبكة المصممة على تصميم سياسات مثالية باستخدام مظاهرة واحدة فقط. يتم تقييم النموذج المقترح على مهمتين محاكاة مقارنة بطرق أخرى. تظهر النتائج أن نموذجنا المقترح قادر على إكمال المهام المدروسة على الرغم من القيد في عدد مظاهرات الخبراء، والذي يشير بوضوح إلى إمكانات نموذجنا.Imitation learning is an effective method for training an autonomous agent to accomplish a task by imitating expert behaviors in their demonstrations. However, traditional imitation learning methods require a large number of expert demonstrations in order to learn a complex behavior. Such a disadvantage has limited the potential of imitation learning in complex tasks where the expert demonstrations are not sufficient. In order to address the problem, we propose a Generative Adversarial Network-based model which is designed to learn optimal policies using only a single demonstration. The proposed model is evaluated on two simulated tasks in comparison with other methods. The results show that our proposed model is capable of completing considered tasks despite the limitation in the number of expert demonstrations, which clearly indicate the potential of our model

Baghdad Science Journal

Data for: Deterministic Generative Adversarial Imitation Learning

Author: Zuo G (via Mendeley Data)
Publication venue
Publication date: 31/03/2020
Field of study

This data set is the experimental data for Deterministic Generative Adversarial Imitation Learning, which includes the cumulative rewards of the agent exploration process and several groups of motion trajectories.THIS DATASET IS ARCHIVED AT DANS/EASY, BUT NOT ACCESSIBLE HERE. TO VIEW A LIST OF FILES AND ACCESS THE FILES IN THIS DATASET CLICK ON THE DOI-LINK ABOV

Electronic Archiving System

Data for: Deterministic Generative Adversarial Imitation Learning

Author: Zuo G
Publication venue: Data Archiving and Networked Services (DANS)
Publication date
Field of study