Search CORE

11,025 research outputs found

Learning to Segment and Represent Motion Primitives from Driving Data for Motion Planning Applications

Author: Chen Huiyan
Gong Jianwei
Wang Boyang
Zhang Ruizeng
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 17/09/2018
Field of study

Developing an intelligent vehicle which can perform human-like actions requires the ability to learn basic driving skills from a large amount of naturalistic driving data. The algorithms will become efficient if we could decompose the complex driving tasks into motion primitives which represent the elementary compositions of driving skills. Therefore, the purpose of this paper is to segment unlabeled trajectory data into a library of motion primitives. By applying a probabilistic inference based on an iterative Expectation-Maximization algorithm, our method segments the collected trajectories while learning a set of motion primitives represented by the dynamic movement primitives. The proposed method utilizes the mutual dependencies between the segmentation and representation of motion primitives and the driving-specific based initial segmentation. By utilizing this mutual dependency and the initial condition, this paper presents how we can enhance the performance of both the segmentation and the motion primitive library establishment. We also evaluate the applicability of the primitive representation method to imitation learning and motion planning algorithms. The model is trained and validated by using the driving data collected from the Beijing Institute of Technology intelligent vehicle platform. The results show that the proposed approach can find the proper segmentation and establish the motion primitive library simultaneously

arXiv.org e-Print Archive

Crossref

Evorus: A Crowd-powered Conversational Assistant Built to Automate Itself Over Time

Author: Banchs Rafael E
Carpenter Rollo
Constine Josh
Gasic Milica
Han Bo
Hempel Jessi
Huang Ting-Hao K.
Kamar Ece
Kamar Ece
Kenneth Huang Ting-Hao
Kenneth Huang Ting-Hao
Li Jiwei
Li Xuijun
Lynley Matthew
Newton Casey
Pennington Jeffrey
Sadun Erica
Sarma Akash Das
Ting-Hao
Wen Tsung-Hsien
Wikipedia Cleverbot
Wikipedia Tay
Witten Ian H
Zhao Tiancheng
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 09/01/2018
Field of study

Crowd-powered conversational assistants have been shown to be more robust than automated systems, but do so at the cost of higher response latency and monetary costs. A promising direction is to combine the two approaches for high quality, low latency, and low cost solutions. In this paper, we introduce Evorus, a crowd-powered conversational assistant built to automate itself over time by (i) allowing new chatbots to be easily integrated to automate more scenarios, (ii) reusing prior crowd answers, and (iii) learning to automatically approve response candidates. Our 5-month-long deployment with 80 participants and 281 conversations shows that Evorus can automate itself without compromising conversation quality. Crowd-AI architectures have long been proposed as a way to reduce cost and latency for crowd-powered systems; Evorus demonstrates how automation can be introduced successfully in a deployed system. Its architecture allows future researchers to make further innovation on the underlying automated components in the context of a deployed open domain dialog system.Comment: 10 pages. To appear in the Proceedings of the Conference on Human Factors in Computing Systems 2018 (CHI'18

arXiv.org e-Print Archive

Crossref

Multistart Methods for Quantum Approximate Optimization

Author: Larson Jeffrey
Safro Ilya
Shaydulin Ruslan
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 23/05/2019
Field of study

Hybrid quantum-classical algorithms such as the quantum approximate optimization algorithm (QAOA) are considered one of the most promising approaches for leveraging near-term quantum computers for practical applications. Such algorithms are often implemented in a variational form, combining classical optimization methods with a quantum machine to find parameters to maximize performance. The quality of the QAOA solution depends heavily on quality of the parameters produced by the classical optimizer. Moreover, the presence of multiple local optima in the space of parameters makes it harder for the classical optimizer. In this paper we study the use of a multistart optimization approach within a QAOA framework to improve the performance of quantum machines on important graph clustering problems. We also demonstrate that reusing the optimal parameters from similar problems can improve the performance of classical optimization methods, expanding on similar results for MAXCUT

arXiv.org e-Print Archive

Goal-oriented Dialogue Policy Learning from Failures

Author: Chen Xiaoping
Lu Keting
Zhang Shiqi
Publication venue
Publication date: 22/11/2018
Field of study

Reinforcement learning methods have been used for learning dialogue policies. However, learning an effective dialogue policy frequently requires prohibitively many conversations. This is partly because of the sparse rewards in dialogues, and the very few successful dialogues in early learning phase. Hindsight experience replay (HER) enables learning from failures, but the vanilla HER is inapplicable to dialogue learning due to the implicit goals. In this work, we develop two complex HER methods providing different trade-offs between complexity and performance, and, for the first time, enabled HER-based dialogue policy learning. Experiments using a realistic user simulator show that our HER methods perform better than existing experience replay methods (as applied to deep Q-networks) in learning rate

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

A Deep Hierarchical Approach to Lifelong Learning in Minecraft

Author: Givony Shahar
Mankowitz Daniel J.
Mannor Shie
Tessler Chen
Zahavy Tom
Publication venue
Publication date: 30/11/2016
Field of study

We propose a lifelong learning system that has the ability to reuse and transfer knowledge from one task to another while efficiently retaining the previously learned knowledge-base. Knowledge is transferred by learning reusable skills to solve tasks in Minecraft, a popular video game which is an unsolved and high-dimensional lifelong learning problem. These reusable skills, which we refer to as Deep Skill Networks, are then incorporated into our novel Hierarchical Deep Reinforcement Learning Network (H-DRLN) architecture using two techniques: (1) a deep skill array and (2) skill distillation, our novel variation of policy distillation (Rusu et. al. 2015) for learning skills. Skill distillation enables the HDRLN to efficiently retain knowledge and therefore scale in lifelong learning, by accumulating knowledge and encapsulating multiple reusable skills into a single distilled network. The H-DRLN exhibits superior performance and lower learning sample complexity compared to the regular Deep Q Network (Mnih et. al. 2015) in sub-domains of Minecraft

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

OSQP: An Operator Splitting Solver for Quadratic Programs

Author: Banjac Goran
Bemporad Alberto
Boyd Stephen
Goulart Paul
Stellato Bartolomeo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

We present a general-purpose solver for convex quadratic programs based on the alternating direction method of multipliers, employing a novel operator splitting technique that requires the solution of a quasi-definite linear system with the same coefficient matrix at almost every iteration. Our algorithm is very robust, placing no requirements on the problem data such as positive definiteness of the objective function or linear independence of the constraint functions. It can be configured to be division-free once an initial matrix factorization is carried out, making it suitable for real-time applications in embedded systems. In addition, our technique is the first operator splitting method for quadratic programs able to reliably detect primal and dual infeasible problems from the algorithm iterates. The method also supports factorization caching and warm starting, making it particularly efficient when solving parametrized problems arising in finance, control, and machine learning. Our open-source C implementation OSQP has a small footprint, is library-free, and has been extensively tested on many problem instances from a wide variety of application areas. It is typically ten times faster than competing interior-point methods, and sometimes much more when factorization caching or warm start is used. OSQP has already shown a large impact with tens of thousands of users both in academia and in large corporations

arXiv.org e-Print Archive

Repository for Publications and Research Data

DSpace@MIT

Crossref

Oxford University Research Archive

Archivio della ricerca della Scuola IMT Alti Studi Lucca

The TRIZ-CBR synergy: A knowledge based innovation process

Author: Cortes Robles Guillermo
Le Lann Jean-Marc
Machorro Rodriguez Ángel
Negny Stéphane
Publication venue: 'Inderscience Publishers'
Publication date: 01/03/2011
Field of study

Today innovation is recognised as the main driving force in the market. This complex process involves several intangible dimensions, such as creativity, knowledge and social interactions among others. Creativity is the starting point of the process, and knowledge is the force that transforms and materialises creativity in new products, services and processes. In this paper a synergy that aims to assists the innovation process is presented. The synergy combines several concepts and tools of the theory of inventive problem solving (TRIZ) and the case-based reasoning (CBR) process. The main objective of this synergy is to support creative engineering design and problem solving. This synergy is based on the strong link between knowledge and action. In this link, TRIZ offers several concepts and tools to facilitate concept creation and to solve problems, and the CBR process offers a framework capable of storing and reusing knowledge with the aim of accelerating the innovation process

Open Archive Toulouse Archive Ouverte