48 research outputs found

    Automating Vehicles by Deep Reinforcement Learning using Task Separation with Hill Climbing

    Within the context of autonomous driving, a model-based reinforcement learning algorithm is proposed for the design of neural network-parameterized controllers. Classical model-based control methods, which include sampling- and lattice-based algorithms and model predictive control, suffer from a trade-off between model complexity and the computational burden of solving expensive optimization or search problems online at every short sampling time. To circumvent this trade-off, a 2-step procedure is motivated: a controller is first learned offline from an arbitrarily complex mathematical system model, and the trained controller is then evaluated online as a fast feedforward pass. The contribution of this paper is a simple gradient-free, model-based algorithm for deep reinforcement learning using task separation with hill climbing (TSHC). In particular, the paper advocates (i) simultaneous training on separate deterministic tasks, with the purpose of encoding many motion primitives in a neural network, and (ii) the use of maximally sparse rewards in combination with virtual velocity constraints (VVCs) in setpoint proximity. Comment: 10 pages, 6 figures, 1 table
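
    A minimal sketch of the gradient-free hill-climbing loop described in this abstract, assuming a user-supplied simulate_task rollout function that returns a sparse return for one deterministic task; the hyperparameters (sigma, pop, iters) and all names are illustrative assumptions, not the authors' implementation:

    import numpy as np

    def tshc_hill_climbing(theta0, tasks, simulate_task, iters=1000, sigma=0.1, pop=20):
        # Aggregate the sparse returns over all tasks, so a single parameter
        # vector theta must encode every motion primitive simultaneously.
        def total_return(theta):
            return sum(simulate_task(theta, task) for task in tasks)

        rng = np.random.default_rng(0)
        theta_best, score_best = theta0, total_return(theta0)
        for _ in range(iters):
            for _ in range(pop):
                # Random perturbation of the neural-network parameters
                # (gradient-free exploration step).
                cand = theta_best + sigma * rng.standard_normal(theta_best.shape)
                score = total_return(cand)
                if score > score_best:  # greedy hill-climbing acceptance
                    theta_best, score_best = cand, score
        return theta_best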

    Sampling-based Algorithms for Optimal Motion Planning

    During the last decade, sampling-based path planning algorithms, such as Probabilistic RoadMaps (PRM) and Rapidly-exploring Random Trees (RRT), have been shown to work well in practice and possess theoretical guarantees such as probabilistic completeness. However, little effort has been devoted to the formal analysis of the quality of the solution returned by such algorithms, e.g., as a function of the number of samples. The purpose of this paper is to fill this gap, by rigorously analyzing the asymptotic behavior of the cost of the solution returned by stochastic sampling-based algorithms as the number of samples increases. A number of negative results are provided, characterizing existing algorithms, e.g., showing that, under mild technical conditions, the cost of the solution returned by broadly used sampling-based algorithms converges almost surely to a non-optimal value. The main contribution of the paper is the introduction of new algorithms, namely, PRM* and RRT*, which are provably asymptotically optimal, i.e., such that the cost of the returned solution converges almost surely to the optimum. Moreover, it is shown that the computational complexity of the new algorithms is within a constant factor of that of their probabilistically complete (but not asymptotically optimal) counterparts. The analysis in this paper hinges on novel connections between stochastic sampling-based path planning algorithms and the theory of random geometric graphs. Comment: 76 pages, 26 figures, to appear in the International Journal of Robotics Research
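
    As an illustration of the asymptotically optimal rewiring idea behind RRT*, the sketch below grows a tree in an obstacle-free 2-D unit square with a connection radius shrinking as gamma * (log n / n)^(1/d); collision checking, steering, goal handling, and cost propagation to descendants after rewiring are omitted, and gamma is an assumed constant:

    import math, random

    def rrt_star(start, n_samples=500, gamma=1.5, d=2):
        nodes, parent, cost = [start], {0: None}, {0: 0.0}

        def dist(a, b):
            return math.hypot(a[0] - b[0], a[1] - b[1])

        for i in range(1, n_samples + 1):
            x_rand = (random.random(), random.random())
            nearest = min(range(len(nodes)), key=lambda j: dist(nodes[j], x_rand))
            # Shrinking connection radius, as in the asymptotic optimality analysis.
            r = gamma * (math.log(i + 1) / (i + 1)) ** (1.0 / d)
            near = [j for j in range(len(nodes)) if dist(nodes[j], x_rand) <= r] or [nearest]
            # Connect the new sample through the parent minimizing cost-to-come.
            best = min(near, key=lambda j: cost[j] + dist(nodes[j], x_rand))
            nodes.append(x_rand)
            parent[i], cost[i] = best, cost[best] + dist(nodes[best], x_rand)
            # Rewire: reroute nearby nodes through the new sample when cheaper.
            for j in near:
                if cost[i] + dist(x_rand, nodes[j]) < cost[j]:
                    parent[j], cost[j] = i, cost[i] + dist(x_rand, nodes[j])
        return nodes, parent, cost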

    Pragmatic markers in Hungarian: Some introductory remarks


    Researching the social impact of the arts : literature, fiction and the novel

    This paper offers a contribution to current debates in the field of cultural policy about the social impact of the arts. It explores the conceptual difficulties that arise in the notion of ‘the arts’ and the implications of these difficulties for attempts to generalise about their value, function and impact. It considers both ‘essentialist’ and ‘institutional’ perspectives, first on ‘the arts’ in toto and then on literature, fiction and the novel, with a view to making an innovative intellectual connection between aesthetic theories and contemporary cultural policy discourse. The paper shows how literature sits uneasily in the main systems of classifying the arts and how the novel and fiction itself are seen as problematic categories. The position of the novel in the literary canon is also discussed, with particular reference to the shifting instability of the canon. The paper suggests that the dilemmas thrown up in trying to define or classify the novel are likely to be encountered in attempting to define other art forms. The implications of these findings for the interpretation and conduct of traditional ‘impact studies’ are explored.

    Receding Horizon Control of UAVs using Gradual Dense-Sparse Discretizations

    In this paper we propose a way of increasing the efficiency of some direct Receding Horizon Control (RHC) schemes. The basic idea is to adapt the allocation of computational resources to how the iterative plans are used. By using Gradual Dense-Sparse discretizations (GDS), we make sure that the plans are detailed where they need to be, i.e., in the very near future, and less detailed further ahead. The gradual transition in discretization density reflects increased uncertainty and a reduced need for detail near the end of the planning horizon. The proposed extension is natural, since the standard RHC approach already contains a computational asymmetry in terms of the coarse cost-to-go computations and the more detailed short horizon plans. Using GDS discretizations, we take this asymmetry one step further, and let the short horizon plans themselves be detailed in the near term and coarser in the long term. The rationale for different levels of detail is as follows. 1) Near-future plans need to be implemented soon, while far-future plans can be refined or revised later. 2) More accurate sensor information is available about the system and its surroundings in the near future, and detailed planning is only rational in low-uncertainty situations. 3) It has been shown that reducing the node density in the later parts of fixed horizon optimal control problems gives a very small reduction in the solution quality of the first part of the trajectory. The reduced level of detail in the later parts of a plan can increase the efficiency of the RHC in two ways: if the discretization is made sparse by removing nodes, fewer computations are necessary, and if the discretization is made sparse by spreading the last nodes over a longer time horizon, the performance will be improved.
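
    A small sketch of what a Gradual Dense-Sparse time grid could look like; the geometric growth factor and node count are assumed for illustration and are not values from the paper:

    def gds_grid(t0, dt0, n_nodes, growth=1.3):
        """Discretization times starting at t0 with first step dt0; each
        subsequent step is 'growth' times longer, so node density is high
        in the near future and low toward the end of the horizon."""
        times, t, dt = [t0], t0, dt0
        for _ in range(n_nodes - 1):
            t += dt
            times.append(t)
            dt *= growth
        return times

    # Example: 12 nodes with a 0.1 s first step cover a horizon of roughly
    # 9.8 s, the last step being about 14 times longer than the first.
    grid = gds_grid(0.0, 0.1, 12)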