Search CORE

5 research outputs found

Least Squares Temporal Difference Actor-Critic Methods with Applications to Robot Motion Control

Author: Belta Calin A.
Ding Xu Chu
Estanjini Reza Moazzez
Lahijanian Morteza
Paschalidis Ioannis Ch.
Wang Jing
Publication venue
Publication date: 01/01/2011
Field of study

We consider the problem of finding a control policy for a Markov Decision Process (MDP) to maximize the probability of reaching some states while avoiding some other states. This problem is motivated by applications in robotics, where such problems naturally arise when probabilistic models of robot motion are required to satisfy temporal logic task specifications. We transform this problem into a Stochastic Shortest Path (SSP) problem and develop a new approximate dynamic programming algorithm to solve it. This algorithm is of the actor-critic type and uses a least-square temporal difference learning method. It operates on sample paths of the system and optimizes the policy within a pre-specified class parameterized by a parsimonious set of parameters. We show its convergence to a policy corresponding to a stationary point in the parameters' space. Simulation results confirm the effectiveness of the proposed solution.Comment: Technical report accompanying an accepted paper to CDC 201

arXiv.org e-Print Archive

Boston University Institutional Repository (OpenBU)

Minimizing deep sea data collection delay with autonomous underwater vehicles

Author: Akyildiz
Ayaz
Baccour
Cho
Cui
Fleischner
Heidemann
Huanyang Zheng
Incel
Jie Wu
Klein
Kolmogorov
Liu
Ma
Moazzez-Estanjini
Ning Wang
Partan
Tekdas
Ullgren
Publication venue: 'Elsevier BV'
Publication date
Field of study

Collision-free and crossing-free trajectory design for second-order agents persistent monitoring

Author: Boyd
Bryson
Cassandras
Cassandras
Cassandras
Cassandras
da Silva
Foderaro
Jiang-Wen Xiao
Klesh
Le Ny
Lin
Lin
Ming-Jie Zhao
Moazzez-Estanjini
Nigam
Qu
Santos
Smith
Soltero
Song
Su
Wang
Wang
Wang
Wu
Wu Yang
Yan-Wu Wang
Yu
Yu
Zhong
Zhou
Zhou
Zhou
Publication venue: 'Elsevier BV'
Publication date
Field of study

A distributed and energy-efficient approach for collecting emergency data in wireless sensor networks with mobile sinks

Author: Almi'ani
Awerbuch
Bhatia
Cheng
De
Ghosh
Gupta
Heinzelman
Imon
Kaswan
Kaswan
Leili Farzinvash
Li
Li
Li
Li
Li
Li
Li
Li
Majma
Meera
Moazzez-Estanjini
Mottaghi
Nuruzzaman
Pazzi
Saginbekov
Salarian
Samad Najjar-Ghabel
Sharma
Shi
Suganthi
Tahmineh Javadzadeh
Wang
Wang
Wang
Wen
Weng
Wu
Yao
Zhao
Publication venue: 'Elsevier BV'
Publication date
Field of study

Tag-based cooperative data gathering and energy recharging in wide area RFID sensor networks

Author: Al-Turjman
Almiani
Almi’ani
Almi’ani
Amendola
Antonella Molinaro
Antonio Iera
Bansal
Baronti
Buettner
Cheng
Dunbabin
Farris
Farris
Fu
Grefenstette
Gu
Guo
Guo
He
He
Ivan Farris
Jara
Kurs
Leonardo Militano
Lu
Ma
Madhja
Mitrokotsa
Moazzez-Estanjini
Pandya
Ruiz-Garcia
Shahbazi
Shi
Silverio Carlo Spinella
Smith
Tekdas
Wang
Wang
Wang
Wu
Yeager
Zhang
Zhao
Zhao
Zhao
Publication venue: 'Elsevier BV'
Publication date
Field of study

core

core