Search CORE

29,849 research outputs found

Collective Robot Reinforcement Learning with Distributed Asynchronous Guided Policy Search

Author: Chebotar Yevgen
Kalakrishnan Mrinal
Levine Sergey
Li Adrian
Yahya Ali
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 03/10/2016
Field of study

In principle, reinforcement learning and policy search methods can enable robots to learn highly complex and general skills that may allow them to function amid the complexity and diversity of the real world. However, training a policy that generalizes well across a wide range of real-world conditions requires far greater quantity and diversity of experience than is practical to collect with a single robot. Fortunately, it is possible for multiple robots to share their experience with one another, and thereby, learn a policy collectively. In this work, we explore distributed and asynchronous policy learning as a means to achieve generalization and improved training times on challenging, real-world manipulation tasks. We propose a distributed and asynchronous version of Guided Policy Search and use it to demonstrate collective policy learning on a vision-based door opening task using four robots. We show that it achieves better generalization, utilization, and training times than the single robot alternative.Comment: Submitted to the IEEE International Conference on Robotics and Automation 201

arXiv.org e-Print Archive

Crossref

Self-adaptive differential artificial bee colony algorithm for global optimization problems

Author: Chen Xu
Li Kangji
Tianfield Huaglory
Publication venue: 'Elsevier BV'
Publication date: 01/03/2019
Field of study

ResearchOnline@GCU

Leaning an University Department: a life experiment

Author: Carvalho Pedro G.
Miragaia Dina
Publication venue
Publication date
Field of study

The European Quality Assurance methodology is pushing hard Portuguese Universities so that they should improve their overall performance. Working at a Portuguese University more than a decade ago, one of the authors experienced several life cycles in different Departments and the experience acquired in foreign Universities (USA) teached him a couple of simple things in order to positively participate in this kind of processes. However, he found it quite difficult to apply his knowledge without other’s contribution, due to several endogenous and exogenous reasons, including age and generation viewpoints. Together with the second author we started to apply some theoretical new insights we were discussing together during her PhD research. The purpose of this paper is to describe the experiment we are in now, following a social network methodology used in my Economics PhD together with three theoretical influences we think are inter twinkled like the lean thinking, the value focus thinking and the complication in innovation diffusion processes. After a brief literature review we describe the basic pillars we used to achieve the main goal of improving performance in a young university department. Using some coaching and economic tools and knowledge, we were able to gather a group of different people – students, staff and teachers - deeply involved in our proposal methodology. Preliminary results are briefly identified, as much as further research challenges.Lean thinking; quality improvement; social networks analysis; decision making; Portuguese Universities

Research Papers in Economics

Metaheuristic design of feedforward neural networks: a review of two decades of research

Author: Abbass
Abraham
Ackley
Ajith Abraham
Akhand
Alba
Ali Ahmadi
Almeida
Alvarez
Amari
Andersen
Angeline
Arifovic
Augusteijn
Azimi-Sadjadi
Bakker
Baranyi
Battiti
Bertsekas
Bishop
Bland
Bousquet
Boussaid
Breiman
Brownlee
Carvalho
Chandra
Charalambous
Chen
Chen
Chen
Chen
Cho
Chrisley
Coello
Cortes
Costa
Cruz-Ramírez
Cybenko
Da
da Silva
Dai
Das
Das
Davis
de Albuquerque Teixeira
Deneubourg
Dhahri
Diebold
Ding
Ditzler
Dominey
Donate
Dorigo
Dumont
Engel
Fahlman
Feo
FernandezCaballero
Fister
Fletcher
Fogel
Fogel
Fontanari
Formato
Frean
Fukumizu
Fullér
Furtuna
Garcia-Pedrajas
García-Pedrajas
García-Pedrajas
Gaspar-Cunha
Geem
Geman
Gershenfeld
Ghalambaz
Girosi
Giustolisi
Glover
Goh
Goldberg
Gori
Gorin
Green
Grossberg
Hagan
Hansen
Haykin
Haykin
Hernández
Hestenes
Hinton
Hinton
Hinton
Hirose
Ho
Holland
Hopfield
Hornik
Hornik
Huang
Huang
Huang
Huang
Huang
Igel
Ilonen
Irani
Irani
Islam
Jacobs
Jain
Jain
Jin
Juang
Kaelbling
Karaboga
Karpat
Kennedy
Khan
Khan
Kim
Kim
Kim
Kim
Kiranyaz
Kirkpatrick
Kitano
Kitano
Kohonen
Kolmogorov
Kordík
Kouda
Koza
Kulluk
Kŭrková
Lam
Larrañaga
LeCun
Lera
Leshno
Leung
Leung
Lewenstein
Li
Lin
Lin
Ling
Lippmann
Liu
Liu
Lowe
Ludermir
Mahdavi
Maniezzo
March
Marquardt
Martínez-Muñoz
Mazurowski
McCulloch
Menczer
Merrill
Metropolis
Minku
Minsky
Mirjalili
Mirjalili
Mitra
Mjolsness
Mladenović
Moriarty
Murray
Nakama
Nandy
Narayanan
Natschläger
Nedjah
Niranjan
Niu
Nolfi
Oh
Ojha
Osman
Pan
Passino
Pearce
Pencina
Peng
Pettersson
Pipino
Polikar
Prechelt
Prisecaru
Puig
Rashedi
Reed
Ritchie
Rosenblatt
Rumelhart
Rumelhart
Saad
Salajegheh
Sarkar
Schaffer
Schapire
Schmidhuber
Schwefel
Sejnowski
Selmic
Sexton
Sexton
Sexton
Shang
Sharma
Sietsma
Simovici
Sivagaminathan
Slowik
Socha
Socha
Sokolova
Sporea
Stanley
Storn
Sum
Sörensen
Tang
Tayefeh Mahmoudi
Toh
Tong
Trelea
Trentin
Tsai
Tsai
Tsoulos
Twomey
Ulagammai
Van den Bergh
van der Voet
Varun Kumar Ojha
Venkadesh
Ventura
Vieira
Václav Snášel
Wand
Wang
Wessels
Weyland
Whitley
Widrow
Wiegand
Wilson
Wolpert
Wolpert
Xi-Zhao
Yaghini
Yang
Yang
Yao
Yao
Yao
Yao
Yao
Yao
Yao
Ye
Yin
Yusiong
Zhang
Zhang
Zhang
Zhang
Zhang
Zhao
Zhou
Zhou
Zikopoulos
Zăvoianu
Černỳ
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

Over the past two decades, the feedforward neural network (FNN) optimization has been a key interest among the researchers and practitioners of multiple disciplines. The FNN optimization is often viewed from the various perspectives: the optimization of weights, network architecture, activation nodes, learning parameters, learning environment, etc. Researchers adopted such different viewpoints mainly to improve the FNN's generalization ability. The gradient-descent algorithm such as backpropagation has been widely applied to optimize the FNNs. Its success is evident from the FNN's application to numerous real-world problems. However, due to the limitations of the gradient-based optimization methods, the metaheuristic algorithms including the evolutionary algorithms, swarm intelligence, etc., are still being widely explored by the researchers aiming to obtain generalized FNN for a given problem. This article attempts to summarize a broad spectrum of FNN optimization methodologies including conventional and metaheuristic approaches. This article also tries to connect various research directions emerged out of the FNN optimization practices, such as evolving neural network (NN), cooperative coevolution NN, complex-valued NN, deep learning, extreme learning machine, quantum NN, etc. Additionally, it provides interesting research challenges for future research to cope-up with the present information processing era

arXiv.org e-Print Archive

Central Archive at the University of Reading

Repository for Publications and Research Data

Crossref

DSpace at VSB Technical University of Ostrava