Search CORE

84 research outputs found

The Amazing Race Repeated Update Q-Learning VS. Q-Learning

Author: Al-Hamad Asma Q.
Bakar Mohammad
Bakkar Hazem
Eleraky Hassan
Khan Sohail
Publication venue: International Journal of Sciences: Basic and Applied Research
Publication date: 18/12/2020
Field of study

In this paper, we will conduct an experiment that aims to compare the performance of two reinforcement learning algorithms, the Repeated Update Q-learning algorithm (RUQL) [1] and the Q-learning algorithm(QL) [5]. A simulated version of a robot crawler developed by [6] will be used in this experiment, it is shown in figure (1). An investigation study about the difference in performance between RUQL and Q-learning algorithm (QL) [5] is discussed in this paper. Several trials and tests were conducted to estimate the difference in the crawler’s movement using both algorithms. Additionally, a detailed description of the Markovian decision processes (MDPs) elements [2] is introduced, MDP model includes states, actions and rewards for the task in hand. The parameters that were used and tuned in this experiment will be mentioned and the reasons for choosing their values will be explained.  Finally, the source code for the crawler robot was modified in order to implement RUQL and Q-Learning (QL) algorithms, Eclipse [3] and Java SE Development Kit 8 (JDK) [4] are used for this purpose. After running the crawler robot simulation, the results drawn from the experiment showed that RUQL significantly outperforms the traditional QL.  &nbsp

GSSRR.ORG: International Journals: Publishing Research Papers in all Fields

Using Reinforcement Learning in the tuning of Central Pattern Generators

Author: Duarte Ana Filipa de Sampaio Calçada
Publication venue
Publication date: 12/12/2012
Field of study

Dissertação de mestrado em Engenharia InformáticaÉ objetivo deste trabalho aplicar técnicas de Reinforcement Learning em tarefas de aprendizagem e locomoção de robôs. Reinforcement Learning é uma técnica de aprendizagem útil no que diz respeito à locomoção de robôs, devido à ênfase que dá à interação direta entre o agente e o meio ambiente, e ao facto de não exigir supervisão ou modelos completos, ao contrário do que acontece nas abordagens clássicas. O objetivo desta técnica consiste na decisão das ações a tomar, de forma a maximizar uma recompensa cumulativa, tendo em conta o facto de que as decisões podem afetar não só as recompensas imediatas, como também as futuras. Neste trabalho será apresentada a estrutura e funcionamento do Reinforcement Learning e a sua aplicação em Central Pattern Generators, com o objetivo de gerar locomoção adaptativa otimizada. De forma a investigar e identificar os pontos fortes e capacidades do Reinforcement Learning, e para demonstrar de uma forma simples este tipo de algoritmos, foram implementados dois casos de estudo baseados no estado da arte. No que diz respeito ao objetivo principal desta tese, duas soluções diferentes foram abordadas: uma primeira baseada em métodos Natural-Actor Critic, e a segunda, em Cross-Entropy Method. Este último algoritmo provou ser capaz de lidar com a integração das duas abordagens propostas. As soluções de integração foram testadas e validadas com recurso ao simulador Webots e ao modelo do robô DARwIN-OP.In this work, it is intended to apply Reinforcement Learning techniques in tasks involving learning and robot locomotion. Reinforcement Learning is a very useful learning technique with regard to legged robot locomotion, due to its ability to provide direct interaction between the agent and the environment, and the fact of not requiring supervision or complete models, in contrast with other classic approaches. Its aim consists in making decisions about which actions to take so as to maximize a cumulative reward or reinforcement signal, taking into account the fact that the decisions may affect not only the immediate reward, but also the future ones. In this work it will be studied and presented the Reinforcement Learning framework and its application in the tuning of Central Pattern Generators, with the aim of generating optimized robot locomotion. In order to investigate the strengths and abilities of Reinforcement Learning, and to demonstrate in a simple way the learning process of such algorithms, two case studies were implemented based on the state-of-the-art. With regard to the main purpose of the thesis, two different solutions are addressed: a first one based on Natural-Actor Critic methods, and a second, based on the Cross-Entropy Method. This last algorithm was found to be very capable of handling with the integration of the two proposed approaches. The integration solutions were tested and validated resorting to Webots simulation and DARwIN-OP robot model

Universidade do Minho: RepositoriUM

CHORUS Deliverable 2.1: State of the Art on Multimedia Search Engines

Author: Boujemaa Nozha
Compañó Ramón
Dosch Christoph
Geurts Joost
Karlgren Jussi
King Paul
Kompatsiaris Yiannis
Köhler Joachim
Le Moine Jean-Yves
Ortgies Robert
Point Jean-Charles
Rotenberg Boris
Rudström Åsa
Sebe Nicu
Publication venue: Chorus Project Consortium
Publication date: 01/01/2007
Field of study

Based on the information provided by European projects and national initiatives related to multimedia search as well as domains experts that participated in the CHORUS Think-thanks and workshops, this document reports on the state of the art related to multimedia content search from, a technical, and socio-economic perspective. The technical perspective includes an up to date view on content based indexing and retrieval technologies, multimedia search in the context of mobile devices and peer-to-peer networks, and an overview of current evaluation and benchmark inititiatives to measure the performance of multimedia search engines. From a socio-economic perspective we inventorize the impact and legal consequences of these technical advances and point out future directions of research

RISE – Research Institutes of Sweden

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Swedish Institute of Computer Science Publications Database

Software institutes' Online Digital Archive

Proceedings of the 2nd Conference on Production Systems and Logistics (CPSL 2021)

Author: Herberger David
Hübner Marco
Publication venue: Hannover : publish-Ing.
Publication date: 01/01/2021
Field of study

Proceedings of the CPSL 202

Institutionelles Repositorium der Leibniz Universität Hannover

Methodological guidelines for independent work on the subject “English” (for 1–2-year full-time and part-time students majoring in 192 – Construction and Civil Engineering).

Author: Ostapchenko V.O.
Publication venue
Publication date: 01/01/2019
Field of study

Цифровий репозиторій ХНАМГ (KNAME Digital Repository)

Design and semantics of form and movement : DeSForM 2007

Author
Publication venue: Koninklijke Philips Electronics
Publication date: 01/01/2008
Field of study

A strong theme that has emerged in our previous two conferences in the importance of narrative to the process of generating, developing and communicating new modalities of interaction between people, things and environments. Our researches have identified aspects of importance in the design and have begun to establish orders of, priority of approach and representation for these aspects as components of interaction. We have begun to grapple with the growth in the complexity of the interaction design process for truly ‘animated’ functionality in products, especially where this manifests itself as apparent behavioural characteristics resident in or portrayed by products. The findings and experience of researchers is that this increase in complexity is likely to be exponential compared to the rigours relating to the resolution of static physical product configuration or even system operated product with screen based interfaces. The emerging sense is that narrative in the process is essential to bring meaning and to ‘touch’ our humanity or connect with human experience. ‘The science of the artificial in conversation with the poetics of human experience’! Through this conference we will once again engage in presentations, debate and demonstrations on these issues. In this respect we, the conference co-chairs, have sought to bring together researchers from academia, industry and professional design practice and related disciplines connected with interactive product service and system development to share our latest thinking in the field, to asses its outcomes and to identify further research questions, opportunities and territories for future investigation and exploration

Pure OAI Repository

Design and semantics of form and movement : DeSForM 2007

Author
Publication venue: Koninklijke Philips Electronics
Publication date: 01/01/2008
Field of study

Pure OAI Repository

Maintenance Management of Wind Turbines

Author
Publication venue: 'MDPI AG'
Publication date: 01/05/2021
Field of study

“Maintenance Management of Wind Turbines” considers the main concepts and the state-of-the-art, as well as advances and case studies on this topic. Maintenance is a critical variable in industry in order to reach competitiveness. It is the most important variable, together with operations, in the wind energy industry. Therefore, the correct management of corrective, predictive and preventive politics in any wind turbine is required. The content also considers original research works that focus on content that is complementary to other sub-disciplines, such as economics, finance, marketing, decision and risk analysis, engineering, etc., in the maintenance management of wind turbines. This book focuses on real case studies. These case studies concern topics such as failure detection and diagnosis, fault trees and subdisciplines (e.g., FMECA, FMEA, etc.) Most of them link these topics with financial, schedule, resources, downtimes, etc., in order to increase productivity, profitability, maintainability, reliability, safety, availability, and reduce costs and downtime, etc., in a wind turbine. Advances in mathematics, models, computational techniques, dynamic analysis, etc., are employed in analytics in maintenance management in this book. Finally, the book considers computational techniques, dynamic analysis, probabilistic methods, and mathematical optimization techniques that are expertly blended to support the analysis of multi-criteria decision-making problems with defined constraints and requirements

Directory of Open Access Books (DOAB)

Form and Movement in Domestic Networked Systems

Author: Boucher Andy
Gaver William
Kerridge Tobie
Law Andy
Publication venue
Publication date
Field of study

It is increasingly desirable for electronic artefacts in the home to be grouped as sets, sharing data and properties across a network. A range of strategies can be used by a designer to explore the value and use of the systems for users, in particular through the properties of form and dynamic behaviours, including visual output and movement. This paper focuses on a range recent work which exploits rich behaviour and novel forms to highlight opportunities for user engagement in the home

Goldsmiths Research Online

Multi-body dynamic and finite element modeling of ultra-large dump truck - haul road interactions for machine health and haul road structural integrity

Author: Kansake Bruno Ayaga
Publication venue: Scholars\u27 Mine
Publication date: 01/01/2019
Field of study

Haul truck capacities have increased due to their economies of scale in large-scale surface mine production systems. Ultra-large trucks impose high dynamic loads on haul roads. The dynamic loads are exacerbated by road surface roughness and truck over-loading. The dynamic forces also subject trucks to high torsional stresses, which affect truck health. Current haul road response models are 2D and use static truckloads for low capacity trucks. Existing 3D models consider the road as a two-layer system. No models capture the truck dynamic effects on haul roads and predict strut pressures during haulage. Lagrangian mechanics was used to formulate the governing equations of the truck-haul road system. The equations were solved in MSC.ADAMS, based on multi-body dynamics, to generate the truck dynamic forces, which were verified and validated using data obtained from an open-pit mine. These forces were used in an FE model developed, verified and validated in ABAQUS to model the response of the haul road to the truck dynamic forces. The road was modeled using an elastoplastic Mohr-Coulomb model. The results showed that the maximum truck tire dynamic forces were 2.86 and 3.02 times the static force at rated payload and 20% over-loading, respectively. The trucks were exposed to torsional stresses that were up to 2.9 times the recommended threshold. Road deformation decreased with increasing layer modulus and increased with increasing payload. This study proposed novel multivariate models for predicting dynamic truck strut pressures. The novel 3D FE model and empirical relations for calculating truck dynamic forces incorporate truck dynamic forces into haul road design. This study forms a basis for designing structurally competent haul roads and improving truck health --Abstract, page iii

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine