4,696 research outputs found

    Reinforcement Learning for Automatic Test Case Prioritization and Selection in Continuous Integration

    Testing in Continuous Integration (CI) involves test case prioritization, selection, and execution at each cycle. Selecting the most promising test cases to detect bugs is hard if the impact of committed code changes is uncertain or if traceability links between code and tests are not available. This paper introduces Retecs, a new method for automatically learning test case selection and prioritization in CI, with the goal of minimizing the round-trip time between code commits and developer feedback on failed test cases. The Retecs method uses reinforcement learning to select and prioritize test cases according to their duration, time of last execution, and failure history. In a constantly changing environment, where new test cases are created and obsolete test cases are deleted, Retecs learns to prioritize error-prone test cases higher, guided by a reward function and by observing previous CI cycles. By applying Retecs to data extracted from three industrial case studies, we show for the first time that reinforcement learning enables fruitful automatic adaptive test case selection and prioritization in CI and regression testing.
    Comment: Spieker, H., Gotlieb, A., Marijan, D., & Mossige, M. (2017). Reinforcement Learning for Automatic Test Case Prioritization and Selection in Continuous Integration. In Proceedings of the 26th International Symposium on Software Testing and Analysis (ISSTA'17) (pp. 12-22). ACM.
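As a rough illustration of the idea of prioritizing by failure history and rewarding early failure detection, here is a hand-coded sketch. The function names, the score, and the rank-weighted reward are illustrative assumptions, not Retecs's actual learned policy or reward function:

```python
def failure_reward(schedule, failed):
    """Illustrative reward: each detected failure earns more the earlier
    its test case appears in the prioritized schedule."""
    n = len(schedule)
    return sum((n - rank) / n for rank, t in enumerate(schedule) if t in failed)

def prioritize(test_cases, history):
    """Order tests by a simple failure-rate proxy for a learned score:
    history maps a test id to a list of past results (1 = failed, 0 = passed)."""
    def score(t):
        results = history.get(t, [])
        # Unknown tests get a middling score so they still get explored.
        return sum(results) / len(results) if results else 0.5
    return sorted(test_cases, key=score, reverse=True)
```

For example, a test that failed in both of its last runs is scheduled ahead of a never-run test, which in turn precedes a consistently passing one.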

    Addressing Function Approximation Error in Actor-Critic Methods

    In value-based reinforcement learning methods such as deep Q-learning, function approximation errors are known to lead to overestimated value estimates and suboptimal policies. We show that this problem persists in an actor-critic setting and propose novel mechanisms to minimize its effects on both the actor and the critic. Our algorithm builds on Double Q-learning, taking the minimum value between a pair of critics to limit overestimation. We draw the connection between target networks and overestimation bias, and suggest delaying policy updates to reduce per-update error and further improve performance. We evaluate our method on the suite of OpenAI Gym tasks, outperforming the state of the art in every environment tested.
    Comment: Accepted at ICML 2018.
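The min-over-two-critics idea described above can be sketched in a few lines. This is a minimal illustration of the clipped double-Q target, not the paper's full training loop; the function name and scalar inputs are assumptions:

```python
import numpy as np

def td3_target(reward, done, q1_next, q2_next, gamma=0.99):
    """Clipped double-Q bootstrap target: use the minimum of the two
    critics' next-state estimates to curb overestimation bias."""
    return reward + gamma * (1.0 - done) * np.minimum(q1_next, q2_next)
```

Both critics regress toward this single target; the actor (and target networks) would then be updated only every few critic steps, per the delayed-policy-update suggestion.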

    Local ant system for allocating robot swarms to time-constrained tasks

    We propose a novel application of the Ant Colony Optimization algorithm to efficiently allocate a swarm of homogeneous robots to a set of tasks that must be accomplished by specific deadlines. We exploit local communication between robots to periodically evaluate the quality of the allocation solutions, and agents select independently among the high-quality alternatives. The evaluation uses pheromone trails to favor allocations that minimize the execution time of the tasks. Our approach is validated in both static and dynamic environments (i.e., task availability changes over time) using different sets of physics-based simulations.
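A pheromone-guided choice like the one described can be sketched with the standard ACO selection rule, where an option is picked with probability proportional to pheromone^alpha times heuristic^beta. This is an illustrative sketch of the generic rule, not the paper's exact allocation procedure:

```python
import random

def choose_task(tasks, pheromone, heuristic, alpha=1.0, beta=2.0, rng=random):
    """Roulette-wheel selection over tasks, weighted by
    pheromone[t]**alpha * heuristic[t]**beta (standard ACO rule)."""
    weights = [pheromone[t] ** alpha * heuristic[t] ** beta for t in tasks]
    r = rng.random() * sum(weights)
    acc = 0.0
    for t, w in zip(tasks, weights):
        if w == 0.0:
            continue  # zero-pheromone options are never selected
        acc += w
        if r <= acc:
            return t
    return tasks[-1]  # numerical fallback
```

In a swarm setting, each robot would run this rule locally, so agents commit to different high-quality tasks without a central allocator.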

    κ°œλ―Έμ•Œκ³ λ¦¬μ¦˜μ„ μ΄μš©ν•œ λ“œλ‘ μ˜ μ œμ„€ 경둜 μ΅œμ ν™”

    Master's thesis -- Seoul National University Graduate School: College of Engineering, Department of Civil and Environmental Engineering, February 2022. Advisor: Dong-Kyu Kim.
    Drones can overcome the limitations of ground vehicles by avoiding congestion and enabling rapid service. For sudden snowfall under climate change, a quickly deployed drone can be a flexible alternative once deadhead routes and labor costs are considered. The goal of this study is to optimize a drone arc routing problem (D-ARP), servicing the roads that require snow removal. A D-ARP imposes a heavy computational burden, especially on large networks: its search space is large due to the exponentially growing set of candidate routes, the arc-direction decisions, and the continuous arc space. To reduce the search space, we developed an auxiliary transformation method within an ant colony optimization (ACO) algorithm and adopted a random walk method. The contribution of this work is to introduce the D-ARP as a new problem and optimization approach for snow removal operations and to reduce its search space. The optimization results confirmed that the drone travels a 5% to 22% shorter distance than the truck. Furthermore, even under the length-constraint model, the drone shows a 4% reduction compared to the truck. The test sets demonstrated that the adopted heuristic algorithm performs well on large networks in reasonable time. Based on these results, introducing drones into snow removal is expected to reduce operating costs in practice.
    Contents: 1. Introduction (Study Background; Purpose of Research). 2. Literature Review (Drone Arc Routing Problem; Snow Removal Routing Problem; The Classic ARPs and Algorithms; Large Search Space and Arc Direction). 3. Method (Problem Statement; Formulation). 4. Algorithm (Overview; Auxiliary Transformation Method; Ant Colony Optimization (ACO); Post-Process for Arc Direction Decision; Length Constraint and Random Walk). 5. Results (Application in Toy Network; Application in Real-World Networks; Application of the Refill Constraint in Seoul). 6. Conclusion. References. Acknowledgment.

    Hopscotch: Robust Multi-agent Search

    The task of searching a space is critical to a wide range of diverse applications such as land mine clearing and planetary exploration. Because applications frequently require searching remote or hazardous locations, and because the task is easily divisible, it is natural to consider the use of multi-robot teams to accomplish the search task. An important topic of research in this area is the division of the task among robot agents. Interrelated with subtask assignment is failure handling, in the sense that, when an agent fails, its part of the task must then be performed by other agents. This thesis describes Hopscotch, a multi-agent search strategy that divides the search area into a grid of lots. Each agent is assigned responsibility to search one lot at a time, and upon completing the search of that lot the agent is assigned a new lot. Assignment occurs in real time using a simple contract net. Because lots that have been previously searched are skipped, the order of search from the point of view of a particular agent is reminiscent of the progression of steps in the playground game of Hopscotch. Decomposition of the search area is a common approach to multi-agent search, and auction-based contract net strategies have appeared in recent literature as a method of task allocation in multi-agent systems. The Hopscotch strategy combines the two, with a strong focus on robust tolerance of agent failures. Contract nets typically divide all known tasks among available resources. In contrast, Hopscotch limits each agent to one assigned lot at a time, so that failure of an agent compels re-allocation of only one lot search task. Furthermore, the contract net is implemented in an unconventional manner that empowers each agent with responsibility for contract management. This novel combination of real-time assignment and decentralized management allows Hopscotch to resiliently cope with agent failures. 
The Hopscotch strategy was modeled and compared to other multi-agent strategies that tackle the search task in a variety of ways. Simulation results show that Hopscotch is failure-tolerant and very effective in comparison to the other approaches in terms of both search time and search efficiency. Although the search task modeled here is a basic one, results from simulations show the promise of using this strategy for more complicated scenarios, and with actual robot agents.
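The one-lot-at-a-time contract net described above can be sketched as a simple auction plus a failure handler. The function names and cost-based bids are illustrative assumptions, not the thesis's implementation:

```python
def award_lot(lot, bids):
    """Contract-net award: the lot goes to the lowest-cost bidder.
    bids maps agent id -> bid cost for this lot."""
    return min(bids, key=bids.get)

def reassign_on_failure(assignments, failed_agent, bids):
    """Because each agent holds only one lot at a time, an agent failure
    forces re-auction of just that single lot among the surviving agents."""
    lot = assignments.pop(failed_agent, None)
    if lot is not None:
        live = {a: c for a, c in bids.items() if a != failed_agent}
        assignments[award_lot(lot, live)] = lot
    return assignments
```

This localizes the cost of a failure: only the failed agent's current lot is re-auctioned, while every other agent's contract is untouched.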