11,470 research outputs found

    Spatial-temporal recurrent reinforcement learning for autonomous ships

    Full text link
    The paper proposes a spatial-temporal recurrent neural network architecture for Deep QQ-Networks to steer an autonomous ship. The network design allows handling an arbitrary number of surrounding target ships while offering robustness to partial observability. Further, a state-of-the-art collision risk metric is proposed to enable an easier assessment of different situations by the agent. The COLREG rules of maritime traffic are explicitly considered in the design of the reward function. The final policy is validated on a custom set of newly created single-ship encounters called "Around the Clock" problems and the commonly chosen Imazu (1987) problems, which include 18 multi-ship scenarios. Additionally, the framework shows robustness when deployed simultaneously in multi-agent scenarios. The proposed network architecture is compatible with other deep reinforcement learning algorithms, including actor-critic frameworks

    ํ•ด์–‘ ์ž‘์—… ์ง€์›์„ ์˜ ์ž์œจ ์šดํ•ญ ๋ฐ ์„ค์น˜ ์ž‘์—… ์ง€์›์„ ์œ„ํ•œ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๋ฐฉ๋ฒ•

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (๋ฐ•์‚ฌ)-- ์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ์กฐ์„ ํ•ด์–‘๊ณตํ•™๊ณผ, 2019. 2. ๋…ธ๋ช…์ผ.Autonomous ships have gained a huge amount of interest in recent years, like their counterparts on land{autonomous cars, because of their potential to significantly lower the cost of operation, attract seagoing professionals and increase transportation safety. Technologies developed for the autonomous ships have potential to notably reduce maritime accidents where 75% cases can be attributed to human error and a significant proportion of these are caused by fatigue and attention deficit. However, developing a high-level autonomous system which can operate in an unstructured and unpredictable environment is still a challenging task. When the autonomous ships are operating in the congested waterway with other manned or unmanned vessels, the collision avoidance algorithm is the crucial point in keeping the safety of both the own ship and any encountered ships. Instead of developing new traffic rules for the autonomous ships to avoid collisions with each other, autonomous ships are expected to follow the existing guidelines based on the International Regulations for Preventing Collisions at Sea (COLREGs). Furthermore, when using the crane on the autonomous ship to transfer and install subsea equipment to the seabed, the heave and swaying phenomenon of the subsea equipment at the end of flexible wire ropes makes its positioning at an exact position is very difficult. As a result, an Anti-Motion Control (AMC) system for the crane is necessary to ensure the successful installation operation. The autonomous ship is highly relying on the effectiveness of autonomous systems such as autonomous path following system, collision avoidance system, crane control system and so on. During the previous two decades, considerable attention has been paid to develop robust autonomous systems. However, several are facing challenges and it is worthwhile devoting much effort to this. First of all, the development and testing of the proposed control algorithms should be adapted across a variety of environmental conditions including wave, wind, and current. This is one of the challenges of this work aimed at creating an autonomous path following and collision avoidance system in the ship. Secondly, the collision avoidance system has to comply with the regulations and rules in developing an autonomous ship. Thirdly, AMC system with anti-sway abilities for a knuckle boom crane remains problems regarding its under-actuated mechanism. At last, the performance of the control system should be evaluated in advance of the operation to perform its function successfully. In particular, such performance analysis is often very costly and time-consuming, and realistic conditions are typically impossible to establish in a testing environment. Consequently, to address these issues, we proposed a simulation framework with the following scenarios, which including the autonomous navigation scenario and crane operation scenario. The research object of this study is an autonomous offshore support vessel (OSV), which provides support services to offshore oil and gas field development such as offshore drilling, pipe laying, and oil producing assets (production platforms and FPSOs) utilized in EP (Exploration Production) activities. Assume that the autonomous OSV confronts an urgent mission under the harsh environmental conditions: on the way to an imperative offshore construction site, the autonomous OSV has to avoid target ships while following a predefined path. When arriving at the construction site, it starts to install a piece of subsea equipment on the seabed. So what technologies are needed, what should be invested for ensuring the autonomous OSV could robustly kilometers from shore, and how can an autonomous OSV be made at least as safe as the conventional ship. In this dissertation, we focus on the above critical activities for answering the above questions. In the general context of the autonomous navigation and crane control problem, the objective of this dissertation is thus fivefold: โ€ข Developing a COLREGs-compliant collision avoidance system. โ€ข Building a robust path following and collision avoidance system which can handle the unknown and complicated environment. โ€ข Investigating an efficient multi-ship collision avoidance method enable it easy to extend. โ€ข Proposing a hardware-in-the-loop simulation environment for the AHC system. โ€ข Solving the anti-sway problem of the knuckle boom crane on an autonomous OSV. First of all, we propose a novel deep reinforcement learning (RL) algorithm to achieve effective and efficient capabilities of the path following and collision avoidance system. To perform and verify the proposed algorithm, we conducted simulations for an autonomous ship under unknown environmental disturbance iiito adjust its heading in real-time. A three-degree-of-freedom dynamic model of the autonomous ship was developed, and the Line-of-sight (LOS) guidance system was used to converge the autonomous ship to follow the predefined path. Then, a proximal policy optimization (PPO) algorithm was implemented on the problem. By applying the advanced deep RL method, in which the autonomous OSV learns the best behavior through repeated trials to determine a safe and economical avoidance behavior in various circumstances. The simulation results showed that the proposed algorithm has the capabilities to guarantee collision avoidance of moving encountered ships while ensuring following a predefined path. Also, the algorithm demonstrated that it could manage complex scenarios with various encountered ships in compliance with COLREGs and have the excellent adaptability to the unknown, sophisticated environment. Next, the AMC system includes Anti-Heave Control (AHC) and Anti-Sway Control (ASC), which is applied to install subsea equipment in regular and irregular for performance analysis. We used the proportional-integral-derivative (PID) control method and the sliding mode control method respectively to achieve the control objective. The simulation results show that heave and sway motion could be significantly reduced by the proposed control methods during the construction. Moreover, to evaluate the proposed control system, we have constructed the HILS environment for the AHC system, then conducted a performance analysis of it. The simulation results show the AHC system could be evaluated effectively within the HILS environment. We can conclude that the proposed or adopted methods solve the problems issued in autonomous system design.ํ•ด์–‘ ์ž‘์—… ์ง€์›์„  (Offshore Support Vessel: OSV)์˜ ๊ฒฝ์šฐ ๊ทนํ•œ์˜ ํ™˜๊ฒฝ์—๋„ ๋ถˆ๊ตฌํ•˜๊ณ  ์ถœํ•ญํ•˜์—ฌ ํ•ด์ƒ์—์„œ ์ž‘์—…์„ ์ˆ˜ํ–‰ํ•ด์•ผ ํ•˜๋Š” ๊ฒฝ์šฐ๊ฐ€ ์žˆ๋‹ค. ์ด๋Ÿฌํ•œ ์œ„ํ—˜์—์˜ ๋…ธ์ถœ์„ ์ตœ์†Œํ™”ํ•˜๊ธฐ ์œ„ํ•ด ์ž์œจ ์šดํ•ญ์— ๋Œ€ํ•œ ์š”๊ตฌ๊ฐ€ ์ฆ๊ฐ€ํ•˜๊ณ  ์žˆ๋‹ค. ์—ฌ๊ธฐ์„œ์˜ ์ž์œจ ์šดํ•ญ์€ ์„ ๋ฐ•์ด ์ถœ๋ฐœ์ง€์—์„œ ๋ชฉ์ ์ง€๊นŒ์ง€ ์‚ฌ๋žŒ์˜ ๋„์›€ ์—†์ด ์ด๋™ํ•จ์„ ์˜๋ฏธํ•œ๋‹ค. ์ž์œจ ์šดํ•ญ ๋ฐฉ๋ฒ•์€ ๊ฒฝ๋กœ ์ถ”์ข… ๋ฐฉ๋ฒ•๊ณผ ์ถฉ๋Œ ํšŒํ”ผ ๋ฐฉ๋ฒ•์„ ํฌํ•จํ•œ๋‹ค. ์šฐ์„ , ์šดํ•ญ ๋ฐ ์ž‘์—… ์ค‘ ํ™˜๊ฒฝ ํ•˜์ค‘ (๋ฐ”๋žŒ, ํŒŒ๋„, ์กฐ๋ฅ˜ ๋“ฑ)์— ๋Œ€ํ•œ ๊ณ ๋ ค๋ฅผ ํ•ด์•ผ ํ•˜๊ณ , ๊ตญ์ œ ํ•ด์ƒ ์ถฉ๋Œ ์˜ˆ๋ฐฉ ๊ทœ์น™ (Convention of the International Regulations for Preventing Collisions at Sea, COLREGs)์— ์˜ํ•œ ์„ ๋ฐ•๊ฐ„์˜ ํ•ญ๋ฒ• ๊ทœ์ •์„ ๊ณ ๋ คํ•˜์—ฌ ์ถฉ๋Œ ํšŒํ”ผ ๊ทœ์น™์„ ์ค€์ˆ˜ํ•ด์•ผ ํ•œ๋‹ค. ํŠนํžˆ ์—ฐ๊ทผํ•ด์˜ ๋ณต์žกํ•œ ํ•ด์—ญ์—์„œ๋Š” ๋งŽ์€ ์„ ๋ฐ•์„ ์ž๋™์œผ๋กœ ํšŒํ”ผํ•  ํ•„์š”๊ฐ€ ์žˆ๋‹ค. ๊ธฐ์กด์˜ ํ•ด์„์ ์ธ ๋ฐฉ๋ฒ•์„ ์‚ฌ์šฉํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” ์„ ๋ฐ•๋“ค์— ๋Œ€ํ•œ ์ •ํ™•ํ•œ ์‹œ์Šคํ…œ ๋ชจ๋ธ๋ง์ด ๋˜์–ด์•ผ ํ•˜๋ฉฐ, ๊ทธ ๊ณผ์ •์—์„œ ๊ฒฝํ—˜ (experience)์— ์˜์กดํ•˜๋Š” ํŒŒ๋ผ๋ฏธํ„ฐ ํŠœ๋‹์ด ํ•„์ˆ˜์ ์ด๋‹ค. ๋˜ํ•œ, ํšŒํ”ผํ•ด์•ผ ํ•  ์„ ๋ฐ• ์ˆ˜๊ฐ€ ๋งŽ์•„์งˆ ๊ฒฝ์šฐ ์‹œ์Šคํ…œ ๋ชจ๋ธ์ด ์ปค์ง€๊ฒŒ ๋˜๊ณ  ๊ณ„์‚ฐ ์–‘๊ณผ ๊ณ„์‚ฐ ์‹œ๊ฐ„์ด ๋Š˜์–ด๋‚˜ ์‹ค์‹œ๊ฐ„ ์ ์šฉ์ด ์–ด๋ ต๋‹ค๋Š” ๋‹จ์ ์ด ์žˆ๋‹ค. ๋˜ํ•œ, ๊ฒฝ๋กœ ์ถ”์ข… ๋ฐ ์ถฉ๋Œ ํšŒํ”ผ๋ฅผ ํฌํ•จํ•˜์—ฌ ์ž์œจ ์šดํ•ญ ๋ฐฉ๋ฒ•์„ ์ ์šฉํ•˜๊ธฐ๊ฐ€ ์–ด๋ ต๋‹ค. ๋”ฐ๋ผ์„œ ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ๊ฐ•ํ™” ํ•™์Šต (Reinforcement Learning: RL) ๊ธฐ๋ฒ•์„ ์ด์šฉํ•˜์—ฌ ๊ธฐ์กด ํ•ด์„์ ์ธ ๋ฐฉ๋ฒ•์˜ ๋ฌธ์ œ์ ์„ ๊ทน๋ณตํ•  ์ˆ˜ ์žˆ๋Š” ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ•˜์˜€๋‹ค. ๊ฒฝ๋กœ๋ฅผ ์ถ”์ข…ํ•˜๋Š” ์„ ๋ฐ• (agent)์€ ์™ธ๋ถ€ ํ™˜๊ฒฝ (environment)๊ณผ ์ƒํ˜ธ์ž‘์šฉํ•˜๋ฉด์„œ ํ•™์Šต์„ ์ง„ํ–‰ํ•œ๋‹ค. State S_0 (์„ ๋ฐ•์˜ ์›€์ง์ž„๊ณผ ๊ด€๋ จ๋œ ๊ฐ์ข… ์ƒํƒœ) ๊ฐ€์ง€๋Š” agent๋Š” policy (ํ˜„์žฌ ์œ„์น˜์—์„œ ์–ด๋–ค ์›€์ง์ž„์„ ์„ ํƒํ•  ๊ฒƒ์ธ๊ฐ€)์— ๋”ฐ๋ผ action A_0 (์›€์ง์ผ ๋ฐฉํ–ฅ) ์ทจํ•œ๋‹ค. ์ด์— environment๋Š” agent์˜ ๋‹ค์Œ state S_1 ์„ ๊ณ„์‚ฐํ•˜๊ณ , ๊ทธ์— ๋”ฐ๋ฅธ ๋ณด์ƒ R_0 (ํ•ด๋‹น ์›€์ง์ž„์˜ ์ ํ•ฉ์„ฑ)์„ ๊ฒฐ์ •ํ•˜์—ฌ agent์—๊ฒŒ ์ „๋‹ฌํ•œ๋‹ค. ์ด๋Ÿฌํ•œ ์ž‘์—…์„ ๋ฐ˜๋ณตํ•˜๋ฉด์„œ ๋ณด์ƒ์ด ์ตœ๋Œ€๊ฐ€ ๋˜๋„๋ก policy๋ฅผ ํ•™์Šตํ•˜๊ฒŒ ๋œ๋‹ค. ํ•œํŽธ, ํ•ด์ƒ์—์„œ ํฌ๋ ˆ์ธ์„ ์ด์šฉํ•œ ์žฅ๋น„์˜ ์ด๋™์ด๋‚˜ ์„ค์น˜ ์ž‘์—… ์‹œ ์œ„ํ—˜์„ ์ค„์ด๊ธฐ ์œ„ํ•ด ํฌ๋ ˆ์ธ์˜ ๊ฑฐ๋™ ์ œ์–ด์— ๋Œ€ํ•œ ์š”๊ตฌ๊ฐ€ ์ฆ๊ฐ€ํ•˜๊ณ  ์žˆ๋‹ค. ํŠนํžˆ ํ•ด์ƒ์—์„œ๋Š” ์„ ๋ฐ•์˜ ์šด๋™์— ์˜ํ•ด ํฌ๋ ˆ์ธ์— ๋งค๋‹ฌ๋ฆฐ ๋ฌผ์ฒด๊ฐ€ ์ƒํ•˜ ๋™์š” (heave)์™€ ํฌ๋ ˆ์ธ์„ ๊ธฐ์ค€์œผ๋กœ ์ขŒ์šฐ ๋™์š” (sway)๊ฐ€ ๋ฐœ์ƒํ•˜๋Š”๋ฐ, ์ด๋Ÿฌํ•œ ์šด๋™์€ ์ž‘์—…์„ ์ง€์—ฐ์‹œํ‚ค๊ณ , ์ •ํ™•ํ•œ ์œ„์น˜์— ๋ฌผ์ฒด๋ฅผ ๋†“์ง€ ๋ชปํ•˜๊ฒŒ ํ•˜๋ฉฐ, ์ž์นซ ์ฃผ๋ณ€ ๊ตฌ์กฐ๋ฌผ๊ณผ์˜ ์ถฉ๋Œ์„ ์•ผ๊ธฐํ•  ์ˆ˜ ์žˆ๋‹ค. ์ด์™€ ๊ฐ™์€ ๋™์š”๋ฅผ ์ตœ์†Œํ™”ํ•˜๋Š” Anti-Motion Control (AMC) ์‹œ์Šคํ…œ์€ Anti-Heave Control (AHC)๊ณผ Anti-Sway Control (ASC)์„ ํฌํ•จํ•œ๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ํ•ด์–‘ ์ž‘์—… ์ง€์›์„ ์— ์ ํ•ฉํ•œ AMC ์‹œ์Šคํ…œ์˜ ์„ค๊ณ„ ๋ฐ ๊ฒ€์ฆ ๋ฐฉ๋ฒ•์„ ์—ฐ๊ตฌํ•˜์˜€๋‹ค. ๋จผ์ € ์ƒํ•˜ ๋™์š”๋ฅผ ์ตœ์†Œํ™”ํ•˜๊ธฐ ์œ„ํ•ด ํฌ๋ ˆ์ธ์˜ ์™€์ด์–ด ๊ธธ์ด๋ฅผ ๋Šฅ๋™์ ์œผ๋กœ ์กฐ์ •ํ•˜๋Š” AHC ์‹œ์Šคํ…œ์„ ์„ค๊ณ„ํ•˜์˜€๋‹ค. ๋˜ํ•œ, ๊ธฐ์กด์˜ ์ œ์–ด ์‹œ์Šคํ…œ์˜ ๊ฒ€์ฆ ๋ฐฉ๋ฒ•์€ ์‹ค์ œ ์„ ๋ฐ•์ด๋‚˜ ํ•ด์–‘ ๊ตฌ์กฐ๋ฌผ์— ํ•ด๋‹น ์ œ์–ด ์‹œ์Šคํ…œ์„ ์ง์ ‘ ์„ค์น˜ํ•˜๊ธฐ ์ „์—๋Š” ๊ทธ ์„ฑ๋Šฅ์„ ํ…Œ์ŠคํŠธํ•˜๊ธฐ๊ฐ€ ํž˜๋“ค์—ˆ๋‹ค. ์ด๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” Hardware-In-the-Loop Simulation (HILS) ๊ธฐ๋ฒ•์„ ํ™œ์šฉํ•˜์—ฌ AHC ์‹œ์Šคํ…œ์˜ ๊ฒ€์ฆ ๋ฐฉ๋ฒ•์„ ์—ฐ๊ตฌํ•˜์˜€๋‹ค. ๋˜ํ•œ, ASC ์‹œ์Šคํ…œ์„ ์„ค๊ณ„ํ•  ๋•Œ ์ œ์–ด ๋Œ€์ƒ์ด under-actuated ์‹œ์Šคํ…œ์ด๊ธฐ ๋•Œ๋ฌธ์— ์ œ์–ดํ•˜๊ธฐ๊ฐ€ ๋งค์šฐ ์–ด๋ ต๋‹ค. ๋”ฐ๋ผ์„œ ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” sliding mode control ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ด์šฉํ•˜๋ฉฐ ๋‹ค๊ด€์ ˆ ํฌ๋ ˆ์ธ (knuckle boom crane)์˜ ๊ด€์ ˆ (joint) ๊ฐ๋„๋ฅผ ์ œ์–ดํ•˜์—ฌ ์ขŒ์šฐ ๋™์š”๋ฅผ ์ค„์ผ ์ˆ˜ ์žˆ๋Š” ASC ์‹œ์Šคํ…œ์„ ์„ค๊ณ„ํ•˜์˜€๋‹ค.Chapter 1 Introduction 1 1.1 Background and Motivation . . . . . . . . . . . . . . . . . . . . . 1 1.2 Requirements for Autonomous Operation . . . . . . . . . . . . . 5 1.2.1 Path Following for Autonomous Ship . . . . . . . . . . . . 5 1.2.2 Collision Avoidance for Autonomous Ship . . . . . . . . . 5 1.2.3 Anti-Motion Control System for Autonomous Ship . . . . 6 1.3 Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 1.3.1 Related Work for Path Following System . . . . . . . . . 9 1.3.2 Related Work for Collision Avoidance System . . . . . . . 9 1.3.3 Related Work for Anti-Heave Control System . . . . . . . 13 1.3.4 Related Work for Anti-Sway Control System . . . . . . . 14 1.4 Configuration of Simulation Framework . . . . . . . . . . . . . . 16 1.4.1 Application Layer . . . . . . . . . . . . . . . . . . . . . . 16 1.4.2 Autonomous Ship Design Layer . . . . . . . . . . . . . . . 17 1.4.3 General Technique Layer . . . . . . . . . . . . . . . . . . 17 1.5 Contributions (Originality) . . . . . . . . . . . . . . . . . . . . . 19 Chapter 2 Theoretical Backgrounds 20 2.1 Maneuvering Model for Autonomous Ship . . . . . . . . . . . . . 20 2.1.1 Kinematic Equation for Autonomous Ship . . . . . . . . . 20 2.1.2 Kinetic Equation for Autonomous Ship . . . . . . . . . . 21 2.2 Multibody Dynamics Model for Knuckle Boom Crane of Autonomous Ship. . . 25 2.2.1 Embedding Techniques . . . . . . . . . . . . . . . . . . . . 25 2.3 Control System Design . . . . . . . . . . . . . . . . . . . . . . . . 31 2.3.1 Proportional-Integral-Derivative (PID) Control . . . . . . 31 2.3.2 Sliding Mode Control . . . . . . . . . . . . . . . . . . . . 31 2.4 Deep Reinforcement Learning Algorithm . . . . . . . . . . . . . . 34 2.4.1 Value Based Learning Method . . . . . . . . . . . . . . . 36 2.4.2 Policy Based Learning Method . . . . . . . . . . . . . . . 37 2.4.3 Actor-Critic Method . . . . . . . . . . . . . . . . . . . . . 41 2.5 Hardware-in-the-Loop Simulation . . . . . . . . . . . . . . . . . . 43 2.5.1 Integrated Simulation Method . . . . . . . . . . . . . . . 43 Chapter 3 Path Following Method for Autonomous OSV 46 3.1 Guidance System . . . . . . . . . . . . . . . . . . . . . . . . . . . 46 3.1.1 Line-of-sight Guidance System . . . . . . . . . . . . . . . 46 3.2 Deep Reinforcement Learning for Path Following System . . . . . 50 3.2.1 Deep Reinforcement Learning Setup . . . . . . . . . . . . 50 3.2.2 Neural Network Architecture . . . . . . . . . . . . . . . . 56 3.2.3 Training Process . . . . . . . . . . . . . . . . . . . . . . . 58 3.3 Implementation and Simulation Result . . . . . . . . . . . . . . . 62 3.3.1 Implementation for Path Following System . . . . . . . . 62 3.3.2 Simulation Result . . . . . . . . . . . . . . . . . . . . . . 65 3.4 Comparison Results . . . . . . . . . . . . . . . . . . . . . . . . . 83 3.4.1 Comparison Result of PPO with PID . . . . . . . . . . . 83 3.4.2 Comparison Result of PPO with Deep Q-Network (DQN) 87 Chapter 4 Collision Avoidance Method for Autonomous OSV 89 4.1 Deep Reinforcement Learning for Collision Avoidance System . . 89 4.1.1 Deep Reinforcement Learning Setup . . . . . . . . . . . . 89 4.1.2 Neural Network Architecture . . . . . . . . . . . . . . . . 93 4.1.3 Training Process . . . . . . . . . . . . . . . . . . . . . . . 94 4.2 Implementation and Simulation Result . . . . . . . . . . . . . . . 95 4.2.1 Implementation for Collision Avoidance System . . . . . . 95 4.2.2 Simulation Result . . . . . . . . . . . . . . . . . . . . . . 100 4.3 Implementation and Simulation Result for Multi-ship Collision Avoidance Method . . . . . . . . . . . . . . . . . . . . . . . . . . 107 4.3.1 Limitations of Multi-ship Collision Avoidance Method - 1 107 4.3.2 Limitations of Multi-ship Collision Avoidance Method - 2 108 4.3.3 Implementation of Multi-ship Collision Avoidance Method 110 4.3.4 Simulation Result of Multi-ship Collision Avoidance Method 118 Chapter 5 Anti-Motion Control Method for Knuckle Boom Crane 129 5.1 Configuration of HILS for Anti-Heave Control System . . . . . . 129 5.1.1 Virtual Mechanical System . . . . . . . . . . . . . . . . . 132 5.1.2 Virtual Sensor and Actuator . . . . . . . . . . . . . . . . 138 5.1.3 Control System Design . . . . . . . . . . . . . . . . . . . . 141 5.1.4 Integrated Simulation Interface . . . . . . . . . . . . . . . 142 5.2 Implementation and Simulation Result of HILS for Anti-Heave Control System . . . . . . . . 145 5.2.1 Implementation of HILS for Anti-Heave Control System . 145 5.2.2 Simulation Result of HILS for Anti-Heave Control System 146 5.3 Validation of HILS for Anti-Heave Control System . . . . . . . . 159 5.3.1 Hardware Setup . . . . . . . . . . . . . . . . . . . . . . . 159 5.3.2 Comparison Result . . . . . . . . . . . . . . . . . . . . . . 161 5.4 Configuration of Anti-Sway Control System . . . . . . . . . . . . 162 5.4.1 Mechanical System for Knuckle Boom Crane . . . . . . . 162 5.4.2 Anti-Sway Control System Design . . . . . . . . . . . . . 165 5.4.3 Implementation and Simulation Result of Anti-Sway Control . . . . . . . . . . . . . . 168 Chapter 6 Conclusions and Future Works 176 Bibliography 178 Chapter A Appendix 186 ๊ตญ๋ฌธ์ดˆ๋ก 188Docto

    Autonomous Drone Landings on an Unmanned Marine Vehicle using Deep Reinforcement Learning

    Get PDF
    This thesis describes with the integration of an Unmanned Surface Vehicle (USV) and an Unmanned Aerial Vehicle (UAV, also commonly known as drone) in a single Multi-Agent System (MAS). In marine robotics, the advantage offered by a MAS consists of exploiting the key features of a single robot to compensate for the shortcomings in the other. In this way, a USV can serve as the landing platform to alleviate the need for a UAV to be airborne for long periods time, whilst the latter can increase the overall environmental awareness thanks to the possibility to cover large portions of the prevailing environment with a camera (or more than one) mounted on it. There are numerous potential applications in which this system can be used, such as deployment in search and rescue missions, water and coastal monitoring, and reconnaissance and force protection, to name but a few. The theory developed is of a general nature. The landing manoeuvre has been accomplished mainly identifying, through artificial vision techniques, a fiducial marker placed on a flat surface serving as a landing platform. The raison d'etre for the thesis was to propose a new solution for autonomous landing that relies solely on onboard sensors and with minimum or no communications between the vehicles. To this end, initial work solved the problem while using only data from the cameras mounted on the in-flight drone. In the situation in which the tracking of the marker is interrupted, the current position of the USV is estimated and integrated into the control commands. The limitations of classic control theory used in this approached suggested the need for a new solution that empowered the flexibility of intelligent methods, such as fuzzy logic or artificial neural networks. The recent achievements obtained by deep reinforcement learning (DRL) techniques in end-to-end control in playing the Atari video-games suite represented a fascinating while challenging new way to see and address the landing problem. Therefore, novel architectures were designed for approximating the action-value function of a Q-learning algorithm and used to map raw input observation to high-level navigation actions. In this way, the UAV learnt how to land from high latitude without any human supervision, using only low-resolution grey-scale images and with a level of accuracy and robustness. Both the approaches have been implemented on a simulated test-bed based on Gazebo simulator and the model of the Parrot AR-Drone. The solution based on DRL was further verified experimentally using the Parrot Bebop 2 in a series of trials. The outcomes demonstrate that both these innovative methods are both feasible and practicable, not only in an outdoor marine scenario but also in indoor ones as well

    Research on multi-agent-based shipping information system

    Get PDF

    Proceedings, MSVSCC 2013

    Get PDF
    Proceedings of the 7th Annual Modeling, Simulation & Visualization Student Capstone Conference held on April 11, 2013 at VMASC in Suffolk, Virginia

    Spatial planning for fisheries in the Northern Adriatic: working toward viable and sustainable fishing

    Get PDF
    none10siGiven the great overfishing of the demersal resources in the Northern Adriatic Sea (geographical sub-area [GSA] 17), along with the fishing pressure in marine habitats, evidence strongly supports the need to evaluate appropriate management approaches. Several fishing activities operate simultaneously in the area, and the need to minimize conflicts among them is also a social concern. We applied a spatially and temporally explicit fish and fisheries model to assess the impact of a suite of spatial plans suggested by practitioners that could reduce the pressure on the four demersal stocks of high commercial interest in the GSA 17 and that could promote space sharing between mutually exclusive activities. We found that excluding trawlers from some areas has lowered the effective fishing effort, resulting in some economic losses but providing benefit to the set netters. Not every simulated fishing vessel is impacted in the same way because some fishing communities experienced different economic opportunities, particularly when a 6-nautical mile buffer zone from the coast was implemented in the vicinity of important fishing grounds. Along this buffer zone, the four stocks were only slightly benefiting from the protection of the area and from fewer discards. In contrast, assuming a change in the ability of the population to disperse led to a large effect: Some fish became accessible in the coastal waters, therefore increasing the landings for rangelimited fishers, but the discard rate of fish also increased, greatly impairing the long-term biomass levels. Our evaluation, however, confirmed that no effort is displaced onto vulnerable benthic habitats and to grounds not suitable for the continued operation of fishing. We conclude that the tested spatial management is helpful, but not sufficient to ensure sustainable fishing in the area, and therefore, additional management measures should be taken. Our test platform investigates the interaction between fish and fisheries at a fine geographical scale and simulates data for varying fishing methods and from different harbor communities in a unified framework. We contribute to the development of effective science-based inputs to facilitate policy improvement and better governance while evaluating trade-offs in fisheries management and marine spatial planning.noneBastardie, Francois; Angelini, Silvia; Bolognini, Luca; Fuga, Federico; Manfredi, Chiara; Martinelli, Michela; Nielsen, J. Rasmus; Santojanni, Alberto; Scarcella, Giuseppe; Grati, FabioBastardie, Francois; Angelini, Silvia; Bolognini, Luca; Fuga, Federico; Manfredi, Chiara; Martinelli, Michela; Nielsen, J. Rasmus; Santojanni, Alberto; Scarcella, Giuseppe; Grati, Fabi

    Underwater swarm robotics: Challenges and opportunities

    Full text link
    Underwater swarm robotics today faces a series of challenges unique to its aquatic environment. This chapter explores some possible applications of underwater swarm robotics and its challenges. Those challenges include the environment itself, sensor types required, problems with communication and the difficulty in localisation. It notes the serious challenges in underwater communication is that radio communications is practically non-existent in the underwater realm. Localisation also becomes problematic due to the lack of radio waves as GPS cannot be used. It also looks at the platforms required by underwater robots and includes a possible low-cost platform. Also explored is a method of swarm robotics control known as consensus control. It shows possible solutions to the challenges and where swarm robotics may head

    Cooperative Localization in Mobile Underwater Acoustic Sensor Networks

    Get PDF
    Die groรŸflรคchige Erkundung und รœberwachung von Tiefseegebieten gewinnt mehr und mehr an Bedeutung fรผr Industrie und Wissenschaft. Diese schwer zugรคnglichen Areale in der Tiefsee kรถnnen nur mittels Teams unbemannter Tauchbote effizient erkundet werden. Aufgrund der hohen Kosten, war bisher ein Einsatz von mehreren autonomen Unterwasserfahrzeugen (AUV) wirtschaftlich undenkbar, wodurch AUV-Teams nur in Simulationen erforscht werden konnten. In den letzten Jahren konnte jedoch eine Entwicklung hin zu gรผnstigeren und robusteren AUVs beobachtet werden. Somit wird der Einsatz von AUV-Teams in Zukunft zu einer realen Option. Die wachsende Nachfrage nach Technologien zur Unterwasseraufklรคrung und รœberwachung konnte diese Entwicklung noch zusรคtzlich beschleunigen. Eine der grรถรŸten technischen Hรผrden fรผr tief tauchende AUVs ist die Unterwasserlokalisierug. Satelitengestรผtzte Navigation ist in der Tiefe nicht mรถglich, da Radiowellen bereits nach wenigen Metern im Wasser stark an Intensitรคt verlieren. Daher mรผssen neue Ansรคtze fรผr die Unterwasserlokalisierung entwickelt werden die sich auch fรผr Fahrzeugenverbรคnde skalieren lassen. Der Einsatz von AUV-Teams ermรถglicht nicht nur vรถllig neue Mรถglichkeiten der Kooperation, sondern erlaubt auch jedem einzelnen AUV von den Navigationsdaten der anderen Fahrzeuge im Verband zu profitieren, um die eigene Lokalisierung zu verbessern. In dieser Arbeit wird ein kooperativer Lokalisierungsansatz vorgestellt, welcher auf dem Nachrichtenaustausch durch akustische Ultra-Short Base-Line (USBL) Modems basiert. Ein akustisches Modem ermรถglicht die รœbertragung von Datenpaketen im Wasser, wรคrend ein USBL-Sensor die Richtung einer akustischen Quelle bestimmen kann. Durch die Kombination von Modem und Sensor entsteht ein wichtiges Messinstrument fรผr die Unterwasserlokalisierung. Wenn ein Fahrzeug ein Datenpaket mit seiner eignen Position aussendet, kรถnnen andere Fahrzeuge mit einem USBL-Modem diese Nachricht empfangen. In Verbindung mit der Richtungsmessung zur Quelle, kรถnnen diese Daten von einem Empfangenden AUV verwendet werden, um seine eigene Positionsschatzung zu verbessern. Diese Arbeit schlรคgt einen Ansatz zur Fusionierung der empfangenen Nachricht mit der Richtungsmessung vor, welcher auch die jeweiligen Messungenauigkeiten berรผcksichtigt. Um die Messungenauigkeit des komplexen USBL-Sensors bestimmen zu kรถnnen, wurde zudem ein detailliertes Sensormodell entwickelt. Zunรคchst wurden existierende Ansรคtze zur kooperativen Lokalisierung (CL) untersucht, um daraus eine Liste von erwรผnschten Eigenschaften fรผr eine CL abzuleiten. Darauf aufbauend wurde der Deep-Sea Network Lokalisation (DNL) Ansatz entwickelt. Bei DNL handelt es sich um eine CL Methode, bei der die Skalierbarkeit sowie die praktische Anwendbarkeit im Fokus stehen. DNL ist als eine Zwischenschicht konzipiert, welche USBL-Modem und Navigationssystem miteinander verbindet. Es werden dabei Messwerte und Kommunikationsdaten des USBL zu einer Standortbestimmung inklusive Richtungsschรคtzung fusioniert und an das Navigationssystem weiter geleitet, รคhnlich einem GPS-Sensor. Die Funktionalitรคt von USBL-Modell und DNL konnten evaluiert werden anhand von Messdaten aus Seeerprobungen in der Ostsee sowie im Mittelatlantik. Die Qualitรคt einer CL hangt hรคufig von vielen unterschiedlichen Faktoren ab. Die Netzwerktopologie muss genauso berรผcksichtig werden wie die Lokalisierungsfรคhigkeiten jedes einzelnen Teilnehmers. Auch das Kommunikationsverhalten der einzelnen Teilnehmer bestimmt, welche Informationen im Netzwerk vorhanden sind und hat somit einen starken Einfluss auf die CL. Um diese Einflussfaktoren zu untersuchen, wurden eine Reihe von Szenarien simuliert, in denen Kommunikationsverhalten und Netzwerktopologie fรผr eine Gruppe von AUVs variiert wurden. In diesen Experimenten wurden die AUVs durch ein Oberflรคchenfahrzeug unterstรผtzt, welches seine geo-referenzierte Position รผber DNL an die getauchten Fahrzeuge weiter leitete. Anhand der untersuchten Topologie kรถnnen die Experimente eingeteilt werden in Single-Hop und Multi-Hop. Single-Hop bedeutet, dass jedes AUV sich in der Sendereichweite des Oberflรคchenfahrzeugs befindet und dessen Positionsdaten auf direktem Wege erhรคlt. Wie die Ergebnisse der Single-Hop Experimente zeigen, kann der Lokalisierungsfehler der AUVs eingegrenzt werden, wenn man DNL verwendet. Dabei korreliert der Lokalisierungsfehler mit der kombinierten Ungenauigkeit von USBL-Messung und Oberflรคchenfahrzeugposition. Bei den Multi-Hop Experimenten wurde die Topologie so geรคndert, dass sich nur eines der AUVs in direkter Sendereichweite des Oberflรคchenfahrzeugs befindet. Dieses AUV verbessert seine Position mit den empfangen Daten des Oberflรคchenfahrzeugs und sendet wiederum seine verbesserte Position an die anderen AUVs. Auch hier konnte gezeigt werden, dass sich der Lokalisierungfehler der Gruppe mit DNL einschrรคnken lรคsst. ร„ndert man nun das Schema der Kommunikation so, dass alle AUVs zyklisch ihre Position senden, zeigte sich eine Verschlechterung der Lokalisierungsqualitรคt der Gruppe. Dieses unerwartet Ergebnis konnte auf einen Teil des DNL-Algorithmus zurรผck gefรผhrt werden. Da die verwendete USBL-Klasse nur die Richtung eines Signals misst, nicht jedoch die Entfernung zum Sender, wird in der DNL-Schicht eine Entfernungsschatzung vorgenommen. Wenn die Kommunikation nicht streng unidirektional ist, entsteht eine Ruckkopplungsschleife, was zu fehlerhaften Entfernungsschatzungen fรผhrt. Im letzten Experiment wird gezeigt wie sich dieses Problem vermeiden lasst, mithilfe einer relativ neue USBL-Klasse, die sowohl Richtung als auch Entfernung zum Sender misst. Die zwei wesentlichen Beitrรคge dieser Arbeit sind das USBL-Model zum einen und zum Anderen, der neue kooperative Lokalisierungsansatz DNL. Mithilfe des Sensormodels lassen sich nicht nur Messabweichungen einer USBL-Messung bestimmen, es kann auch dazu genutzt werden, einige Fehlereinflรผsse zu korrigieren. Mit DNL wurde eine skalierbare CL-Methode entwickelt, die sich gut fรผr den den Einsatz bei mobilen Unterwassersensornetzwerken eignet. Durch das Konzept als Zwischenschicht, lasst sich DNL einfach in bestehende Navigationslรถsungen integrieren, um die Langzeitstabilitรคt der Navigation fรผr groรŸe Verbรคnde von tiefgetauchten Fahrzeugen zu gewรคhrleisten. Sowohl USBL-Model als auch DNL sind dabei so ressourcenschonend, dass sie auf dem Computer eines Standard USBL laufen kรถnnen, ohne die ursprรผngliche Funktionalitรคt einzuschrรคnken, was den praktischen Einsatz zusรคtzlich vereinfacht
    • โ€ฆ
    corecore