14 research outputs found

    Dual Control for Exploitation and Exploration (DCEE) in Autonomous Search

    Full text link
    This paper proposes an optimal autonomous search framework, namely Dual Control for Exploration and Exploitation (DCEE), for a target at unknown location in an unknown environment. Source localisation is to find sources of atmospheric hazardous material release in a partially unknown environment. This paper proposes a control theoretic approach to this autonomous search problem. To cope with an unknown target location, at each step, the target location is estimated by Bayesian inference. Then a control action is taken to minimise the error between future robot position and the hypothesised future estimation of the target location. The latter is generated by hypothesised measurements at the corresponding future robot positions (due to the control action) with the current estimation of the target location as a prior. It shows that this approach can take into account both the error between the next robot position and the estimate of the target location, and the uncertainty of the estimate. This approach is further extended to the case with not only an unknown source location, but also an unknown local environment (e.g. wind speed and direction). Different from current information theoretic approaches, this new control theoretic approach achieves the optimal trade-off between exploitation and exploration in a unknown environment with an unknown target by driving the robot moving towards estimated target location while reducing its estimation uncertainty. This scheme is implemented using particle filtering on a mobile robot. Simulation and experimental studies demonstrate promising performance of the proposed approach. The relationships between the proposed approach, informative path planning, dual control, and classic model predictive control are discussed and compared

    Autonomous search of an airborne release in urban environments using informed tree planning

    Full text link
    The use of autonomous vehicles for chemical source localisation is a key enabling tool for disaster response teams to safely and efficiently deal with chemical emergencies. Whilst much work has been performed on source localisation using autonomous systems, most previous works have assumed an open environment or employed simplistic obstacle avoidance, separate to the estimation procedure. In this paper, we explore the coupling of the path planning task for both source term estimation and obstacle avoidance in a holistic framework. The proposed system intelligently produces potential gas sampling locations based on the current estimation of the wind field and the local map. Then a tree search is performed to generate paths toward the estimated source location that traverse around any obstacles and still allow for exploration of potentially superior sampling locations. The proposed informed tree planning algorithm is then tested against the Entrotaxis technique in a series of high fidelity simulations. The proposed system is found to reduce source position error far more efficiently than Entrotaxis in a feature rich environment, whilst also exhibiting vastly more consistent and robust results

    On the use of autonomous unmanned vehicles in response to hazardous atmospheric release incidents

    Get PDF
    Recent events have induced a surge of interest in the methods of response to releases of hazardous materials or gases into the atmosphere. In the last decade there has been particular interest in mapping and quantifying emissions for regulatory purposes, emergency response, and environmental monitoring. Examples include: responding to events such as gas leaks, nuclear accidents or chemical, biological or radiological (CBR) accidents or attacks, and even exploring sources of methane emissions on the planet Mars. This thesis presents a review of the potential responses to hazardous releases, which includes source localisation, boundary tracking, mapping and source term estimation. [Continues.]</div

    AID-RL: Active information-directed reinforcement learning for autonomous source seeking and estimation

    Get PDF
    This paper proposes an active information-directed reinforcement learning (AID-RL) framework for autonomous source seeking and estimation problem. Source seeking requires the search agent to move towards the true source, and source estimation demands the agent to maintain and update its knowledge regarding the source properties such as release rate and source position. These two objectives give rise to the newly developed framework, namely, dual control for exploration and exploitation. In this paper, the greedy RL forms an exploitation search strategy that navigates the agent to the source position, while the information-directed search commands the agent to explore most informative positions to reduce belief uncertainty. Extensive results are presented using a high-fidelity dataset for autonomous search, which validates the effectiveness of the proposed AID-RL and highlights the importance of active exploration in improving sampling efficiency and search performance

    Concurrent Active Learning in Autonomous Airborne Source Search: Dual Control for Exploration and Exploitation

    Get PDF
    In this paper, a concurrent learning framework is developed for source search in an unknown environment using autonomous platforms equipped with onboard sensors. Distinct from the existing solutions that require significant computational power for Bayesian estimation and path planning, the proposed solution is computationally affordable for onboard processors. A new concept of concurrent learning using multiple parallel estimators is proposed to learn the operational environment and quantify estimation uncertainty. The search agent is empowered with dual capability of exploiting current estimated parameters to track the source and probing the environment to reduce the impacts of uncertainty, namely Concurrent Learning based Dual Control for Exploration and Exploitation (CL-DCEE). In this setting, the control action not only minimises the tracking error between future agent's position and estimated source location, but also the uncertainty of predicted estimation. More importantly, the rigorous proven properties such as the convergence of CL-DCEE algorithm are established under mild assumptions on noises, and the impact of noises on the search performance is examined. Simulation results are provided to validate the effectiveness of the proposed CL-DCEE algorithm. Compared with the information-theoretic approach, CL-DCEE not only guarantees convergence, but produces better search performance and consumes much less computational time

    Reinforcement learning for source location estimation: a multi-step approach

    Get PDF
    Gas leaks present an undeniable safety concern, the ability to swiftly and accurately detect the source of a leak and pertinent details is critical for effective emergency response. The limited precision of sensors and environmental noise introduce significant uncertainty and randomness, complicating the resolution of such issues. To address these challenges, this study introduces a new approach that integrates multi-step deep reinforcement learning algorithms with Bayesian inference to estimate source information. Compared to single-step Reinforcement Learning and Entrotaxis methods, the multi-step update mechanism in this problem allows the agent to locate sources position more efficiently. This approach not only increases the search’s success rate but also decreases the number of time steps needed for successful detection. Experiments conducted in continuous and discrete environments of equal scale and parameters corroborate the efficiency of our method in tracing the source of gas leaks.<br/

    Cooperative Active Learning based Dual Control for Exploration and Exploitation in Autonomous Search

    Get PDF
    In this paper, a multi-estimator based computationally efficient algorithm is developed for autonomous search in an unknown environment with an unknown source. Different from the existing approaches that require massive computational power to support nonlinear Bayesian estimation and complex decision-making process, an efficient cooperative active learning based dual control for exploration and exploitation (COAL-DCEE) is developed for source estimation and path planning. Multiple cooperative estimators are deployed for environment learning process, which is helpful to improving the search performance and robustness against noisy measurements. The number of estimators used in COAL-DCEE is much smaller than that of particles required for Bayesian estimation in information-theoretic approaches. Consequently, the computational load is significantly reduced. As an important feature of this study, the convergence and performance of COAL-DCEE are established in relation to the characteristics of sensor noises and turbulence disturbances. Numerical and experimental studies have been carried out to verify the effectiveness of the proposed framework. Compared with existing approaches, COAL-DCEE not only provides convergence guarantee, but also yields comparable search performance using much less computational power

    ????????? ????????? ????????? ???????????? ????????? ?????? ?????? ??? ?????? ??????

    Get PDF
    Department of Mechanical EngineeringThe potential danger of invisible hazardous substance leakage accident is increasing, such as hazardous chemical leakage accidents in industrial complexes, potential risks of aging nuclear power plants, and international chemical terrorism threats. In particular, hazardous chemical, biological, or radioactive substances leaked into the atmosphere cause irreversible damage to nature, and there is a risk of human damage if prompt action is not taken. Therefore, estimating the emission source and the amount of invisible hazardous substances is required to minimize human casualties and increase public safety. As the risk of hazardous material leakage and potential terrorism increases in random places, it is difficult using traditional systems such as pre-installed ground sensors in a specific area. This thesis proposes autonomous search method for estimating the source of hazardous materials using a mobile sensor attached to an unmanned aerial vehicle (UAV). Since the mobile sensor can be freely deployed to any arbitrary places, it is possible to monitor a wider area with a relatively low cost. Besides, this approach is an unmanned autonomous system, so it has the advantage of minimizing secondary human casualties that may additionally occur during search. The source term estimation (STE) using mobile sensors is considered to be a challenging problem because the sensor measurements from atmospheric gas dispersion are sparse, intermittent, and time-varying due to the turbulence and the sensor noise. Thus, Bayesian inference-based estimation technique, sequential Monte Carlo method (i.e., particle filter), is used to estimate the source by using the inaccurate measurements which is easily influenced by air turbulence and sensor noise in this thesis. The autonomous search algorithms using information theory are also proposed. In the proposed algorithms, the information entropy (i.e., uncertainty of estimation) is calculated by using information theory and the agent choose the action to move to the next sensing location that can minimize the expected uncertainty. In other words, the proposed information-theoretic search algorithm is reward-based decision making approaches that use information entropy as a reward. The receding horizon and Gaussian mixture model clustering approaches are adopted to improve the search performance in various environment. Since the time required to compute all of the respective rewards increases as the number of action candidates increases, the policy-based autonomous source term search and estimation algorithm is proposed using deep neural network and reinforcement learning approach to determine efficient search path considering continuous action space. Furthermore, this thesis proposes a cooperative search method for multiple unmanned mobile vehicles based on game theory. The inaccuracy of sensor measurement values can be reduced by using multiple mobile sensors with the fusion approach, so the source of hazardous substances can be quickly estimated. The negotiation based on the game theory can improve the group search performance for source term estimation and search. Finally, to verify the performance of the proposed algorithm, numerical simulation and flight test results using an actual gas measurement sensor and multicopter drone are presented.ope

    On gas source declaration methods for single-robot search

    Get PDF
    Source declaration, along with plume finding and plume tracking, is one of the needed processes for gas source localization (GSL). It is a fundamental part of the search, since it is responsible to decide whether the gas source has been found, and also to pinpoint its location. Despite its importance, source declaration is often ignored in most of the GSL research, the criteria for termination being selected in a seemingly arbitrary manner, or even not being discussed at all. A clear example of this is the large number of experiments in the literature that are declared concluded whenever the robot manages to physically reach the source, without formally declaring it. In this work, we seek to provide an overview of the most significant declaration methods that have been used in state-of-the-art research for single-robot GSL, analyzing their strengths and weaknesses. We also provide a preliminary experimental validation of these methods, focusing on how stable their performance is when their input parameters are modified.Universidad de Málaga. Campus de Excelencia Internacional Andalucía Tec

    Information-based path planning for source term estimation using an unmanned aerial vehicle: algorithms and experiments

    Get PDF
    Department of Mechanical Enginering (Mechanical Engineering)Searching and estimating source information such as the location and release rate, called a source term, have many applications across environmental, medical, and security domains. For autonomous source search and estimation in a turbulent environment, this thesis presents two information-theoretic search strategies. Firstly, Gaussian mixture model (GMM) based infotaxis, termed as GMM-Infotaxis, is presented. The GMM is used to determine the action candidates for the next best informative sampling position in a continuous domain by appropriately clustering possible source locations obtained from the particle filter, compared with Infotaxis using discrete action candidates. This facilitates the better trade-off between exploitation and exploration for search, resulting in more efficient search and better estimation performance. However, GMM-Infotaxis has limitations in complex environments with many obstacles such as urban area, as this approach only predicts one step ahead action and the obstacles prevent efficient search. To address this problem, Infotaxis combined with the Rapidly-exploring Random Trees (RRT) is proposed and termed as RRT-Infotaxis. By introducing new utility function which is designed to maximize entropy reduction and minimize searching path at the same time, RRT-Infotaxis has advantage of searching efficient path in obstacle-rich environments. With proposed utility function, this approach is designed not only to avoid obstacles but also to sample the next best sampling positions considering several steps ahead in a continuous domain. Numerical simulations for both strategies, GMM-Infotaxis and RRT-Infotaxis, are implemented to prove the enhanced performance compared to the conventional Infotaxis. Numerical simulations show that in an open space the performance of GMM-Infotaxis is better than the conventional Infotaxis and in various urban environments RRT-Infotaxis outperforms both original Infotaxis and GMM-Infotaxis. Besides, real outdoor flight experiments using a multirotor UAV in an open space for GMM-Infotaxis are conducted. It shows the superior performance of the GMM-Infotaxis compared with the original Infotaxis method.ope
    corecore