20 research outputs found
An Intelligent System for Parking Trailer using Reinforcement Learning and Type 2 fuzzy Logic
Abstract: In examples of reinforcement learning where state space is continuous, it seems impossible to use reference tables to store value-action .In these problems a method is required for value estimation for each state-action pair .The inputs to this estimation system are (characteristics of) state variables which reflect the status of agent in the environment .The system can be either linear of nonlinear .For each member in set of actions of an agent, there exists an estimation system which determines state value for the action .On the other hand, in most real world problems, just as the state space is continuous, so is the action space for an agent .In these cases, type 2 type 2 fuzzy systems may provide a useful solution in selection of final action from action space .In this paper we intend to combine reinforcement learning algorithm with fuzzified actions and state space along with a linear estimation system into an intelligent systems for parking Trailers in cases where both state and action spaces are continuous .Finally, the successful performance of the proposed algorithm is shown through simulations on trailer parking problem
Hybrid Extended Kalman Filtering and Noise Statistics Optimization for Produce Wash State Estimation
Food-borne diseases associated with fresh produce consistently cause serious public health issues. Although sanitization measures are utilized to enhance the safety of fresh produce, strategies that neglect the dynamic nature of commercial wash processes are limited, creating the potential for pathogen cross-contamination and major disease outbreaks. In light of this risk, there is an urgent need for new control approaches during produce washing to reduce the probability of outbreaks. As an important step in this direction, a hybrid extended Kalman filter (HEKF) and particle swarm optimization (PSO)-based noise statistics optimization are designed for a produce wash system. The HEKF uses discrete-time free chlorine (FC) measurements, and PSO is used to optimize the noise statistics of the process noise model. The process model and HEKF enable the estimation of chemical oxygen demand (COD) in the water wash, FC concentration, Escherichia coli concentration (PC) in the water wash, and E. coli level (P) on the lettuce. Although control is not explicitly addressed in this paper, the estimation technique proposed here will enable not only monitoring but also advanced control methods. The HEKF is applied to estimate E. coli O157:H7 contamination of shredded lettuce during an industrial wash. The HEKF estimates COD with a root mean square error (RSME) of 8.24 mg/L, FC concentration with an RMSE of 0.09 mg/L, PC in the wash water with an RMSE of 0.19 MPN/ml, and P on the lettuce with an RSME of 0.04 MPN/g. A sensitivity analysis demonstrates that the estimator has good robustness
An Extensive Set of Kinematic and Kinetic Data for Individuals with Intact Limbs and Transfemoral Prosthesis Users
This paper introduces an extensive human motion data set for typical activities of daily living. These data are crucial for the design and control of prosthetic devices for transfemoral prosthesis users. This data set was collected from seven individuals, including five individuals with intact limbs and two transfemoral prosthesis users. These data include the following types of movements: (1) walking at three different speeds; (2) walking up and down a 5-degree ramp; (3) stepping up and down; (4) sitting down and standing up. We provide full-body marker trajectories and ground reaction forces (GRFs) as well as joint angles, joint velocities, joint torques, and joint powers. This data set is publicly available at the website referenced in this paper. Data from flexion and extension of the hip, knee, and ankle are presented in this paper. However, the data accompanying this paper (available on the internet) include 46 distinct measurements and can be useful for validating or generating mathematical models to simulate the gait of both transfemoral prosthesis users and individuals with intact legs
Self-Supervised Traversability Prediction by Learning to Reconstruct Safe Terrain
Navigating off-road with a fast autonomous vehicle depends on a robust
perception system that differentiates traversable from non-traversable terrain.
Typically, this depends on a semantic understanding which is based on
supervised learning from images annotated by a human expert. This requires a
significant investment in human time, assumes correct expert classification,
and small details can lead to misclassification. To address these challenges,
we propose a method for predicting high- and low-risk terrains from only past
vehicle experience in a self-supervised fashion. First, we develop a tool that
projects the vehicle trajectory into the front camera image. Second, occlusions
in the 3D representation of the terrain are filtered out. Third, an autoencoder
trained on masked vehicle trajectory regions identifies low- and high-risk
terrains based on the reconstruction error. We evaluated our approach with two
models and different bottleneck sizes with two different training and testing
sites with a fourwheeled off-road vehicle. Comparison with two independent test
sets of semantic labels from similar terrain as training sites demonstrates the
ability to separate the ground as low-risk and the vegetation as high-risk with
81.1% and 85.1% accuracy
NeBula: Team CoSTAR's robotic autonomy solution that won phase II of DARPA Subterranean Challenge
This paper presents and discusses algorithms, hardware, and software architecture developed by the TEAM CoSTAR (Collaborative SubTerranean Autonomous Robots), competing in the DARPA Subterranean Challenge. Specifically, it presents the techniques utilized within the Tunnel (2019) and Urban (2020) competitions, where CoSTAR achieved second and first place, respectively. We also discuss CoSTAR¿s demonstrations in Martian-analog surface and subsurface (lava tubes) exploration. The paper introduces our autonomy solution, referred to as NeBula (Networked Belief-aware Perceptual Autonomy). NeBula is an uncertainty-aware framework that aims at enabling resilient and modular autonomy solutions by performing reasoning and decision making in the belief space (space of probability distributions over the robot and world states). We discuss various components of the NeBula framework, including (i) geometric and semantic environment mapping, (ii) a multi-modal positioning system, (iii) traversability analysis and local planning, (iv) global motion planning and exploration behavior, (v) risk-aware mission planning, (vi) networking and decentralized reasoning, and (vii) learning-enabled adaptation. We discuss the performance of NeBula on several robot types (e.g., wheeled, legged, flying), in various environments. We discuss the specific results and lessons learned from fielding this solution in the challenging courses of the DARPA Subterranean Challenge competition.The work is partially supported by the Jet Propulsion Laboratory, California Institute of Technology,
under a contract with the National Aeronautics and Space Administration (80NM0018D0004), and
Defense Advanced Research Projects Agency (DARPA)
Robust Kalman-Type Filter for Non-Gaussian Noise: Performance Analysis With Unknown Noise Covariances
The Kalman filter (KF) is optimal with respect to minimum mean square error (MMSE) if the process noise and measurement noise are Gaussian. However, the KF is suboptimal in the presence of non-Gaussian noise. The maximum correntropy criterion Kalman filter (MCC-KF) is a Kalman-type filter that uses the correntropy measure as its optimality criterion instead of MMSE. In this paper, we modify the correntropy gain in the MCC-KF to obtain a new filter that we call the measurement-specific correntropy filter (MSCF). The MSCF uses a matrix gain rather than a scalar gain to provide better selectivity in the way that it handles the innovation vector. We analytically compare the performance of the KF with that of the MSCF when either the measurement or process noise covariance is unknown. For each of these situations, we analyze two mean square errors (MSEs): the filter-calculated MSE (FMSE) and the true MSE (TMSE). We show that the FMSE of the KF is less than that of the MSCF. However, the TMSE of the KF is greater than that of the MSCF under certain conditions. Illustrative examples are provided to verify the analytical results