7 research outputs found

    Model-based viewpoint invariant human activity recognition from uncalibrated monocular video sequence

    Get PDF
    There is growing interest in human activity recognition systems, motivated by their numerous promising applications in many domains. Despite much progress, most researchers have narrowed the problem towards fixed camera viewpoint owing to inherent difficulty to train their systems across all possible viewpoints. Fixed viewpoint systems are impractical in real scenarios. Therefore, we attempt to relax the fixed viewpoint assumption and present a novel and simple framework to recognize and classify human activities from uncalibrated monocular video source from any viewpoint. The proposed framework comprises two stages: 3D human pose estimation and human activity recognition. In the pose estimation stage, we estimate 3D human pose by a simple search-based and tracking-based technique. In the activity recognition stage, we use Nearest Neighbor, with Dynamic Time Warping as a distance measure, to classify multivariate time series which emanate from streams of pose vectors from multiple video frames. We have performed some experiments to evaluate the accuracy of the two stages separately. The encouraging experimental results demonstrate the effectiveness of our framework

    Treasure hunt : a framework for cooperative, distributed parallel optimization

    Get PDF
    Orientador: Prof. Dr. Daniel WeingaertnerCoorientadora: Profa. Dra. Myriam Regattieri DelgadoTese (doutorado) - Universidade Federal do ParanĂĄ, Setor de CiĂȘncias Exatas, Programa de PĂłs-Graduação em InformĂĄtica. Defesa : Curitiba, 27/05/2019Inclui referĂȘncias: p. 18-20Área de concentração: CiĂȘncia da ComputaçãoResumo: Este trabalho propĂ”e um framework multinĂ­vel chamado Treasure Hunt, que Ă© capaz de distribuir algoritmos de busca independentes para um grande nĂșmero de nĂłs de processamento. Com o objetivo de obter uma convergĂȘncia conjunta entre os nĂłs, este framework propĂ”e um mecanismo de direcionamento que controla suavemente a cooperação entre mĂșltiplas instĂąncias independentes do Treasure Hunt. A topologia em ĂĄrvore proposta pelo Treasure Hunt garante a rĂĄpida propagação da informação pelos nĂłs, ao mesmo tempo em que provĂȘ simutaneamente exploraçÔes (pelos nĂłs-pai) e intensificaçÔes (pelos nĂłs-filho), em vĂĄrios nĂ­veis de granularidade, independentemente do nĂșmero de nĂłs na ĂĄrvore. O Treasure Hunt tem boa tolerĂąncia Ă  falhas e estĂĄ parcialmente preparado para uma total tolerĂąncia Ă  falhas. Como parte dos mĂ©todos desenvolvidos durante este trabalho, um mĂ©todo automatizado de Particionamento Iterativo foi proposto para controlar o balanceamento entre exploraçÔes e intensificaçÔes ao longo da busca. Uma Modelagem de Estabilização de ConvergĂȘncia para operar em modo Online tambĂ©m foi proposto, com o objetivo de encontrar pontos de parada com bom custo/benefĂ­cio para os algoritmos de otimização que executam dentro das instĂąncias do Treasure Hunt. Experimentos em benchmarks clĂĄssicos, aleatĂłrios e de competição, de vĂĄrios tamanhos e complexidades, usando os algoritmos de busca PSO, DE e CCPSO2, mostram que o Treasure Hunt melhora as caracterĂ­sticas inerentes destes algoritmos de busca. O Treasure Hunt faz com que os algoritmos de baixa performance se tornem comparĂĄveis aos de boa performance, e os algoritmos de boa performance possam estender seus limites atĂ© problemas maiores. Experimentos distribuindo instĂąncias do Treasure Hunt, em uma rede cooperativa de atĂ© 160 processos, demonstram a escalabilidade robusta do framework, apresentando melhoras nos resultados mesmo quando o tempo de processamento Ă© fixado (wall-clock) para todas as instĂąncias distribuĂ­das do Treasure Hunt. Resultados demonstram que o mecanismo de amostragem fornecido pelo Treasure Hunt, aliado Ă  maior cooperação entre as mĂșltiplas populaçÔes em evolução, reduzem a necessidade de grandes populaçÔes e de algoritmos de busca complexos. Isto Ă© especialmente importante em problemas de mundo real que possuem funçÔes de fitness muito custosas. Palavras-chave: InteligĂȘncia artificial. MĂ©todos de otimização. Algoritmos distribuĂ­dos. Modelagem de convergĂȘncia. Alta dimensionalidade.Abstract: This work proposes a multilevel framework called Treasure Hunt, which is capable of distributing independent search algorithms to a large number of processing nodes. Aiming to obtain joint convergences between working nodes, Treasure Hunt proposes a driving mechanism that smoothly controls the cooperation between the multiple independent Treasure Hunt instances. The tree topology proposed by Treasure Hunt ensures quick propagation of information, while providing simultaneous explorations (by parents) and exploitations (by children), on several levels of granularity, regardless the number of nodes in the tree. Treasure Hunt has good fault tolerance and is partially prepared to full fault tolerance. As part of the methods developed during this work, an automated Iterative Partitioning method is proposed to control the balance between exploration and exploitation as the search progress. A Convergence Stabilization Modeling to operate in Online mode is also proposed, aiming to find good cost/benefit stopping points for the optimization algorithms running within the Treasure Hunt instances. Experiments on classic, random and competition benchmarks of various sizes and complexities, using the search algorithms PSO, DE and CCPSO2, show that Treasure Hunt boosts the inherent characteristics of these search algorithms. Treasure Hunt makes algorithms with poor performances to become comparable to good ones, and algorithms with good performances to be capable of extending their limits to larger problems. Experiments distributing Treasure Hunt instances in a cooperative network up to 160 processes show the robust scaling of the framework, presenting improved results even when fixing a wall-clock time for the instances. Results show that the sampling mechanism provided by Treasure Hunt, allied to the increased cooperation between multiple evolving populations, reduce the need for large population sizes and complex search algorithms. This is specially important on real-world problems with time-consuming fitness functions. Keywords: Artificial intelligence. Optimization methods. Distributed algorithms. Convergence modeling. High dimensionality

    Gaussian Processes in Reinforcement Learning: Stability Analysis and Efficient Value Propagation

    Get PDF
    Control of nonlinear systems on continuous domains is a challenging task for various reasons. For robust and accurate control of complex systems a precise model of the system dynamics is essential. Building such highly precise dynamics models from physical knowledge often requires substantial manual effort and poses a great challenge in industrial applications. Acquiring a model automatically from system measurements employing regression techniques allows to decrease manual effort and, thus, poses an interesting alternative to knowledge-based modeling. Based on such a learned dynamics model, an approximately optimal controller can be inferred automatically. Such approaches are the subject of model-based reinforcement learning (RL) and learn optimal control from interactions with the system. Especially when probabilistic dynamics models such as Gaussian processes are employed, model-based RL has been tremendously successful and has attracted much attention from both the control and machine learning communities. However, several problems need to be solved to facilitate widespread deployment of model-based RL for learning control in real world scenarios. In this thesis, we address two current limitations of model-based RL that are indispensable prerequisites for widespread deployment of model-based RL in real world tasks. In many real world applications a poor controller can cause severe damage to the system or even put the safety of humans at risk. Thus, it is essential to ensure that the controlled system behaves as desired. While this question has been studied extensively in classical control, stability of closed-loop control systems with dynamics given as a Gaussian process has not been considered yet. We propose an automatic tool to compute regions of the state space where the desired behavior of the system can be guaranteed. We consider dynamics given as the mean of a GP as well as the full GP posterior distribution. In the first case, the proposed tool constructs regions of the state space, such that the trajectories starting in this region converge to the target state. From this asymptotic result, we follow statements for finite time horizons and stability under the presence of disturbances. In the second case the system dynamics is given as a GP posterior distribution. Thus, computation of multi-step-ahead predictions requires averaging over all plausible dynamics models given the observations. A a consequence, multi-step-ahead predictions become analytically intractable. We propose an approximation based on numerical quadrature that can handle complex state distributions, e.g., with multiple modes and provides upper bounds for the approximation error. Exploiting these error bounds, we present an automatic tool to compute stability regions. In these regions of the state space, our tool guarantees that for a finite time horizon the system behaves as desired with a given probability. Furthermore, we analyze asymptotic behavior of closed-loop control systems with dynamics given as a GP posterior distribution. In this case we show that for some common choices of the prior, the system has a unique stationary distribution to which the system state converges irrespective of the starting state. Another major challenge of RL for real world control applications is to minimize interactions with the system required for learning. While RL approaches based on GP dynamics models have demonstrated great data efficiency, the average amount of required system interactions can further be reduced. To achieve this goal, we propose to employ the numerical quadrature based approximation to propagate the value of a state. To show how this approximation can further increase data efficiency, we employ it in the two main classes of model-based RL: policy search and value iteration. In policy search, the state distribution must be computed to evaluate the expected long-term reward for a policy. The proposed numerical quadrature based approximation substantially improves estimates of the expected long-term reward and its gradients. As a result, data efficiency is significantly increased. For the value function based approaches for policy learning, the value propagation step is completely characterized by the Bellman equation. However, this equation is intractable for nonlinear dynamics. In this case, we propose a projection-based value iteration approach. We employ numerical quadrature to facilitate projection of the value function onto a linear feature space. Suitable features for value function representation are learned online without manual effort. This feature learning is constructed such that upper bounds for the projection error can be obtained. The proposed value iteration approach learns globally optimal policies and significantly benefits from the introduced highly accurate approximations

    Optimum Allocation of Inspection Stations in Multistage Manufacturing Processes by Using Max-Min Ant System

    Get PDF
    In multistage manufacturing processes it is common to locate inspection stations after some or all of the processing workstations. The purpose of the inspection is to reduce the total manufacturing cost, resulted from unidentified defective items being processed unnecessarily through subsequent manufacturing operations. This total cost is the sum of the costs of production, inspection and failures (during production and after shipment). Introducing inspection stations into a serial multistage manufacturing process, although constituting an additional cost, is expected to be a profitable course of action. Specifically, at some positions the associated inspection costs will be recovered from the benefits realised through the detection of defective items, before wasting additional cost by continuing to process them. In this research, a novel general cost modelling for allocating a limited number of inspection stations in serial multistage manufacturing processes is formulated. In allocation of inspection station (AOIS) problem, as the number of workstations increases, the number of inspection station allocation possibilities increases exponentially. To identify the appropriate approach for the AOIS problem, different optimisation methods are investigated. The MAX-MIN Ant System (MMAS) algorithm is proposed as a novel approach to explore AOIS in serial multistage manufacturing processes. MMAS is an ant colony optimisation algorithm that was designed originally to begin an explorative search phase and, subsequently, to make a slow transition to the intensive exploitation of the best solutions found during the search, by allowing only one ant to update the pheromone trails. Two novel heuristics information for the MMAS algorithm are created. The heuristic information for the MMAS algorithm is exploited as a novel means to guide ants to build reasonably good solutions from the very beginning of the search. To improve the performance of the MMAS algorithm, six local search methods which are well-known and suitable for the AOIS problem are used. Selecting relevant parameter values for the MMAS algorithm can have a great impact on the algorithm’s performance. As a result, a method for tuning the most influential parameter values for the MMAS algorithm is developed. The contribution of this research is, for the first time, a methodology using MMAS to solve the AOIS problem in serial multistage manufacturing processes has been developed. The methodology takes into account the constraints on inspection resources, in terms of a limited number of inspection stations. As a result, the total manufacturing cost of a product can be reduced, while maintaining the quality of the product. Four numerical experiments are conducted to assess the MMAS algorithm for the AOIS problem. The performance of the MMAS algorithm is compared with a number of other methods this includes the complete enumeration method (CEM), rule of thumb, a pure random search algorithm, particle swarm optimisation, simulated annealing and genetic algorithm. The experimental results show that the effectiveness of the MMAS algorithm lies in its considerably shorter execution time and robustness. Further, in certain conditions results obtained by the MMAS algorithm are identical to the CEM. In addition, the results show that applying local search to the MMAS algorithm has significantly improved the performance of the algorithm. Also the results demonstrate that it is essential to use heuristic information with the MMAS algorithm for the AOIS problem, in order to obtain a high quality solution. It was found that the main parameters of MMAS include the pheromone trail intensity, heuristic information and evaporation of pheromone are less sensitive within the specified range as the number of workstations is significantly increased

    Globally convergent evolution strategies with application to Earth imaging problem in geophysics

    Get PDF
    Au cours des derniĂšres annĂ©es, s’est dĂ©veloppĂ© un intĂ©rĂȘt tout particulier pour l’optimisation sans dĂ©rivĂ©e. Ce domaine de recherche se divise en deux catĂ©gories: une dĂ©terministe et l’autre stochastique. Bien qu’il s’agisse du mĂȘme domaine, peu de liens ont dĂ©jĂ  Ă©tĂ© Ă©tablis entre ces deux branches. Cette thĂšse a pour objectif de combler cette lacune, en montrant comment les techniques issues de l’optimisation dĂ©terministe peuvent amĂ©liorer la performance des stratĂ©gies Ă©volutionnaires, qui font partie des meilleures mĂ©thodes en optimisation stochastique. Sous certaines hypothĂšses, les modifications rĂ©alisĂ©es assurent une forme de convergence globale, c’est-Ă -dire une convergence vers un point stationnaire de premier ordre indĂ©pendamment du point de dĂ©part choisi. On propose ensuite d’adapter notre algorithme afin qu’il puisse traiter des problĂšmes avec des contraintes gĂ©nĂ©rales. On montrera Ă©galement comment amĂ©liorer les performances numĂ©riques des stratĂ©gies Ă©volutionnaires en incorporant un pas de recherche au dĂ©but de chaque itĂ©ration, dans laquelle on construira alors un modĂšle quadratique utilisant les points oĂč la fonction coĂ»t a dĂ©jĂ  Ă©tĂ© Ă©valuĂ©e. GrĂące aux rĂ©cents progrĂšs techniques dans le domaine du calcul parallĂšle, et Ă  la nature parallĂ©lisable des stratĂ©gies Ă©volutionnaires, on propose d’appliquer notre algorithme pour rĂ©soudre un problĂšme inverse d’imagerie sismique. Les rĂ©sultats obtenus ont permis d’amĂ©liorer la rĂ©solution de ce problĂšme. ABSTRACT : In recent years, there has been significant and growing interest in Derivative-Free Optimization (DFO). This field can be divided into two categories: deterministic and stochastic. Despite addressing the same problem domain, only few interactions between the two DFO categories were established in the existing literature. In this thesis, we attempt to bridge this gap by showing how ideas from deterministic DFO can improve the efficiency and the rigorousness of one of the most successful class of stochastic algorithms, known as Evolution Strategies (ES’s). We propose to equip a class of ES’s with known techniques from deterministic DFO. The modified ES’s achieve rigorously a form of global convergence under reasonable assumptions. By global convergence, we mean convergence to first-order stationary points independently of the starting point. The modified ES’s are extended to handle general constrained optimization problems. Furthermore, we show how to significantly improve the numerical performance of ES’s by incorporating a search step at the beginning of each iteration. In this step, we build a quadratic model using the points where the objective function has been previously evaluated. Motivated by the recent growth of high performance computing resources and the parallel nature of ES’s, an application of our modified ES’s to Earth imaging Geophysics problem is proposed. The obtained results provide a great improvement for the problem resolution
    corecore