
    Learning shepherding behavior

    Artificial shepherding strategies, i.e. using robots to move individuals to given locations, have many applications. For example, mobile robots can guide people away from dangerous places, and swimming robots may help to clean up oil spills. This thesis uses a multiagent system to model the robots and sheep. We analyze the complexity of the shepherding task and present a greedy algorithm that needs only linear time to compute a solution that is proven to be close to optimal. Additionally, we analyze to what extent such strategies can be learned, since learning often yields powerful solutions. This thesis focuses on reinforcement learning (RL) as the learning method. To enable RL agents to use their knowledge more efficiently in continuous or large state spaces (as in the shepherding task), methods to transfer knowledge to unseen but similar situations are required. The approaches developed in this thesis, GNG-Q and I-GNG-Q, combine RL with adaptive neural algorithms and enable the agent to learn behavior in parallel with its representation. Both are based on growing neural gas, an unsupervised learning approach that learns a vector quantization by placing and adjusting units in the input space. GNG-Q groups states that are spatially close and share the same behavior, while I-GNG-Q combines the learned behavior from a larger area of the approximation, which results in smoother value functions. Thus, GNG-Q performs a state-space abstraction and I-GNG-Q approximates the value function. Both methods monitor the agent's policy during learning to find regions of the approximation that have to be refined. Among the core advantages of our approaches are that they do not need a model of the environment and that the resolution of the approximation is determined automatically. The experimental evaluation underlines that the behaviors learned using our approaches are highly efficient and effective.
    Michael Baumann. Date of defense: 22.01.2016. Fakultät für Elektrotechnik, Informatik und Mathematik, Universität Paderborn, Univ., Dissertation, 201
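    The abstract describes growing neural gas as learning a vector quantization by placing and adjusting units in the input space. The following is a minimal sketch of the adjustment step only, not the thesis's GNG-Q or I-GNG-Q. It assumes a fixed set of two units and treats the second-closest unit as the only neighbor; full growing neural gas also maintains edges, edge ages and accumulated errors, and inserts new units. All names (`nearest_two`, `adapt`, `quantization_error`), the learning rates and the synthetic data are hypothetical.

```python
import math
import random


def nearest_two(units, x):
    """Return the indices of the closest and second-closest units to x."""
    order = sorted(range(len(units)), key=lambda i: math.dist(units[i], x))
    return order[0], order[1]


def adapt(units, x, eps_winner=0.2, eps_second=0.02):
    """Move the winner strongly, and the runner-up weakly, toward input x.

    Simplification (assumption): the runner-up stands in for the winner's
    topological neighbors, which real growing neural gas tracks via edges.
    """
    winner, second = nearest_two(units, x)
    for idx, eps in ((winner, eps_winner), (second, eps_second)):
        units[idx] = [u + eps * (xi - u) for u, xi in zip(units[idx], x)]
    return winner


def quantization_error(units, data):
    """Mean distance from each sample to its closest unit."""
    return sum(min(math.dist(u, x) for u in units) for x in data) / len(data)


rng = random.Random(0)
# Two well-separated clusters of 2-D inputs (synthetic, illustration only).
data = [[rng.gauss(cx, 0.1), rng.gauss(cy, 0.1)]
        for cx, cy in ((0.0, 0.0), (1.0, 1.0)) for _ in range(100)]
units = [[0.5, 0.4], [0.5, 0.6]]  # hypothetical initial unit positions

error_before = quantization_error(units, data)
for _ in range(20):          # a few passes over the shuffled data
    rng.shuffle(data)
    for x in data:
        adapt(units, x)
error_after = quantization_error(units, data)
```

    In this sketch each unit drifts toward one cluster, so the quantization error after adaptation is lower than before. In an RL setting, each unit could in principle index a group of nearby states, which is the kind of state-space abstraction the abstract attributes to GNG-Q.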

    Registration and analysis of dynamic magnetic resonance image series

    Cystic fibrosis (CF) is an autosomal-recessive inherited metabolic disorder that affects all organs in the human body. Patients affected with CF suffer particularly from chronic inflammation and obstruction of the airways. Through early detection, continuous monitoring methods, and new treatments, the life expectancy of patients with CF has increased drastically in recent decades. However, continuous monitoring of the disease progression is essential for successful treatment. The current state-of-the-art methods for lung disease detection and monitoring are computed tomography (CT) and X-ray. These techniques are ill-suited for monitoring disease progression because of the ionizing radiation the patient is exposed to during the examination. Through the development of new magnetic resonance imaging (MRI) sequences and evaluation methods, MRI is able to measure physiological changes in the lungs. The process of creating physiological maps of the lungs, i.e. ventilation and perfusion maps, using MRI can be split into three parts: MR acquisition, image registration, and image analysis. In this work, we present different methods for the image registration part and the image analysis part. We developed a graph-based registration method for 2D dynamic MR image series of the lungs in order to overcome the problem of sliding motion at organ boundaries. Furthermore, we developed a human-inspired learning-based registration method. Here, the registration is defined as a sequence of local transformations. The sequence-based approach combines the advantage of dense transformation models, i.e. a large space of transformations, with the advantage of interpolating transformation models, i.e. smooth local transformations. We also developed a general registration framework called Autograd Image Registration Laboratory (AIRLab), which performs automatic calculation of the gradients for the registration process. This allows rapid prototyping and easy implementation of existing registration algorithms. For the image analysis part, we developed a deep-learning approach based on gated recurrent units that is able to calculate ventilation maps with less than a third of the number of images required by the current method. Automatic defect detection in the estimated MRI ventilation and perfusion maps is essential in the clinical routine for automatically evaluating treatment progression. We developed a weakly supervised method that is able to infer a pixel-wise defect segmentation using only a continuous global label during training. In this case, we directly use the lung clearance index (LCI) as a global weak label, without any further manual annotations. The LCI is a global measure of ventilation inhomogeneities of the lungs and is obtained by a multiple breath washout test.

    Data-efficient machine learning for design and optimisation of complex systems


    Numerical Computation, Data Analysis and Software in Mathematics and Engineering

    The present book contains 14 articles that were accepted for publication in the Special Issue “Numerical Computation, Data Analysis and Software in Mathematics and Engineering” of the MDPI journal Mathematics. The topics of these articles include aspects of the meshless method, numerical simulation, mathematical models, deep learning and data analysis. Meshless methods, such as the improved element-free Galerkin method, the dimension-splitting, interpolating, moving, least-squares method, the dimension-splitting, generalized, interpolating, element-free Galerkin method and the improved interpolating, complex variable, element-free Galerkin method, are presented. Some complicated problems, such as the cold roll-forming process, ceramsite compound insulation block, crack propagation and heavy-haul railway tunnel with defects, are numerically analyzed. Mathematical models, such as the lattice hydrodynamic model, extended car-following model and smart helmet-based PLS-BPNN error compensation model, are proposed. The use of the deep learning approach to predict the mechanical properties of single-network hydrogel is presented, and data analysis for land leasing is discussed. This book will be interesting and useful for those working in the meshless method, numerical simulation, mathematical model, deep learning and data analysis fields.

    Koneoppimiskehys OPC UA datalle (Industry 4.0)

    Machine learning has rapidly gained popularity in all industries with the increase in computational power and data gathering capabilities. The process industry is a good candidate for machine-learning-based modeling because of the large amounts of data gathered and the need for accurate process state predictions. This work studied the viability of combining the OPC UA protocol with existing open-source machine learning libraries to create data-driven models and generate real-time predictions. Scikit-learn was used to generate soft-sensor-style models for the butane content of a debutanizer column output. The data for offline model training was fetched dynamically from an OPC UA server, and with a trained model, predictions could be generated in real time. The accuracy of the generated models needs to be researched further with better methodology and larger datasets.