43 research outputs found
Learning the Structure of Continuous Markov Decision Processes
There is growing interest in artificial, intelligent agents which can operate autonomously for an extended period of time in complex environments and fulfill a variety of different tasks. Such agents will face different problems during their lifetime which may not be foreseeable at the time of their deployment. Thus, the capacity for lifelong learning of new behaviors is an essential prerequisite for this kind of agents as it enables them to deal with unforeseen situations. However, learning every complex behavior anew from scratch would be cumbersome for the agent. It is more plausible to consider behavior to be modular and let the agent acquire a set of reusable building blocks for behavior, the so-called skills. These skills might, once acquired, facilitate fast learning and adaptation of behavior to new situations. This work focuses on computational approaches for skill acquisition, namely which kind of skills shall be acquired and how to acquire them. The former is commonly denoted as skill discovery and the latter as skill learning . The main contribution of this thesis is a novel incremental skill acquisition approach which is suited for lifelong learning. In this approach, the agent learns incrementally a graph-based representation of a domain and exploits certain properties of this graph such as its bottlenecks for skill discovery. This thesis proposes a novel approach for learning a graph-based representation of continuous domains based on formalizing the problem as a probabilistic generative model. Furthermore, a new incremental agglomerative clustering approach for identifying bottlenecks of such graphs is presented. Thereupon, the thesis proposes a novel intrinsic motivation system which enables an agent to intelligently allocate time between skill discovery and skill learning in developmental settings, where the agent is not constrained by external tasks. The results of this thesis show that the resulting skill acquisition approach is suited for continuous domains and can deal with domain stochasticity and different explorative behavior of the agent. The acquired skills are reusable and versatile and can be used in multi-task and lifelong learning settings in high-dimensional problems
Identifying Systematic Errors in Object Detectors with the SCROD Pipeline
The identification and removal of systematic errors in object detectors can
be a prerequisite for their deployment in safety-critical applications like
automated driving and robotics. Such systematic errors can for instance occur
under very specific object poses (location, scale, orientation), object
colors/textures, and backgrounds. Real images alone are unlikely to cover all
relevant combinations. We overcome this limitation by generating synthetic
images with fine-granular control. While generating synthetic images with
physical simulators and hand-designed 3D assets allows fine-grained control
over generated images, this approach is resource-intensive and has limited
scalability. In contrast, using generative models is more scalable but less
reliable in terms of fine-grained control. In this paper, we propose a novel
framework that combines the strengths of both approaches. Our meticulously
designed pipeline along with custom models enables us to generate street scenes
with fine-grained control in a fully automated and scalable manner. Moreover,
our framework introduces an evaluation setting that can serve as a benchmark
for similar pipelines. This evaluation setting will contribute to advancing the
field and promoting standardized testing procedures