146,775 research outputs found

    Finding any Waldo: zero-shot invariant and efficient visual search

    Full text link
    Searching for a target object in a cluttered scene constitutes a fundamental challenge in daily vision. Visual search must be selective enough to discriminate the target from distractors, invariant to changes in the appearance of the target, efficient to avoid exhaustive exploration of the image, and must generalize to locate novel target objects with zero-shot training. Previous work has focused on searching for perfect matches of a target after extensive category-specific training. Here we show for the first time that humans can efficiently and invariantly search for natural objects in complex scenes. To gain insight into the mechanisms that guide visual search, we propose a biologically inspired computational model that can locate targets without exhaustive sampling and generalize to novel objects. The model provides an approximation to the mechanisms integrating bottom-up and top-down signals during search in natural scenes.Comment: Number of figures: 6 Number of supplementary figures: 1

    Improving 6D Pose Estimation of Objects in Clutter via Physics-aware Monte Carlo Tree Search

    Full text link
    This work proposes a process for efficiently searching over combinations of individual object 6D pose hypotheses in cluttered scenes, especially in cases involving occlusions and objects resting on each other. The initial set of candidate object poses is generated from state-of-the-art object detection and global point cloud registration techniques. The best-scored pose per object by using these techniques may not be accurate due to overlaps and occlusions. Nevertheless, experimental indications provided in this work show that object poses with lower ranks may be closer to the real poses than ones with high ranks according to registration techniques. This motivates a global optimization process for improving these poses by taking into account scene-level physical interactions between objects. It also implies that the Cartesian product of candidate poses for interacting objects must be searched so as to identify the best scene-level hypothesis. To perform the search efficiently, the candidate poses for each object are clustered so as to reduce their number but still keep a sufficient diversity. Then, searching over the combinations of candidate object poses is performed through a Monte Carlo Tree Search (MCTS) process that uses the similarity between the observed depth image of the scene and a rendering of the scene given the hypothesized pose as a score that guides the search procedure. MCTS handles in a principled way the tradeoff between fine-tuning the most promising poses and exploring new ones, by using the Upper Confidence Bound (UCB) technique. Experimental results indicate that this process is able to quickly identify in cluttered scenes physically-consistent object poses that are significantly closer to ground truth compared to poses found by point cloud registration methods.Comment: 8 pages, 4 figure

    Optimized Ray Tracing

    Get PDF
    Táto práca sa zaoberá problematikou sledovania lúča a jej rôznymi optimalizáciami. Rozoberá matematický princíp sledovania lúča a hľadania priesečníka objektov scény s lúčom.  Tiež analyzuje rôzne osvetľovacie modely a efektívne budovanie kd-stromu pre rozdelenie objektov scény. Práca sa zameriava na návrh a implementáciu multiplatformnej a jednoducho rozšíriteľnej aplikácie vykonávajúcej ray-tracing virtuálnej scény v jazyku C++.This bachelor's thesis analyzes the problem of ray tracing and its optimizations. It describes mathematical principles of ray tracing and intersection searching of objects in scene and ray.  It also analyzes different shading models and efficient building of kd-tree for dividing scene objects. The thesis focuses on design and implementation of multiplatform and easily extensible ray tracing application in C++.

    Remarks on searching zones of interest in locally uniform scenes

    Get PDF
    For searching for objects of limited size in a scene by using geometrical features, we suggest to segment not the whole scene, but only fragments of it containing the objects themselves and their environment. We examine the formalization of the search problem in the scene of such fragments, called the zones of interest, and discuss the results of its solution. © 2011 Pleiades Publishing, Ltd

    Search of regions of interest in objects showing signs of a spot on locally homogeneous scenes

    Get PDF
    It is suggested that while searching a scene for objects of known sizes by means of spectral and geometrical signs, instead of segmentation of the entire scene, only those fragments that contain the objects themselves and their surroundings be segmented. The formalization of a search query in the scene of such fragments, called regions of interest, is discussed, methods of its solution are suggested, and the results obtained are examined further. © 2012 Pleiades Publishing, Ltd
    corecore