4 research outputs found

    BlockDrop: Dynamic Inference Paths in Residual Networks

    Full text link
    Very deep convolutional neural networks offer excellent recognition results, yet their computational expense limits their impact for many real-world applications. We introduce BlockDrop, an approach that learns to dynamically choose which layers of a deep network to execute during inference so as to best reduce total computation without degrading prediction accuracy. Exploiting the robustness of Residual Networks (ResNets) to layer dropping, our framework selects on-the-fly which residual blocks to evaluate for a given novel image. In particular, given a pretrained ResNet, we train a policy network in an associative reinforcement learning setting for the dual reward of utilizing a minimal number of blocks while preserving recognition accuracy. We conduct extensive experiments on CIFAR and ImageNet. The results provide strong quantitative and qualitative evidence that these learned policies not only accelerate inference but also encode meaningful visual information. Built upon a ResNet-101 model, our method achieves a speedup of 20\% on average, going as high as 36\% for some images, while maintaining the same 76.4\% top-1 accuracy on ImageNet.Comment: CVPR 201

    The Effect of Active Learning on Viewpoint Dependence for Novel Objects

    Get PDF
    Active learning of novel objects can facilitate subsequent object recognition and discrimination, but the reasons for its beneficial effects remain unclear. One potential explanation is that active learning enables the formation of a more detailed, realistic, or useful neural object representation than does passive learning. The current study addressed the question of whether active vs. passive learning of objects affects viewpoint discrimination. Participants learned novel wire-like objects either actively or passively and then completed a psychophysical task which they discriminated object orientation. This study did not find a significant difference in viewpoint discrimination between actively and passively learned object representations, which stands in contrast to earlier studies that found an effect of active learning on object recognition across different viewpoints. This suggests that viewpoint discrimination and viewpoint generalization rely on different mechanisms

    Integrating Vision and Physical Interaction for Discovery, Segmentation and Grasping of Unknown Objects

    Get PDF
    In dieser Arbeit werden Verfahren der Bildverarbeitung und die Fähigkeit humanoider Roboter, mit ihrer Umgebung physisch zu interagieren, in engem Zusammenspiel eingesetzt, um unbekannte Objekte zu identifizieren, sie vom Hintergrund und anderen Objekten zu trennen, und letztendlich zu greifen. Im Verlauf dieser interaktiven Exploration werden außerdem Eigenschaften des Objektes wie etwa sein Aussehen und seine Form ermittelt
    corecore