245 research outputs found
Tuning Modular Networks with Weighted Losses for Hand-Eye Coordination
This paper introduces an end-to-end fine-tuning method to improve hand-eye
coordination in modular deep visuo-motor policies (modular networks) where each
module is trained independently. Benefiting from weighted losses, the
fine-tuning method significantly improves the performance of the policies for a
robotic planar reaching task.Comment: 2 pages, to appear in the Deep Learning for Robotic Vision (DLRV)
Workshop in CVPR 201
Towards Assessing Compliant Robotic Grasping from First-Object Perspective via Instrumented Objects
Grasping compliant objects is difficult for robots - applying too little
force may cause the grasp to fail, while too much force may lead to object
damage. A robot needs to apply the right amount of force to quickly and
confidently grasp the objects so that it can perform the required task.
Although some methods have been proposed to tackle this issue, performance
assessment is still a problem for directly measuring object property changes
and possible damage. To fill the gap, a new concept is introduced in this paper
to assess compliant robotic grasping using instrumented objects. A
proof-of-concept design is proposed to measure the force applied on a cuboid
object from a first-object perspective. The design can detect multiple contact
locations and applied forces on its surface by using multiple embedded 3D Hall
sensors to detect deformation relative to embedded magnets. The contact
estimation is achieved by interpreting the Hall-effect signals using neural
networks. In comprehensive experiments, the design achieved good performance in
estimating contacts from each single face of the cuboid and decent performance
in detecting contacts from multiple faces when being used to evaluate grasping
from a parallel jaw gripper, demonstrating the effectiveness of the design and
the feasibility of the concept.Comment: Under review for RA-
Ada-NETS: Face Clustering via Adaptive Neighbour Discovery in the Structure Space
Face clustering has attracted rising research interest recently to take
advantage of massive amounts of face images on the web. State-of-the-art
performance has been achieved by Graph Convolutional Networks (GCN) due to
their powerful representation capacity. However, existing GCN-based methods
build face graphs mainly according to kNN relations in the feature space, which
may lead to a lot of noise edges connecting two faces of different classes. The
face features will be polluted when messages pass along these noise edges, thus
degrading the performance of GCNs. In this paper, a novel algorithm named
Ada-NETS is proposed to cluster faces by constructing clean graphs for GCNs. In
Ada-NETS, each face is transformed to a new structure space, obtaining robust
features by considering face features of the neighbour images. Then, an
adaptive neighbour discovery strategy is proposed to determine a proper number
of edges connecting to each face image. It significantly reduces the noise
edges while maintaining the good ones to build a graph with clean yet rich
edges for GCNs to cluster faces. Experiments on multiple public clustering
datasets show that Ada-NETS significantly outperforms current state-of-the-art
methods, proving its superiority and generalization. Code is available at
https://github.com/damo-cv/Ada-NETS
- …