360 research outputs found
Robot eye-hand coordination learning by watching human demonstrations: a task function approximation approach
We present a robot eye-hand coordination learning method that can directly
learn visual task specification by watching human demonstrations. Task
specification is represented as a task function, which is learned using inverse
reinforcement learning(IRL) by inferring differential rewards between state
changes. The learned task function is then used as continuous feedbacks in an
uncalibrated visual servoing(UVS) controller designed for the execution phase.
Our proposed method can directly learn from raw videos, which removes the need
for hand-engineered task specification. It can also provide task
interpretability by directly approximating the task function. Besides,
benefiting from the use of a traditional UVS controller, our training process
is efficient and the learned policy is independent from a particular robot
platform. Various experiments were designed to show that, for a certain DOF
task, our method can adapt to task/environment variances in target positions,
backgrounds, illuminations, and occlusions without prior retraining.Comment: Accepted in ICRA 201
Segmentation-by-Detection: A Cascade Network for Volumetric Medical Image Segmentation
We propose an attention mechanism for 3D medical image segmentation. The
method, named segmentation-by-detection, is a cascade of a detection module
followed by a segmentation module. The detection module enables a region of
interest to come to attention and produces a set of object region candidates
which are further used as an attention model. Rather than dealing with the
entire volume, the segmentation module distills the information from the
potential region. This scheme is an efficient solution for volumetric data as
it reduces the influence of the surrounding noise which is especially important
for medical data with low signal-to-noise ratio. Experimental results on 3D
ultrasound data of the femoral head shows superiority of the proposed method
when compared with a standard fully convolutional network like the U-Net
On recursive temporal difference and eligibility traces
This work studies a new reinforcement learning method in the framework of Recursive Least-Squares Temporal Difference (RLS-TD). Differently from the standard mechanism of eligibility traces, leading to RLS-TD(λ), in this work we show that the forgetting factor commonly used in gradient-based estimation has a similar role to the mechanism of eligibility traces. We adopt an instrumental variable perspective to illustrate this point and we propose a new algorithm, namely - RLS-TD with forgetting factor (RLS-TD-f). We test the proposed algorithm in a Policy Iteration setting, i.e. when the performance of an initially stabilizing controller must be improved. We take the cart-pole benchmark as experimental platform: extensive experiments show that the proposed RLS-TD algorithm exhibits larger performance improvements in the largest portion of the state space
End-to-end detection-segmentation network with ROI convolution
We propose an end-to-end neural network that improves the segmentation
accuracy of fully convolutional networks by incorporating a localization unit.
This network performs object localization first, which is then used as a cue to
guide the training of the segmentation network. We test the proposed method on
a segmentation task of small objects on a clinical dataset of ultrasound
images. We show that by jointly learning for detection and segmentation, the
proposed network is able to improve the segmentation accuracy compared to only
learning for segmentation. Code is publicly available at
https://github.com/vincentzhang/roi-fcn.Comment: ISBI 201
Gradient estimates for porous medium and fast diffusion equations by martingale method
International audienceIn this paper, we establish several local and global gradient estimates for the positive solution of Porous Medium Equations (PMEs) and Fast Diffusion Equations (FDEs). Our proof is probabilistic and uses martingale techniques
- …