2 research outputs found
Modeling Output-Level Task Relatedness in Multi-Task Learning with Feedback Mechanism
Multi-task learning (MTL) is a paradigm that simultaneously learns multiple
tasks by sharing information at different levels, enhancing the performance of
each individual task. While previous research has primarily focused on
feature-level or parameter-level task relatedness, and proposed various model
architectures and learning algorithms to improve learning performance, we aim
to explore output-level task relatedness. This approach introduces a posteriori
information into the model, considering that different tasks may produce
correlated outputs with mutual influences. We achieve this by incorporating a
feedback mechanism into MTL models, where the output of one task serves as a
hidden feature for another task, thereby transforming a static MTL model into a
dynamic one. To ensure the training process converges, we introduce a
convergence loss that measures the trend of a task's outputs during each
iteration. Additionally, we propose a Gumbel gating mechanism to determine the
optimal projection of feedback signals. We validate the effectiveness of our
method and evaluate its performance through experiments conducted on several
baseline models in spoken language understanding.Comment: submitted to CDC202
Estimation of 3D human hand poses with structured pose prior
Here, the authors present multistage estimation model embedding with structured pose prior (SPP), a novel coarseâtoâfine framework for realâtime 3D hand estimation from single depth image. Authorsâ main contributions can be summarised as follows: (i) The authors proposed SPP to enforce constraints of canonical hand pose instead of original hand pose. (ii) The authors are the first to adopt underâcomplete stacked denoising autoâencoder (SDA) to construct pose prior by mapping canonical hand pose to latent representation. In the case of enforcing constraints of canonical hand pose, the authors empirically validate that underâcomplete SDA outperforms overâcomplete SDA in improving the hand estimation accuracy. (iii) The authors propose candidate keypoints patches (CKP) as intermediate data to conduct further hand pose refinement. Experimental evaluation on two publically available datasets shows that authorsâ model is competitive both in accuracy and computation time. Especially, authorsâ method placed first in the location of palm keyâpoint on both two datasets, and the high accuracy of hand palm keyâpoint plays an important role in many applications, such as that manipulator can grasp objects to specific coordinates with the guiding of human hand palm