Search CORE

13,605 research outputs found

A Principled Approach for Learning Task Similarity in Multitask Learning

Author: Abbasi Mahdieh
Gagné Christian
Robitaille Louis-Émile
Shui Changjian
Wang Boyu
Publication venue: 'International Joint Conferences on Artificial Intelligence'
Publication date: 31/05/2019
Field of study

Multitask learning aims at solving a set of related tasks simultaneously, by exploiting the shared knowledge for improving the performance on individual tasks. Hence, an important aspect of multitask learning is to understand the similarities within a set of tasks. Previous works have incorporated this similarity information explicitly (e.g., weighted loss for each task) or implicitly (e.g., adversarial loss for feature adaptation), for achieving good empirical performances. However, the theoretical motivations for adding task similarity knowledge are often missing or incomplete. In this paper, we give a different perspective from a theoretical point of view to understand this practice. We first provide an upper bound on the generalization error of multitask learning, showing the benefit of explicit and implicit task similarity knowledge. We systematically derive the bounds based on two distinct task similarity metrics: H divergence and Wasserstein distance. From these theoretical results, we revisit the Adversarial Multi-task Neural Network, proposing a new training algorithm to learn the task relation coefficients and neural network parameters iteratively. We assess our new algorithm empirically on several benchmarks, showing not only that we find interesting and robust task relations, but that the proposed approach outperforms the baselines, reaffirming the benefits of theoretical insight in algorithm design

arXiv.org e-Print Archive

Crossref

A comparative review of dynamic neural networks and hidden Markov model methods for mobile on-device speech recognition

Author: B-H Juang
E Zarrouk
Kofi Appiah
LR Rabiner
M Benzeghiba
Mohammed Kyari Mustafa
P Vaidyanathan
RC Rose
T Kamm
Tony Allen
X Huang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 03/06/2017
Field of study

The adoption of high-accuracy speech recognition algorithms without an effective evaluation of their impact on the target computational resource is impractical for mobile and embedded systems. In this paper, techniques are adopted to minimise the required computational resource for an effective mobile-based speech recognition system. A Dynamic Multi-Layer Perceptron speech recognition technique, capable of running in real time on a state-of-the-art mobile device, has been introduced. Even though a conventional hidden Markov model when applied to the same dataset slightly outperformed our approach, its processing time is much higher. The Dynamic Multi-layer Perceptron presented here has an accuracy level of 96.94% and runs significantly faster than similar techniques

Crossref

Nottingham Trent Institutional Repository (IRep)

Sheffield Hallam University Research Archive