Towards Structured Deep Neural Network for Automatic Speech Recognition
In this paper we propose the Structured Deep Neural Network (Structured DNN),
a structured and deep learning algorithm that learns to find the best
structured object (such as a label sequence) for a given structured input
(such as a vector sequence) by globally considering the mapping relationship
between the two structures rather than matching them item by item.
When automatic speech recognition (ASR) is viewed as a special case of such a
structured learning problem, with the acoustic vector sequence as the input
and the phoneme label sequence as the output, recognition can be learned
comprehensively utterance by utterance as a whole, rather than frame by
frame.
The Structured Support Vector Machine (structured SVM) was previously
proposed to perform ASR with structured learning, but it is limited by the
linear nature of the SVM. Here we propose the Structured DNN, which uses
nonlinear transformations in multiple layers as a structured and deep
learning algorithm. It was shown to beat the structured SVM in preliminary
experiments on TIMIT.
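The core idea of the abstract above, scoring an entire (input sequence, label sequence) pair with a nonlinear network instead of classifying each frame independently, can be sketched as follows. This is a minimal toy illustration with hypothetical feature and network choices, not the paper's exact model:

```python
# Toy sketch of global structured scoring (hypothetical, not the paper's model).
# A joint feature map summarizes an entire (x_seq, y_seq) pair; a small
# nonlinear network then assigns the pair a single global score, and
# inference picks the label sequence whose score is highest.
from itertools import product

import numpy as np

rng = np.random.default_rng(0)

def joint_features(x_seq, y_seq, n_labels):
    # Toy joint feature map: input frames summed per label ("emission"
    # statistics) plus label-bigram counts (captures sequence structure).
    d = x_seq.shape[1]
    emit = np.zeros((n_labels, d))
    trans = np.zeros((n_labels, n_labels))
    for t, y in enumerate(y_seq):
        emit[y] += x_seq[t]
        if t > 0:
            trans[y_seq[t - 1], y] += 1
    return np.concatenate([emit.ravel(), trans.ravel()])

def dnn_score(phi, W1, W2):
    # Two-layer nonlinear scorer: one scalar score for the whole pair.
    h = np.tanh(W1 @ phi)
    return float(W2 @ h)

# Toy setup: 5 frames of 3-dim "acoustic" vectors, 2 phoneme labels.
x = rng.normal(size=(5, 3))
n_labels = 2
phi_dim = n_labels * 3 + n_labels * n_labels
W1 = rng.normal(size=(8, phi_dim)) * 0.1
W2 = rng.normal(size=(8,)) * 0.1

# Inference: the label sequence with the highest *global* score wins.
# (Brute force over all 2^5 candidates here; real systems approximate this.)
best = max(product(range(n_labels), repeat=5),
           key=lambda y: dnn_score(joint_features(x, list(y), n_labels), W1, W2))
print(best)
```

The contrast with frame-by-frame classification is that the bigram counts in the feature map let the score depend on label transitions across the whole utterance, which a per-frame classifier cannot see.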
Exploring matter wave scattering by means of the phase diagram
For matter wave scattering from passive quantum obstacles, we propose a phase
diagram, in terms of the phase and modulus of the scattering coefficients, to
explore all possible directional scattering patterns. The phase diagram not
only reveals the physical bounds on the scattering coefficients for all
channels, but also indicates the competition among the absorption,
extinction, and scattering cross sections. With the help of this phase
diagram, we discuss different scenarios for steering the scattering
probability distribution through the interference between the - and
-channels. In particular, we reveal the conditions required to implement a
quantum scatterer, e.g., a quantum dot in a semiconductor matrix, with a
minimum (or zero) value of the scattering probability toward any direction.
Our results provide a guideline for designing quantum scatterers for
controlling and sensing matter waves.
Comment: 6 pages, 3 figures
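The competition among the cross sections mentioned above follows the standard balance for a passive scatterer (a general relation in scattering theory, not specific to this paper):

```latex
\sigma_{\mathrm{ext}} = \sigma_{\mathrm{abs}} + \sigma_{\mathrm{sc}}
```

so a bound on the scattering coefficients in each channel constrains how extinction can be split between absorption and scattering.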
DARTS-ASR: Differentiable Architecture Search for Multilingual Speech Recognition and Adaptation
In previous works, only the parameter weights of ASR models were optimized,
under a fixed-topology architecture. However, the design of a successful
model architecture has always relied on human experience and intuition.
Moreover, many hyperparameters related to the model architecture need to be
tuned manually.
Therefore, in this paper we propose an ASR approach with efficient
gradient-based architecture search, DARTS-ASR. To examine the
generalizability of DARTS-ASR, we apply our approach not only to many
languages for monolingual ASR, but also in a multilingual ASR setting.
Following previous works, we conducted experiments on a multilingual dataset,
IARPA BABEL. The experimental results show that our approach outperformed the
baseline fixed-topology architecture, with 10.2% and 10.0% relative
reductions in character error rate under the monolingual and multilingual ASR
settings, respectively. Furthermore, we analyze the architectures searched by
DARTS-ASR.
Comment: Accepted at INTERSPEECH 202
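The gradient-based search the abstract refers to builds on the DARTS idea of a continuous relaxation: each edge in the network computes a softmax-weighted mixture of candidate operations, so the architecture choice itself becomes differentiable. A minimal sketch, with toy stand-in operations and a hand-set architecture-parameter vector (in real DARTS these are learned by gradient descent):

```python
# Minimal sketch of a DARTS-style mixed operation (toy stand-ins, not
# DARTS-ASR's actual candidate ops or training loop).
import numpy as np

def softmax(a):
    e = np.exp(a - a.max())
    return e / e.sum()

# Candidate operations on one edge of the searched cell.
ops = [
    lambda x: x,                 # identity / skip connection
    lambda x: np.maximum(x, 0),  # ReLU, a stand-in for a conv op
    lambda x: np.zeros_like(x),  # "zero" op, effectively pruning the edge
]

# Architecture parameters for this edge; in DARTS these are optimized
# by gradient descent jointly with the network weights.
alpha = np.array([0.5, 1.5, -1.0])

def mixed_op(x, alpha):
    # The edge output is a softmax-weighted sum over ALL candidate ops,
    # which makes the discrete architecture choice differentiable.
    w = softmax(alpha)
    return sum(wi * op(x) for wi, op in zip(w, ops))

x = np.array([-1.0, 2.0])
y = mixed_op(x, alpha)

# After search, the relaxation is discretized: keep the op with the
# largest architecture parameter on each edge.
chosen = int(np.argmax(alpha))
print(chosen, y)
```

Discretizing by `argmax` over `alpha` is what turns the searched soft mixture back into a concrete fixed-topology network for final training.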