Search CORE

609 research outputs found

The Construction of a Dictionary for a Two-layer Chinese Morphological Analyzer

Author: Cheng Yuchang
Goh Chooi-Ling
Lu Jia
松本裕治
浅原正幸
Publication venue: 'Tsinghua University Press'
Publication date: 01/10/2006
Field of study

PACLIC 20 / Wuhan, China / 1-3 November, 200

Advances in Joint CTC-Attention based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM

Author: Chan William
Hori Takaaki
Watanabe Shinji
Zhang Yu
Publication venue
Publication date: 08/06/2017
Field of study

We present a state-of-the-art end-to-end Automatic Speech Recognition (ASR) model. We learn to listen and write characters with a joint Connectionist Temporal Classification (CTC) and attention-based encoder-decoder network. The encoder is a deep Convolutional Neural Network (CNN) based on the VGG network. The CTC network sits on top of the encoder and is jointly trained with the attention-based decoder. During the beam search process, we combine the CTC predictions, the attention-based decoder predictions and a separately trained LSTM language model. We achieve a 5-10\% error reduction compared to prior systems on spontaneous Japanese and Chinese speech, and our end-to-end model beats out traditional hybrid ASR systems.Comment: Accepted for INTERSPEECH 201

arXiv.org e-Print Archive

Prediction of NOx Emissions from a Biomass Fired Combustion Process Based on Flame Radical Imaging and Deep Learning Techniques

Author: Li Nan
Li Xinli
Lu Gang
Yan Yong
Publication venue: 'Informa UK Limited'
Publication date: 23/12/2015
Field of study

This article presents a methodology for predicting NOx emissions from a biomass combustion process through flame radical imaging and deep learning (DL). The dataset was established experimentally from flame radical images captured on a biomass-gas fired test rig. Morphological component analysis is undertaken to improve the quality of the dataset, and the region-of-interest extraction is introduced to extract the flame radical part and rescale the image size. The developed DL-based prediction model contains three successive stages for implementing the feature extraction, feature fusion, and emission prediction. The fine-tuning based on the prediction is introduced to adjust the process of the feature fusion. The effects of the feature fusion and fine-tuning are discussed in detail. A comparison between various image- and machine-learning-based prediction models show that the proposed DL prediction model outperforms other models in terms of root mean square error criteria. The predicted NOx emissions are in good agreement with the measurement results

Statistical Parsing by Machine Learning from a Classical Arabic Treebank

Author: Dukes Kais
Publication venue: University of Leeds
Publication date: 01/09/2013
Field of study

Research into statistical parsing for English has enjoyed over a decade of successful results. However, adapting these models to other languages has met with difficulties. Previous comparative work has shown that Modern Arabic is one of the most difficult languages to parse due to rich morphology and free word order. Classical Arabic is the ancient form of Arabic, and is understudied in computational linguistics, relative to its worldwide reach as the language of the Quran. The thesis is based on seven publications that make significant contributions to knowledge relating to annotating and parsing Classical Arabic. Classical Arabic has been studied in depth by grammarians for over a thousand years using a traditional grammar known as i’rāb (إعغاة ). Using this grammar to develop a representation for parsing is challenging, as it describes syntax using a hybrid of phrase-structure and dependency relations. This work aims to advance the state-of-the-art for hybrid parsing by introducing a formal representation for annotation and a resource for machine learning. The main contributions are the first treebank for Classical Arabic and the first statistical dependency-based parser in any language for ellipsis, dropped pronouns and hybrid representations. A central argument of this thesis is that using a hybrid representation closely aligned to traditional grammar leads to improved parsing for Arabic. To test this hypothesis, two approaches are compared. As a reference, a pure dependency parser is adapted using graph transformations, resulting in an 87.47% F1-score. This is compared to an integrated parsing model with an F1-score of 89.03%, demonstrating that joint dependency-constituency parsing is better suited to Classical Arabic. The Quran was chosen for annotation as a large body of work exists providing detailed syntactic analysis. Volunteer crowdsourcing is used for annotation in combination with expert supervision. A practical result of the annotation effort is the corpus website: http://corpus.quran.com, an educational resource with over two million users per year