Syllable-aware Neural Language Models: A Failure to Beat Character-aware Ones
Syllabification does not seem to improve word-level RNN language modeling
quality when compared to character-based segmentation. However, our best
syllable-aware language model, which achieves performance comparable to the
competitive character-aware model, has 18%-33% fewer parameters and is trained
1.2-2.2 times faster.
Comment: EMNLP 2017
Character-level Intra Attention Network for Natural Language Inference
Natural language inference (NLI) is a central problem in language
understanding. End-to-end artificial neural networks have recently reached
state-of-the-art performance in the NLI field.
In this paper, we propose Character-level Intra Attention Network (CIAN) for
the NLI task. In our model, we use the character-level convolutional network to
replace the standard word embedding layer, and we use the intra attention to
capture the intra-sentence semantics. The proposed CIAN model yields improved
results on the newly published MNLI corpus.
Comment: EMNLP Workshop RepEval 2017: The Second Workshop on Evaluating Vector
Space Representations for NLP
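The CIAN abstract above combines two ideas: a character-level convolution that replaces the word embedding layer, and intra attention within a sentence. A minimal NumPy sketch of both, assuming illustrative dimensions, filter widths, and random weights (none of these values are from the paper):

```python
import numpy as np

rng = np.random.default_rng(0)

def char_conv_embed(word, char_vecs, filters, width=3):
    """Character-level convolution + max-over-time pooling (one filter width)."""
    chars = np.stack([char_vecs[c] for c in word])           # (L, d)
    windows = np.stack([chars[i:i + width].ravel()           # (L-w+1, w*d)
                        for i in range(len(word) - width + 1)])
    return (windows @ filters).max(axis=0)                   # (n_filters,)

def intra_attention(H):
    """Attend each token representation to every token in the same sentence."""
    scores = H @ H.T                                         # (T, T) similarities
    weights = np.exp(scores - scores.max(axis=1, keepdims=True))
    weights /= weights.sum(axis=1, keepdims=True)            # row-wise softmax
    return weights @ H                                       # (T, n_filters)

d, n_filters = 8, 16
char_vecs = {c: rng.normal(size=d) for c in "abcdefghijklmnopqrstuvwxyz"}
filters = rng.normal(size=(3 * d, n_filters))                # random, untrained

sentence = ["character", "level", "attention"]
H = np.stack([char_conv_embed(w, char_vecs, filters) for w in sentence])
A = intra_attention(H)
print(H.shape, A.shape)   # (3, 16) (3, 16)
```

In a trained model the filters and character vectors are learned end to end; here they are random, so only the shapes and data flow are meaningful.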
Context-Aware Prediction of Derivational Word-forms
Derivational morphology is a fundamental and complex characteristic of
language. In this paper we propose the new task of predicting the derivational
form of a given base-form lemma that is appropriate for a given context. We
present an encoder--decoder style neural network to produce a derived form
character-by-character, based on its corresponding character-level
representation of the base form and the context. We demonstrate that our model
is able to generate valid context-sensitive derivations from known base forms,
but is less accurate in a lexicon-agnostic setting.
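The abstract above describes encoding a base-form lemma plus its context and then emitting the derived form character by character. A toy sketch of that control flow, with hypothetical mean-pooling encoders and a random, untrained output projection (the decoded string is therefore nonsense; only the encode/decode loop structure reflects the described setup):

```python
import numpy as np

rng = np.random.default_rng(1)
alphabet = list("abcdefghijklmnopqrstuvwxyz") + ["<eos>"]
d = 16
char_vecs = {c: rng.normal(size=d) for c in alphabet}
W_out = rng.normal(size=(2 * d, len(alphabet)))   # untrained projection

def encode(base, context):
    """Mean-pool character vectors of the base form and of the context words."""
    base_v = np.mean([char_vecs[c] for c in base], axis=0)
    ctx_v = np.mean([char_vecs[c] for w in context for c in w], axis=0)
    return np.concatenate([base_v, ctx_v])        # (2d,) decoder start state

def decode_greedy(state, max_len=12):
    """Emit one character at a time until <eos> or max_len."""
    out = []
    for _ in range(max_len):
        logits = state @ W_out
        c = alphabet[int(logits.argmax())]
        if c == "<eos>":
            break
        out.append(c)
        # fold the emitted character back into the state (toy recurrence)
        state = state + np.concatenate([char_vecs[c], np.zeros(d)])
    return "".join(out)

derived = decode_greedy(encode("wide", ["the", "gap"]))
print(derived)
```

A real model would use trained RNN encoder and decoder states; this sketch only shows how the context-conditioned state drives per-character generation.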
Character-Level Incremental Speech Recognition with Recurrent Neural Networks
In real-time speech recognition applications, the latency is an important
issue. We have developed a character-level incremental speech recognition (ISR)
system that responds quickly even during the speech, where the hypotheses are
gradually improved while the speaking proceeds. The algorithm employs a
speech-to-character unidirectional recurrent neural network (RNN), which is
end-to-end trained with connectionist temporal classification (CTC), and an
RNN-based character-level language model (LM). The output values of the
CTC-trained RNN are character-level probabilities, which are processed by beam
search decoding. The RNN LM augments the decoding by providing long-term
dependency information. We propose tree-based online beam search with
additional depth-pruning, which enables the system to process infinitely long
input speech with low latency. This system not only responds quickly on speech
but also can dictate out-of-vocabulary (OOV) words according to pronunciation.
The proposed model achieves a word error rate (WER) of 8.90% on the Wall
Street Journal (WSJ) Nov'92 20K evaluation set when trained on the WSJ SI-284
training set.
Comment: To appear in ICASSP 2016
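The abstract above decodes the CTC-trained RNN's per-frame character probabilities with an LM-augmented tree-based beam search. As a much simpler sketch of how CTC outputs become text, here is best-path decoding (take the argmax per frame, collapse repeats, drop blanks); the alphabet and probability matrix are made up for illustration:

```python
import numpy as np

def ctc_best_path(probs, alphabet, blank=0):
    """Best-path CTC decoding: collapse repeated symbols, then remove blanks."""
    best = probs.argmax(axis=1)           # most likely symbol per frame
    out, prev = [], blank
    for idx in best:
        if idx != blank and idx != prev:  # skip blanks and repeats
            out.append(alphabet[idx])
        prev = idx
    return "".join(out)

alphabet = ["-", "c", "a", "t"]           # index 0 is the CTC blank
# toy (T=6, V=4) frame-wise probabilities
probs = np.array([
    [0.1, 0.7, 0.1, 0.1],    # c
    [0.1, 0.7, 0.1, 0.1],    # c (repeat, collapsed)
    [0.8, 0.1, 0.05, 0.05],  # blank
    [0.1, 0.1, 0.7, 0.1],    # a
    [0.1, 0.1, 0.1, 0.7],    # t
    [0.1, 0.1, 0.1, 0.7],    # t (repeat, collapsed)
])
print(ctc_best_path(probs, alphabet))     # -> cat
```

Beam search as used in the paper keeps multiple prefix hypotheses per frame and rescores them with the character-level RNN LM, which is what allows out-of-vocabulary words to be dictated by pronunciation.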