Search CORE

8 research outputs found

Context-Dependent Translation Selection Using Convolutional Neural Network

Author: Baotian Hu
Hang Li
Zhaopeng Tu
Zhengdong Lu
† Qingcai Chen
Publication venue
Publication date: 06/03/2020
Field of study

Abstract We propose a novel method for translation selection in statistical machine translation, in which a convolutional neural network is employed to judge the similarity between a phrase pair in two languages. The specifically designed convolutional architecture encodes not only the semantic similarity of the translation pair, but also the context containing the phrase in the source language. Therefore, our approach is able to capture context-dependent semantic similarities of translation pairs. We adopt a curriculum learning strategy to train the model: we classify the training examples into easy, medium, and difficult categories, and gradually build the ability of representing phrases and sentencelevel contexts by using training examples from easy to difficult. Experimental results show that our approach significantly outperforms the baseline system by up to 1.4 BLEU points

CiteSeerX

Memory-enhanced Decoder for Neural Machine Translation

Author: Li Hang
Liu Qun
Lu Zhengdong
Wang Mingxuan
Publication venue
Publication date: 01/01/2016
Field of study

We propose to enhance the RNN decoder in a neural machine translator (NMT) with external memory, as a natural but powerful extension to the state in the decoding RNN. This memory-enhanced RNN decoder is called \textsc{MemDec}. At each time during decoding, \textsc{MemDec} will read from this memory and write to this memory once, both with content-based addressing. Unlike the unbounded memory in previous work\cite{RNNsearch} to store the representation of source sentence, the memory in \textsc{MemDec} is a matrix with pre-determined size designed to better capture the information important for the decoding process at each time step. Our empirical study on Chinese-English translation shows that it can improve by

4.8

BLEU upon Groundhog and

5.3

BLEU upon on Moses, yielding the best performance achieved with the same training set.Comment: 11 page

arXiv.org e-Print Archive

Crossref

Optimisation Method for Training Deep Neural Networks in Classification of Non- functional Requirements

Author: Sabir M.
Sabir M.
Publication venue: London South Bank University
Publication date: 01/01/2022
Field of study

Non-functional requirements (NFRs) are regarded critical to a software system's success. The majority of NFR detection and classification solutions have relied on supervised machine learning models. It is hindered by the lack of labelled data for training and necessitate a significant amount of time spent on feature engineering. In this work we explore emerging deep learning techniques to reduce the burden of feature engineering. The goal of this study is to develop an autonomous system that can classify NFRs into multiple classes based on a labelled corpus. In the first section of the thesis, we standardise the NFRs ontology and annotations to produce a corpus based on five attributes: usability, reliability, efficiency, maintainability, and portability. In the second section, the design and implementation of four neural networks, including the artificial neural network, convolutional neural network, long short-term memory, and gated recurrent unit are examined to classify NFRs. These models, necessitate a large corpus. To overcome this limitation, we proposed a new paradigm for data augmentation. This method uses a sort and concatenates strategy to combine two phrases from the same class, resulting in a two-fold increase in data size while keeping the domain vocabulary intact. We compared our method to a baseline (no augmentation) and an existing approach Easy data augmentation (EDA) with pre-trained word embeddings. All training has been performed under two modifications to the data; augmentation on the entire data before train/validation split vs augmentation on train set only. Our findings show that as compared to EDA and baseline, NFRs classification model improved greatly, and CNN outperformed when trained using our suggested technique in the first setting. However, we saw a slight boost in the second experimental setup with just train set augmentation. As a result, we can determine that augmentation of the validation is required in order to achieve acceptable results with our proposed approach. We hope that our ideas will inspire new data augmentation techniques, whether they are generic or task specific. Furthermore, it would also be useful to implement this strategy in other languages

LSBU Research Open

Computational intelligent impact force modeling and monitoring in HISLO conditions for maximizing surface mining efficiency, safety, and health

Author: Ali Danish
Publication venue: Scholars\u27 Mine
Publication date: 01/01/2021
Field of study

Shovel-truck systems are the most widely employed excavation and material handling systems for surface mining operations. During this process, a high-impact shovel loading operation (HISLO) produces large forces that cause extreme whole body vibrations (WBV) that can severely affect the safety and health of haul truck operators. Previously developed solutions have failed to produce satisfactory results as the vibrations at the truck operator seat still exceed the “Extremely Uncomfortable Limits”. This study was a novel effort in developing deep learning-based solution to the HISLO problem. This research study developed a rigorous mathematical model and a 3D virtual simulation model to capture the dynamic impact force for a multi-pass shovel loading operation. The research further involved the application of artificial intelligence and machine learning for implementing the impact force detection in real time. Experimental results showed the impact force magnitudes of 571 kN and 422 kN, for the first and second shovel pass, respectively, through an accurate representation of HISLO with continuous flow modelling using FEA-DEM coupled methodology. The novel ‘DeepImpact’ model, showed an exceptional performance, giving an R2, RMSE, and MAE values of 0.9948, 10.750, and 6.33, respectively, during the model validation. This research was a pioneering effort for advancing knowledge and frontiers in addressing the WBV challenges in deploying heavy mining machinery in safe and healthy large surface mining environments. The smart and intelligent real-time monitoring system from this study, along with process optimization, minimizes the impact force on truck surface, which in turn reduces the level of vibration on the operator, thus leading to a safer and healthier working mining environments --Abstract, page iii

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine