Search CORE

6 research outputs found

Vietnamese transition-based dependency parsing with supertag features

Author: Nguyen Ngan Luu-Thuy
Van Nguyen Kiet
Publication venue
Publication date: 09/11/2019
Field of study

In recent years, dependency parsing is a fascinating research topic and has a lot of applications in natural language processing. In this paper, we present an effective approach to improve dependency parsing by utilizing supertag features. We performed experiments with the transition-based dependency parsing approach because it can take advantage of rich features. Empirical evaluation on Vietnamese Dependency Treebank showed that, we achieved an improvement of 18.92% in labeled attachment score with gold supertags and an improvement of 3.57% with automatic supertags.Comment: 2016 Eighth International Conference on Knowledge and Systems Engineering (KSE

arXiv.org e-Print Archive

Emotion Recognition for Vietnamese Social Media Text

Author: Ho Vong Anh
Nguyen Danh Hoang
Nguyen Duc-Vu
Nguyen Duong Huynh-Cong
Nguyen Ngan Luu-Thuy
Pham Linh Thi-Van
Van Nguyen Kiet
Publication venue
Publication date: 26/01/2020
Field of study

Emotion recognition or emotion prediction is a higher approach or a special case of sentiment analysis. In this task, the result is not produced in terms of either polarity: positive or negative or in the form of rating (from 1 to 5) but of a more detailed level of analysis in which the results are depicted in more expressions like sadness, enjoyment, anger, disgust, fear, and surprise. Emotion recognition plays a critical role in measuring the brand value of a product by recognizing specific emotions of customers' comments. In this study, we have achieved two targets. First and foremost, we built a standard Vietnamese Social Media Emotion Corpus (UIT-VSMEC) with exactly 6,927 emotion-annotated sentences, contributing to emotion recognition research in Vietnamese which is a low-resource language in natural language processing (NLP). Secondly, we assessed and measured machine learning and deep neural network models on our UIT-VSMEC corpus. As a result, the CNN model achieved the highest performance with the weighted F1-score of 59.74%. Our corpus is available at our research website.Comment: PACLING 201

arXiv.org e-Print Archive

A neural joint model for Vietnamese word segmentation, POS tagging and dependency parsing

Author: Nguyen Dat Quoc
Publication venue
Publication date: 11/11/2019
Field of study

We propose the first multi-task learning model for joint Vietnamese word segmentation, part-of-speech (POS) tagging and dependency parsing. In particular, our model extends the BIST graph-based dependency parser (Kiperwasser and Goldberg, 2016) with BiLSTM-CRF-based neural layers (Huang et al., 2015) for word segmentation and POS tagging. On Vietnamese benchmark datasets, experimental results show that our joint model obtains state-of-the-art or competitive performances.Comment: In Proceedings of the 17th Annual Workshop of the Australasian Language Technology Association (ALTA 2019

arXiv.org e-Print Archive

Syntax-aware Neural Semantic Role Labeling with Supertags

Author: Frank Robert
Friedman Dan
Kasai Jungo
Radev Dragomir
Rambow Owen
Publication venue
Publication date: 03/04/2019
Field of study

We introduce a new syntax-aware model for dependency-based semantic role labeling that outperforms syntax-agnostic models for English and Spanish. We use a BiLSTM to tag the text with supertags extracted from dependency parses, and we feed these supertags, along with words and parts of speech, into a deep highway BiLSTM for semantic role labeling. Our model combines the strengths of earlier models that performed SRL on the basis of a full dependency parse with more recent models that use no syntactic information at all. Our local and non-ensemble model achieves state-of-the-art performance on the CoNLL 09 English and Spanish datasets. SRL models benefit from syntactic information, and we show that supertagging is a simple, powerful, and robust way to incorporate syntax into a neural SRL system.Comment: NAACL 2019, Added Spanish ELMo result

arXiv.org e-Print Archive

LSTM Easy-first Dependency Parsing with Pre-trained Word Embeddings and Character-level Word Embeddings in Vietnamese

Author: Nguyen Binh Duc
Nguyen Ngan Luu-Thuy
Van Nguyen Kiet
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 30/10/2019
Field of study

In Vietnamese dependency parsing, several methods have been proposed. Dependency parser which uses deep neural network model has been reported that achieved state-of-the-art results. In this paper, we proposed a new method which applies LSTM easy-first dependency parsing with pre-trained word embeddings and character-level word embeddings. Our method achieves an accuracy of 80.91% of unlabeled attachment score and 72.98% of labeled attachment score on the Vietnamese Dependency Treebank (VnDT)

arXiv.org e-Print Archive

Deep Learning versus Traditional Classifiers on Vietnamese Students' Feedback Corpus

Author: Hong Tham T. T.
Nguyen Ngan Luu-Thuy
Nguyen Phu X. V.
Van Nguyen Kiet
Publication venue
Publication date: 17/11/2019
Field of study

Student's feedback is an important source of collecting students' opinions to improve the quality of training activities. Implementing sentiment analysis into student feedback data, we can determine sentiments polarities which express all problems in the institution since changes necessary will be applied to improve the quality of teaching and learning. This study focused on machine learning and natural language processing techniques (NaiveBayes, Maximum Entropy, Long Short-Term Memory, Bi-Directional Long Short-Term Memory) on the VietnameseStudents' Feedback Corpus collected from a university. The final results were compared and evaluated to find the most effective model based on different evaluation criteria. The experimental results show that the Bi-Directional LongShort-Term Memory algorithm outperformed than three other algorithms in terms of the F1-score measurement with 92.0% on the sentiment classification task and 89.6% on the topic classification task. In addition, we developed a sentiment analysis application analyzing student feedback. The application will help the institution to recognize students' opinions about a problem and identify shortcomings that still exist. With the use of this application, the institution can propose an appropriate method to improve the quality of training activities in the future.Comment: In Proceeding of the 5th NAFOSTED Conference on Information and Computer Science (NICS 2018

arXiv.org e-Print Archive