6 research outputs found
Vietnamese transition-based dependency parsing with supertag features
In recent years, dependency parsing is a fascinating research topic and has a
lot of applications in natural language processing. In this paper, we present
an effective approach to improve dependency parsing by utilizing supertag
features. We performed experiments with the transition-based dependency parsing
approach because it can take advantage of rich features. Empirical evaluation
on Vietnamese Dependency Treebank showed that, we achieved an improvement of
18.92% in labeled attachment score with gold supertags and an improvement of
3.57% with automatic supertags.Comment: 2016 Eighth International Conference on Knowledge and Systems
Engineering (KSE
Emotion Recognition for Vietnamese Social Media Text
Emotion recognition or emotion prediction is a higher approach or a special
case of sentiment analysis. In this task, the result is not produced in terms
of either polarity: positive or negative or in the form of rating (from 1 to 5)
but of a more detailed level of analysis in which the results are depicted in
more expressions like sadness, enjoyment, anger, disgust, fear, and surprise.
Emotion recognition plays a critical role in measuring the brand value of a
product by recognizing specific emotions of customers' comments. In this study,
we have achieved two targets. First and foremost, we built a standard
Vietnamese Social Media Emotion Corpus (UIT-VSMEC) with exactly 6,927
emotion-annotated sentences, contributing to emotion recognition research in
Vietnamese which is a low-resource language in natural language processing
(NLP). Secondly, we assessed and measured machine learning and deep neural
network models on our UIT-VSMEC corpus. As a result, the CNN model achieved the
highest performance with the weighted F1-score of 59.74%. Our corpus is
available at our research website.Comment: PACLING 201
A neural joint model for Vietnamese word segmentation, POS tagging and dependency parsing
We propose the first multi-task learning model for joint Vietnamese word
segmentation, part-of-speech (POS) tagging and dependency parsing. In
particular, our model extends the BIST graph-based dependency parser
(Kiperwasser and Goldberg, 2016) with BiLSTM-CRF-based neural layers (Huang et
al., 2015) for word segmentation and POS tagging. On Vietnamese benchmark
datasets, experimental results show that our joint model obtains
state-of-the-art or competitive performances.Comment: In Proceedings of the 17th Annual Workshop of the Australasian
Language Technology Association (ALTA 2019
Syntax-aware Neural Semantic Role Labeling with Supertags
We introduce a new syntax-aware model for dependency-based semantic role
labeling that outperforms syntax-agnostic models for English and Spanish. We
use a BiLSTM to tag the text with supertags extracted from dependency parses,
and we feed these supertags, along with words and parts of speech, into a deep
highway BiLSTM for semantic role labeling. Our model combines the strengths of
earlier models that performed SRL on the basis of a full dependency parse with
more recent models that use no syntactic information at all. Our local and
non-ensemble model achieves state-of-the-art performance on the CoNLL 09
English and Spanish datasets. SRL models benefit from syntactic information,
and we show that supertagging is a simple, powerful, and robust way to
incorporate syntax into a neural SRL system.Comment: NAACL 2019, Added Spanish ELMo result
LSTM Easy-first Dependency Parsing with Pre-trained Word Embeddings and Character-level Word Embeddings in Vietnamese
In Vietnamese dependency parsing, several methods have been proposed.
Dependency parser which uses deep neural network model has been reported that
achieved state-of-the-art results. In this paper, we proposed a new method
which applies LSTM easy-first dependency parsing with pre-trained word
embeddings and character-level word embeddings. Our method achieves an accuracy
of 80.91% of unlabeled attachment score and 72.98% of labeled attachment score
on the Vietnamese Dependency Treebank (VnDT)
Deep Learning versus Traditional Classifiers on Vietnamese Students' Feedback Corpus
Student's feedback is an important source of collecting students' opinions to
improve the quality of training activities. Implementing sentiment analysis
into student feedback data, we can determine sentiments polarities which
express all problems in the institution since changes necessary will be applied
to improve the quality of teaching and learning. This study focused on machine
learning and natural language processing techniques (NaiveBayes, Maximum
Entropy, Long Short-Term Memory, Bi-Directional Long Short-Term Memory) on the
VietnameseStudents' Feedback Corpus collected from a university. The final
results were compared and evaluated to find the most effective model based on
different evaluation criteria. The experimental results show that the
Bi-Directional LongShort-Term Memory algorithm outperformed than three other
algorithms in terms of the F1-score measurement with 92.0% on the sentiment
classification task and 89.6% on the topic classification task. In addition, we
developed a sentiment analysis application analyzing student feedback. The
application will help the institution to recognize students' opinions about a
problem and identify shortcomings that still exist. With the use of this
application, the institution can propose an appropriate method to improve the
quality of training activities in the future.Comment: In Proceeding of the 5th NAFOSTED Conference on Information and
Computer Science (NICS 2018