5,277 research outputs found
Weakly Supervised Cross-Lingual Named Entity Recognition via Effective Annotation and Representation Projection
The state-of-the-art named entity recognition (NER) systems are supervised
machine learning models that require large amounts of manually annotated data
to achieve high accuracy. However, annotating NER data by human is expensive
and time-consuming, and can be quite difficult for a new language. In this
paper, we present two weakly supervised approaches for cross-lingual NER with
no human annotation in a target language. The first approach is to create
automatically labeled NER data for a target language via annotation projection
on comparable corpora, where we develop a heuristic scheme that effectively
selects good-quality projection-labeled data from noisy data. The second
approach is to project distributed representations of words (word embeddings)
from a target language to a source language, so that the source-language NER
system can be applied to the target language without re-training. We also
design two co-decoding schemes that effectively combine the outputs of the two
projection-based approaches. We evaluate the performance of the proposed
approaches on both in-house and open NER data for several target languages. The
results show that the combined systems outperform three other weakly supervised
approaches on the CoNLL data.Comment: 11 pages, The 55th Annual Meeting of the Association for
Computational Linguistics (ACL), 201
Learning Character-level Compositionality with Visual Features
Previous work has modeled the compositionality of words by creating
character-level models of meaning, reducing problems of sparsity for rare
words. However, in many writing systems compositionality has an effect even on
the character-level: the meaning of a character is derived by the sum of its
parts. In this paper, we model this effect by creating embeddings for
characters based on their visual characteristics, creating an image for the
character and running it through a convolutional neural network to produce a
visual character embedding. Experiments on a text classification task
demonstrate that such model allows for better processing of instances with rare
characters in languages such as Chinese, Japanese, and Korean. Additionally,
qualitative analyses demonstrate that our proposed model learns to focus on the
parts of characters that carry semantic content, resulting in embeddings that
are coherent in visual space.Comment: Accepted to ACL 201
A Syllable-based Technique for Word Embeddings of Korean Words
Word embedding has become a fundamental component to many NLP tasks such as
named entity recognition and machine translation. However, popular models that
learn such embeddings are unaware of the morphology of words, so it is not
directly applicable to highly agglutinative languages such as Korean. We
propose a syllable-based learning model for Korean using a convolutional neural
network, in which word representation is composed of trained syllable vectors.
Our model successfully produces morphologically meaningful representation of
Korean words compared to the original Skip-gram embeddings. The results also
show that it is quite robust to the Out-of-Vocabulary problem.Comment: 5 pages, 3 figures, 1 table. Accepted for EMNLP 2017 Workshop - The
1st Workshop on Subword and Character level models in NLP (SCLeM
A Sub-Character Architecture for Korean Language Processing
We introduce a novel sub-character architecture that exploits a unique
compositional structure of the Korean language. Our method decomposes each
character into a small set of primitive phonetic units called jamo letters from
which character- and word-level representations are induced. The jamo letters
divulge syntactic and semantic information that is difficult to access with
conventional character-level units. They greatly alleviate the data sparsity
problem, reducing the observation space to 1.6% of the original while
increasing accuracy in our experiments. We apply our architecture to dependency
parsing and achieve dramatic improvement over strong lexical baselines.Comment: EMNLP 201
๊ฐ์ฒด๋ช ์ธ์์ ์ํ ์กฐ์ ํ๋ ํ์๋ฒ์ ๊ณ ๋ คํ๋ ๋ด๋ด ๋ชจ๋ธ
ํ์๋
ผ๋ฌธ (์์ฌ)-- ์์ธ๋ํ๊ต ๋ํ์ : ๊ณต๊ณผ๋ํ ์ ๊ธฐยท์ ๋ณด๊ณตํ๋ถ, 2019. 2. ๊นํํ.๊ฐ์ฒด๋ช
์ธ์ (NER) ์ ์์ฐ์ธ์ด์ฒ๋ฆฌ ์๋ฌด๋ค ์ค ์ค์ํ ์๋ฌด์
๋๋ค. ์ด ๋ฌธ์ ์ ๋ํด ๊ธฐ์กด ๊ธฐ์ ์ ์๋ฐฉํฅ ์ํ์ ๊ฒฝ๋ง (BiRNN) ๊ณผ ์กฐ๊ฑด๋ถ ๋ฌด์ ์์ฅ (CRF) ๋ฅผ ํ์ฉํ๋ ๋ฐฉ๋ฒ์
๋๋ค. ๋ณธ ๋
ผ๋ฌธ์ ๊ธฐ๊ณ๋ฒ์ญ ๋ถ์ผ์์ ๋์จ attention์ด๋ ์ปจ์
ํธ์๊ฒ์ ์๊ฐ์ ๋ฐ์ผ๋ฉฐ ๋ชจ๋ธ์ ์ด๋ฃจ์์ต๋๋ค. ์ด ๋ชจ๋ธ์ ํธ๋ ์ด๋ ํ ๋ ๋์ ์ผ๋ก ํ ๋จ์ด์ character-level ํ์๋ฒ๊ณผ ๋จ์ด ์๋ฒ ๋ฉ์ ์จ์ดํธ๋ค์ ๊ฒฐ์ ํ๋ฏ๋ก ๋ชจ๋ธ์ ํจ๊ณผ๋ฅผ ์ฆ๊ฐ์ํต๋๋ค. ๋ณธ ๋
ผ๋ฌธ์ ๋ค์ธ์ด ๋ฐ์ดํฐ์
(์์ด, ์คํ์ธ์ด, ๋ค๋๋๋์ด) ์์ ์คํ์ ์งํํ๊ณ F1 ์ ์์ ๋น๊ต๋ฅผ ํตํด์ ๋ค๋ฅธ ์ต์ ์ฐ๊ตฌ๋ณด๋ค ์ ํ๋๊ฐ ๋์์ก์ต๋๋ค. ๋ํ, ๋
ผ๋ฌธ์ ๋ค์ํ ๋ชจ๋ธ ๋ฐฐ์น ๋ฐฉ์์ ๋ถ์ํด์ hidden layer์์ ๋จ์ด ์๋ฒ ๋ฉ์ด ์ด ๋ชจ๋ธ์๊ฒ ์ฃผ๋ ์ํฅ, ๋ชจ๋ธ์ ์คํ ์๊ฐ๊ณผ ํจ์จ๋ ํ ๋ก ํ์ต๋๋ค.Sequence tagging is an important task in Natural Language Processing (NLP), in which the Named Entity Recognition (NER) is the key issue. So far the most widely adopted model for NER in NLP is that of combining the neural network of bidirectional long short-term memory (BiLSTM) and the statistical sequence prediction method of Conditional Random Field (CRF). In this work, we improve the prediction accuracy of the BiLSTM model by supporting an aligned character and word-level representation mechanism. We have performed experiments on multilingual (English, Spanish and Dutch) datasets and confirmed that our proposed model out-performed the existing state-of-the-art models.1 Introduction
1.1 Study Background
1.2 Purpose of Research
2 The Proposed Model
2.1 Character-level BiLSTM
2.2 Attention Mechanism
2. 2.1 The concept of attention
2.2.2 Word embedding
2.2.3 Our application
2.3 Word-level BiLSTM-CRF
2.3.1 LSTM with Conditional Random Field
2.3.2 Highway layer
3 Experiment
3.1 datasets
3.2 Training
3.3 Performance
3.3.1 Evaluation criterion
3.3.2 NER results
3.3.3 Other results
4 ConclusionMaste
- โฆ