1,244 research outputs found
A Chinese Named Entity Recognition System with Neural Networks
Named entity recognition (NER) is a typical sequential labeling problem that plays an important role in natural language processing (NLP) systems. In this paper, we discussed the details of applying a comprehensive model aggregating neural networks and conditional random field (CRF) on Chinese NER tasks, and how to discovery character level features when implement a NER system in word level. We compared the difference between Chinese and English when modeling the character embeddings. We developed a NER system based on our analysis, it works well on the ACE 2004 and SIGHAN bakeoff 2006 MSRA dataset, and doesn’t rely on any gazetteers or handcraft features. We obtained F1 score of 82.3% on MSRA 2006
A Survey on Semantic Processing Techniques
Semantic processing is a fundamental research domain in computational
linguistics. In the era of powerful pre-trained language models and large
language models, the advancement of research in this domain appears to be
decelerating. However, the study of semantics is multi-dimensional in
linguistics. The research depth and breadth of computational semantic
processing can be largely improved with new technologies. In this survey, we
analyzed five semantic processing tasks, e.g., word sense disambiguation,
anaphora resolution, named entity recognition, concept extraction, and
subjectivity detection. We study relevant theoretical research in these fields,
advanced methods, and downstream applications. We connect the surveyed tasks
with downstream applications because this may inspire future scholars to fuse
these low-level semantic processing tasks with high-level natural language
processing tasks. The review of theoretical research may also inspire new tasks
and technologies in the semantic processing domain. Finally, we compare the
different semantic processing techniques and summarize their technical trends,
application trends, and future directions.Comment: Published at Information Fusion, Volume 101, 2024, 101988, ISSN
1566-2535. The equal contribution mark is missed in the published version due
to the publication policies. Please contact Prof. Erik Cambria for detail
Incorporating Deep Syntactic and Semantic Knowledge for Chinese Sequence Labeling with GCN
Recently, it is quite common to integrate Chinese sequence labeling results
to enhance syntactic and semantic parsing. However, little attention has been
paid to the utility of hierarchy and structure information encoded in syntactic
and semantic features for Chinese sequence labeling tasks. In this paper, we
propose a novel framework to encode syntactic structure features and semantic
information for Chinese sequence labeling tasks with graph convolutional
networks (GCN). Experiments on five benchmark datasets, including Chinese word
segmentation and part-of-speech tagging, demonstrate that our model can
effectively improve the performance of Chinese labeling tasks.Comment: 10 pages,3 Figures, 6 Table
- …