Search CORE

202 research outputs found

A Comparative Study on Regularization Strategies for Embedding-based Neural Networks

Author: Chen Yunchuan
Jin Zhi
Li Ge
Lu Yangyang
Mou Lili
Peng Hao
Publication venue
Publication date: 01/01/2015
Field of study

This paper aims to compare different regularization strategies to address a common phenomenon, severe overfitting, in embedding-based neural networks for NLP. We chose two widely studied neural models and tasks as our testbed. We tried several frequently applied or newly proposed regularization strategies, including penalizing weights (embeddings excluded), penalizing embeddings, re-embedding words, and dropout. We also emphasized on incremental hyperparameter tuning, and combining different regularizations. The results provide a picture on tuning hyperparameters for neural NLP models.Comment: EMNLP '1

arXiv.org e-Print Archive

Crossref

Radical-Enhanced Chinese Character Embedding

Author: Ji Zhenzhou
Lin Lei
Sun Yaming
Tang Duyu
Wang Xiaolong
Yang Nan
Publication venue
Publication date: 01/01/2014
Field of study

We present a method to leverage radical for learning Chinese character embedding. Radical is a semantic and phonetic component of Chinese character. It plays an important role as characters with the same radical usually have similar semantic meaning and grammatical usage. However, existing Chinese processing algorithms typically regard word or character as the basic unit but ignore the crucial radical information. In this paper, we fill this gap by leveraging radical for learning continuous representation of Chinese character. We develop a dedicated neural architecture to effectively learn character embedding and apply it on Chinese character similarity judgement and Chinese word segmentation. Experiment results show that our radical-enhanced method outperforms existing embedding learning algorithms on both tasks.Comment: 8 pages, 4 figure

arXiv.org e-Print Archive

Crossref