Deep feature fusion model for sentence semantic matching

Li, F; Lu, W; Peng, X; Zhang, R; Zhang, X

Deep feature fusion model for sentence semantic matching

Authors: F Li
W Lu
X Peng
R Zhang
X Zhang
Publication date: 1 January 2019
Publisher: 'Computers, Materials and Continua (Tech Science Press)'
Doi

Abstract

© 2019 Tech Science Press. All rights reserved. Sentence semantic matching (SSM) is a fundamental research in solving natural language processing tasks such as question answering and machine translation. The latest SSM research benefits from deep learning techniques by incorporating attention mechanism to semantically match given sentences. However, how to fully capture the semantic context without losing significant features for sentence encoding is still a challenge. To address this challenge, we propose a deep feature fusion model and integrate it into the most popular deep learning architecture for sentence matching task. The integrated architecture mainly consists of embedding layer, deep feature fusion layer, matching layer and prediction layer. In addition, we also compare the commonly used loss function, and propose a novel hybrid loss function integrating MSE and cross entropy together, considering confidence interval and threshold setting to preserve the indistinguishable instances in training process. To evaluate our model performance, we experiment on two real world public data sets: LCQMC and Quora. The experiment results demonstrate that our model outperforms the most existing advanced deep learning models for sentence matching, benefited from our enhanced loss function and deep feature fusion model for capturing semantic context

Similar works

Full text

Available Versions

OPUS - University of Technology Sydney

oai:opus.lib.uts.edu.au:10453/...

Last time updated on 20/04/2021