Search CORE

356 research outputs found

Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks

Author: Manning Christopher D.
Socher Richard
Tai Kai Sheng
Publication venue
Publication date: 01/01/2015
Field of study

Because of their superior ability to preserve sequence information over time, Long Short-Term Memory (LSTM) networks, a type of recurrent neural network with a more complex computational unit, have obtained strong results on a variety of sequence modeling tasks. The only underlying LSTM structure that has been explored so far is a linear chain. However, natural language exhibits syntactic properties that would naturally combine words to phrases. We introduce the Tree-LSTM, a generalization of LSTMs to tree-structured network topologies. Tree-LSTMs outperform all existing systems and strong LSTM baselines on two tasks: predicting the semantic relatedness of two sentences (SemEval 2014, Task 1) and sentiment classification (Stanford Sentiment Treebank).Comment: Accepted for publication at ACL 201

arXiv.org e-Print Archive

CiteSeerX

Crossref

Generating High-Quality Surface Realizations Using Data Augmentation and Factored Sequence Models

Author: Elder Henry
Hokamp Chris
Publication venue
Publication date: 01/01/2018
Field of study

This work presents a new state of the art in reconstruction of surface realizations from obfuscated text. We identify the lack of sufficient training data as the major obstacle to training high-performing models, and solve this issue by generating large amounts of synthetic training data. We also propose preprocessing techniques which make the structure contained in the input features more accessible to sequence models. Our models were ranked first on all evaluation metrics in the English portion of the 2018 Surface Realization shared task

arXiv.org e-Print Archive

Crossref