Keystroke dynamics as signal for shallow syntactic parsing
Keystroke dynamics have been extensively used in psycholinguistic and writing
research to gain insights into cognitive processing. But do keystroke logs
contain actual signal that can be used to learn better natural language
processing models?
We postulate that keystroke dynamics contain information about syntactic
structure that can inform shallow syntactic parsing. To test this hypothesis,
we explore labels derived from keystroke logs as an auxiliary task in a multi-task
bidirectional Long Short-Term Memory (bi-LSTM). We report promising
results on two shallow syntactic parsing tasks, chunking and CCG supertagging.
Our model is simple, has the advantage that its data can come from distinct
sources, and produces models that are significantly better than models trained
on the text annotations alone.
Comment: In COLING 201
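One practical consequence of the setup described above is that the main task and the keystroke-derived auxiliary task can be trained on disjoint corpora, interleaving batches from both. As a minimal sketch of how such interleaving might work (the sampler, task names, and proportional-sampling policy here are illustrative assumptions, not the paper's code):

```python
import random

def sample_task_batches(datasets, n_steps, seed=0):
    """Yield (task_name, batch) pairs, picking each task in proportion
    to its dataset size -- one common way to interleave distinct data
    sources in multi-task training (illustrative sketch only)."""
    rng = random.Random(seed)
    names = list(datasets)
    weights = [len(datasets[n]) for n in names]
    for _ in range(n_steps):
        task = rng.choices(names, weights=weights)[0]
        batch = rng.choice(datasets[task])  # stand-in for a real minibatch
        yield task, batch

# Hypothetical example: chunking batches alongside keystroke-derived labels.
data = {
    "chunking": [["B-NP", "I-NP"], ["B-VP", "I-VP"], ["B-NP", "O"]],
    "keystroke_pause": [["PAUSE", "NO_PAUSE"]],
}
steps = list(sample_task_batches(data, 5))
```

At each step the model would update the shared bi-LSTM parameters plus only the sampled task's output layer; other sampling schedules (uniform over tasks, annealed) are equally plausible.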
Identifying beneficial task relations for multi-task learning in deep neural networks
Multi-task learning (MTL) in deep neural networks for NLP has recently
received increasing interest due to some compelling benefits, including its
potential to efficiently regularize models and to reduce the need for labeled
data. While it has brought significant improvements in a number of NLP tasks,
mixed results have been reported, and little is known about the conditions
under which MTL leads to gains in NLP. This paper sheds light on the specific
task relations that can lead to gains from MTL models over single-task setups.
Comment: Accepted for publication at EACL 201
Multitask Learning for Fine-Grained Twitter Sentiment Analysis
Traditional sentiment analysis approaches tackle problems like ternary
(3-category) and fine-grained (5-category) classification by learning the tasks
separately. We argue that such classification tasks are correlated and we
propose a multitask approach based on a recurrent neural network that benefits
by jointly learning them. Our study demonstrates the potential of multitask
models on this type of problem and improves on the state-of-the-art results for
fine-grained sentiment classification.
Comment: International ACM SIGIR Conference on Research and Development in Information Retrieval 201
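The correlation between the two tasks is easy to see: every 5-point label determines a ternary label by coarsening the scale. A minimal sketch of that mapping (the index convention 0 = very negative .. 4 = very positive is an assumption for illustration, not taken from the paper):

```python
def to_ternary(fine_label):
    """Coarsen a 5-point sentiment label to a 3-class label.
    Assumes 0 = very negative, 1 = negative, 2 = neutral,
    3 = positive, 4 = very positive (hypothetical convention)."""
    if fine_label <= 1:
        return "negative"
    if fine_label == 2:
        return "neutral"
    return "positive"

# Coarsening a batch of fine-grained labels:
ternary = [to_ternary(y) for y in [0, 2, 4, 3]]
```

Because the coarse labels are a deterministic function of the fine ones, a shared representation trained on both tasks can plausibly transfer signal between them, which is the intuition the multitask model exploits.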
Extraction of transforming sequences and sentence histories from writing process data: a first step towards linguistic modeling of writing
Online first, part of special issue "Methods for understanding writing process by analysis of writing timecourse"
Acquired under the Swiss National Licences (http://www.nationallizenzen.ch).
Producing written texts is a non-linear process: in contrast to speech, writers are free to change already written text anywhere, at any point in time. Linguistic considerations are likely to play an important role, but so far no linguistic models of the writing process exist. We present an approach for the analysis of writing processes that focuses on linguistic structures, based on the novel concepts of transforming sequences, text history, and sentence history. Processing raw keystroke logging data and applying natural language processing tools allows product and process data to be extracted, filtered, and stored in a hierarchical data structure. This structure is used to re-create and visualize the genesis and history of a text and of its individual sentences. By focusing on sentences as the primary building blocks of written language and full texts, we aim to complement established writing process analyses and, ultimately, to interpret writing timecourse data with respect to linguistic structures. To enable researchers to explore this view, we provide a fully functional implementation of our approach as an open-source software tool, together with visualizations of the results. We report on a small-scale exploratory study in German in which we used our tool. The results indicate both the feasibility of the approach and that writers do revise at the linguistic level. The latter confirms the need to model written text production in terms of linguistic structures beyond the word level.
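The core operation behind a text history is recovering, from two successive snapshots of the text, what was inserted, deleted, or replaced. A rough stand-in for such transforming sequences can be sketched with the standard-library `difflib` (the function name and the snapshot-pair interface are assumptions; the paper's tool works on raw keystroke logs, not snapshot diffs):

```python
import difflib

def transforming_sequences(prev_text, next_text):
    """Extract the character-level edit operations that turn one text
    state into the next. Each entry is (operation, removed_span,
    inserted_span). Illustrative approximation of the paper's concept."""
    ops = []
    matcher = difflib.SequenceMatcher(a=prev_text, b=next_text)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":  # keep only the actual edits
            ops.append((tag, prev_text[i1:i2], next_text[j1:j2]))
    return ops

# Two snapshots of a sentence under revision:
history = transforming_sequences("The cat sat.", "The black cat sat down.")
```

Chaining such edit records over all intermediate states, and aligning them with sentence boundaries from an NLP pipeline, would yield per-sentence histories of the kind the tool visualizes.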
When is multitask learning effective? Semantic sequence prediction under varying data conditions
Multitask learning has been applied successfully to a range of tasks, mostly
morphosyntactic ones. However, little is known about when MTL works and whether there
are data characteristics that help determine its success. In this paper we
evaluate a range of semantic sequence labeling tasks in an MTL setup. We examine
different auxiliary tasks, among them a novel setup, and correlate their
impact with data-dependent conditions. Our results show that MTL is not always
effective: significant improvements are obtained for only 1 out of 5 tasks.
When MTL is successful, auxiliary tasks with compact and more uniform label
distributions are preferable.
Comment: In EACL 201
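The "compact and more uniform label distributions" finding suggests a simple screening heuristic for candidate auxiliary tasks: measure how uniform a task's label distribution is. One standard proxy is normalized entropy (this sketch is a heuristic illustration; the paper's exact measures may differ):

```python
import math

def normalized_label_entropy(labels):
    """Entropy of a label distribution, normalized to [0, 1].
    Values near 1 mean a near-uniform distribution; by the finding
    above, such tasks may be better auxiliary-task candidates."""
    counts = {}
    for lab in labels:
        counts[lab] = counts.get(lab, 0) + 1
    total = len(labels)
    probs = [c / total for c in counts.values()]
    ent = -sum(p * math.log(p) for p in probs)
    max_ent = math.log(len(counts)) if len(counts) > 1 else 1.0
    return ent / max_ent

uniform = normalized_label_entropy(["A", "B", "A", "B"])  # balanced labels
skewed = normalized_label_entropy(["A", "A", "A", "B"])   # skewed labels
```

A small label set (compactness) with a score close to 1 would then flag a task worth trying as an auxiliary signal, subject to the paper's caveat that MTL gains remain data-dependent.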