Search CORE

52 research outputs found

Sentiment polarity shifters : creating lexical resources through manual annotation and bootstrapped machine learning

Author: Schulder Marc
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 01/01/2019
Field of study

Alleviating pain is good and abandoning hope is bad. We instinctively understand how words like "alleviate" and "abandon" affect the polarity of a phrase, inverting or weakening it. When these words are content words, such as verbs, nouns and adjectives, we refer to them as polarity shifters. Shifters are a frequent occurrence in human language and an important part of successfully modeling negation in sentiment analysis; yet research on negation modeling has focussed almost exclusively on a small handful of closed class negation words, such as "not", "no" and "without. A major reason for this is that shifters are far more lexically diverse than negation words, but no resources exist to help identify them. We seek to remedy this lack of shifter resources. Our most central step towards this is the creation of a large lexicon of polarity shifters that covers verbs, nouns and adjectives. To reduce the prohibitive cost of such a large annotation task, we develop a bootstrapping approach that combines automatic classification with human verification. This ensures the high quality of our lexicon while reducing annotation cost by over 70%. In designing the bootstrap classifier we develop a variety of features which use both existing semantic resources and linguistically informed text patterns. In addition we investigate how knowledge about polarity shifters might be shared across different parts of speech, highlighting both the potential and limitations of such an approach. The applicability of our bootstrapping approach extends beyond the creation of a single resource. We show how it can further be used to introduce polarity shifter resources for other languages. Through the example case of German we show that all our features are transferable to other languages. Keeping in mind the requirements of under-resourced languages, we also explore how well a classifier would do when relying only on data- but not resource-driven features. We also introduce ways to use cross-lingual information, leveraging the shifter resources we previously created for other languages. Apart from the general question of which words can be polarity shifters, we also explore a number of other factors. One of these is the matter of shifting directions, which indicates whether a shifter affects positive polarities, negative polarities or whether it can shift in either direction. Using a supervised classifier we add shifting direction information to our bootstrapped lexicon. For other aspects of polarity shifting, manual annotation is preferable to automatic classification. Not every word that can cause polarity shifting does so for every of its word senses. As word sense disambiguation technology is not robust enough to allow the automatic handling of such nuances, we manually create a complete sense-level annotation of verbal polarity shifters. To verify the usefulness of the lexica which we create, we provide an extrinsic evaluation in which we apply them to a sentiment analysis task. In this task the different lexica are not only compared amongst each other, but also against a state-of-the-art compositional polarity neural network classifier that has been shown to be able to implicitly learn the negating effect of negation words from a training corpus. However, we find that the same is not true for the far more lexically diverse polarity shifters. Instead, the use of the explicit knowledge provided by our shifter lexica brings clear gains in performance.Deutsche Forschungsgesellschaf

Universaar

Acronym

Opinion Holder and Target Extraction for Verb-based Opinion Predicates – The Problem is Not Solved

Author: Ruppenhofer Josef
Schulder Marc
Wiegand Michael
Publication venue: New York : The Association for Computational Linguistics
Publication date: 20/07/2016
Field of study

We offer a critical review of the current state of opinion role extraction involving opinion verbs. We argue that neither the currently available lexical resources nor the manually annotated text corpora are sufficient to appropriately study this task. We introduce a new corpus focusing on opinion roles of opinion verbs from the Subjectivity Lexicon and show potential benefits of this corpus. We also demonstrate that state-of-the-art classifiers perform rather poorly on this new dataset compared to the standard dataset for the task showing that there still remains significant research to be done

Publikationsserver des Instituts für Deutsche Sprache

ATC-ANNO:Semantic annotation for air traffic control with assistive auto-annotation

Author: Bakanouski Yury
Klakow Dietrich
O'Mahony Johannah
Schulder Marc
Publication venue
Publication date: 31/05/2020
Field of study

Edinburgh Research Explorer

DGS-Korpus OpenPose wrapper

Author: Schulder Marc
Publication venue
Publication date: 10/11/2020
Field of study

Collect OpenPose frame files into a single file using the DGS-Korpus OpenPose wrapper format used by the Public DGS Corpus. For more information, see the project note OpenPose in the Public DGS Corpus. For the reverse procedure (single wrapper file to many frame files), see the Public Corpus OpenPose frame extractor script. For the latest state of this source code, see the GitHub repository

ZFDM Repository (Univ. Hamburg)

Public Corpus OpenPose frame extractor

Author: Schulder Marc
Publication venue
Publication date: 10/11/2020
Field of study

The Public DGS Corpus provides OpenPose data for all of its transcripts. Each transcript collects all of its pose information in a single file, using the DGS-Korpus OpenPose wrapper format. This script converts these files into the one-frame-per-file format used by the OpenPose demo. For more information, see the project note OpenPose in the Public DGS Corpus. For the reverse procedure (many frame files to single wrapper file), see the DGS-Korpus OpenPose wrapper script. For the latest state of this source code, see the GitHub repository

ZFDM Repository (Univ. Hamburg)

Public Corpus OpenPose error corrector

Author: Schulder Marc
Publication venue
Publication date: 15/05/2020
Field of study

The Public DGS Corpus provides OpenPose data for all of its transcripts. For the total perspective of each transcript, which shows both informants and the moderator, the OpenPose output was postprocessed to make several improvements. This repository contains the code used for these postprocessing steps. For more information, see the project note OpenPose in the Public DGS Corpus. To convert the standard one-frame-per-file output of OpenPose into the single-file wrapper format used here, see the DGS-Korpus OpenPose wrapper script. For the latest state of this source code, see the GitHub repository

ZFDM Repository (Univ. Hamburg)

Public Corpus OpenPose frame extractor

Author: Schulder Marc
Publication venue
Publication date: 23/11/2023
Field of study

ZFDM Repository (Univ. Hamburg)

Public Corpus OpenPose frame extractor

Author: Schulder Marc
Publication venue
Publication date: 14/05/2020
Field of study

ZFDM Repository (Univ. Hamburg)

Separating Actor-View from Speaker-View Opinion Expressions using Linguistic Features

Author: Ruppenhofer Josef
Schulder Marc
Wiegand Michael
Publication venue: San Diego (California) : Association for Computational Linguistics
Publication date: 01/01/2016
Field of study

We examine different features and classifiers for the categorization of opinion words into actor and speaker view. To our knowledge, this is the first comprehensive work to address sentiment views on the word level taking into consideration opinion verbs, nouns and adjectives. We consider many high-level features requiring only few labeled training data. A detailed feature analysis produces linguistic insights into the nature of sentiment views. We also examine how far global constraints between different opinion words help to increase classification performance. Finally, we show that our (prior) word-level annotation correlates with contextual sentiment views

Crossref

Publikationsserver des Instituts für Deutsche Sprache

Not just 'any' sign! Searching for negative polarity items in DGS

Author: Loos Cornelia
Schulder Marc
Publication venue
Publication date: 30/09/2022
Field of study

These are the slides, transcript and abstract of the on-stage presentation "Not just any sign! Searching for negative polarity items in DGS" given at TISLR 14

ZFDM Repository (Univ. Hamburg)