MoNoise: Modeling Noise Using a Modular Normalization System
We propose MoNoise, a normalization model focused on generalizability and efficiency; it aims to be easily reusable and adaptable. Normalization is the task of translating texts from a non-canonical domain to a more canonical domain, in our case from social media data to standard language. Our proposed model is based on modular candidate generation, in which each module is responsible for a different type of normalization action. The most important generation modules are a spelling correction system and a word embeddings module. Depending on the definition of the normalization task, a static lookup list can be crucial for performance. We train a random forest classifier to rank the candidates, which generalizes well to all different types of normalization actions. Most features for the ranking originate from the generation modules; besides these, N-gram features prove to be an important source of information. We show that MoNoise beats the state of the art on different normalization benchmarks for English and Dutch, which all define the normalization task slightly differently.
Comment: Source code: https://bitbucket.org/robvanderg/monois
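The abstract above describes an architecture of pluggable candidate-generation modules whose outputs are pooled and ranked by a random forest. The following Python sketch illustrates that design under simplified assumptions; the module internals, feature names, and toy data are placeholders, not the released MoNoise implementation (see the source code link above for that).

```python
# A minimal sketch (not the released MoNoise implementation) of the design
# described above: each candidate-generation module proposes replacements with
# its own features, and a random forest ranks the pooled candidates.
# Module internals, feature names and the toy data are illustrative assumptions.
from sklearn.ensemble import RandomForestClassifier

def lookup_module(word, lookup):
    """Static lookup list: known noisy -> canonical replacements."""
    return [(cand, {"from_lookup": 1.0}) for cand in lookup.get(word, [])]

def embedding_module(word, neighbours):
    """Word-embedding module: nearest neighbours with their cosine similarity."""
    return [(cand, {"emb_sim": sim}) for cand, sim in neighbours.get(word, [])]

FEATURE_NAMES = ["from_lookup", "emb_sim"]

def featurize(candidate_feats, ngram_logprob):
    # Features from the generation modules, plus an N-gram language-model
    # score of the candidate, which the abstract highlights as important.
    return [candidate_feats.get(n, 0.0) for n in FEATURE_NAMES] + [ngram_logprob]

# Toy resources standing in for the spelling-correction system, the embedding
# space and the N-gram language model.
lookup = {"u": ["you"], "2morrow": ["tomorrow"]}
neighbours = {"u": [("you", 0.8), ("us", 0.4)]}
lm_logprob = lambda cand: -2.0 if cand == "you" else -6.0

def candidates(word):
    cands = [(word, {})]  # keeping the original word is always a candidate
    cands += lookup_module(word, lookup)
    cands += embedding_module(word, neighbours)
    return cands

# Train the ranker on (features, is-gold-normalization) pairs ...
ranker = RandomForestClassifier(n_estimators=100, random_state=0)
X = [featurize(f, lm_logprob(c)) for c, f in candidates("u")]
y = [0, 1, 1, 0]  # 1 = candidate equals the gold normalization "you"
ranker.fit(X, y)

# ... and normalize a word by picking the highest-scoring candidate.
cands = candidates("u")
scores = ranker.predict_proba([featurize(f, lm_logprob(c)) for c, f in cands])[:, 1]
print(max(zip(cands, scores), key=lambda p: p[1])[0][0])  # expected: "you"
```

In the actual system, the spelling-correction module, the embedding module, and the static lookup list would each contribute candidates and ranking features of their own, and the N-gram scores would come from a language model over the surrounding context.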
Takeover defenses and IPO firm value in the Netherlands
The central question of this study involves the relation between the use of takeover defenses and IPO firm value. We report that management frequently uses takeover defenses before taking the firm public. The use of takeover defenses is primarily motivated by managerial entrenchment. IPO investors anticipate potential conflicts of interest with management and reduce the price they pay for the IPO shares if takeover defenses are adopted. Although managers internalize this cost of takeover defenses to the degree they own pre-IPO stock, they are likely to gain through private control benefits. Non-management pre-IPO owners lose: their shares are worth less, but unlike managers, they do not receive offsetting private control benefits. We infer that managers use takeover defenses to protect private control benefits at non-management pre-IPO owners' expense.
Keywords: firm valuation; initial public offering; takeover defense
To Normalize, or Not to Normalize: The Impact of Normalization on Part-of-Speech Tagging
Does normalization help Part-of-Speech (POS) tagging accuracy on noisy, non-canonical data? To the best of our knowledge, little is known about the actual impact of normalization in a real-world scenario, where gold error detection is not available. We investigate the effect of automatic normalization on POS tagging of tweets. We also compare normalization to strategies that leverage large amounts of unlabeled data kept in its raw form. Our results show that normalization helps, but does not add consistently beyond just word embedding layer initialization. The latter approach yields a tagging model that is competitive with a state-of-the-art Twitter tagger.
Comment: In WNUT 201
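The two strategies contrasted above, normalizing the input versus initializing the tagger's word embedding layer from vectors trained on raw unlabeled tweets, can be sketched as follows. The snippet assumes PyTorch and uses placeholder vectors and a toy normalization lexicon; it is not the experimental setup of the paper.

```python
# Sketch of the two strategies contrasted above, assuming PyTorch as the
# framework: (a) normalize the noisy tokens before tagging, versus (b) keep
# the raw tokens but initialize the tagger's word embedding layer with vectors
# pretrained on large amounts of unlabeled tweets kept in raw form.
# The vocabulary, vectors and normalization lexicon are placeholders.
import torch
import torch.nn as nn

vocab = {"<unk>": 0, "u": 1, "you": 2, "2morrow": 3, "tomorrow": 4, "coming": 5}

# (a) Normalization strategy: replace noisy tokens with canonical forms first.
normalization_lexicon = {"u": "you", "2morrow": "tomorrow"}
def normalize(tokens):
    return [normalization_lexicon.get(t, t) for t in tokens]

# (b) Embedding-initialization strategy: pretrained vectors for the raw
# vocabulary (e.g. trained with word2vec on unlabeled tweets).
pretrained = torch.randn(len(vocab), 50)  # stand-in for real pretrained vectors
embedding = nn.Embedding.from_pretrained(pretrained, freeze=False)

tweet = ["u", "coming", "2morrow"]
ids_raw = torch.tensor([vocab.get(t, 0) for t in tweet])
ids_norm = torch.tensor([vocab.get(t, 0) for t in normalize(tweet)])

# Either representation would feed the same downstream tagger; the abstract
# reports that (b) alone is nearly as effective as adding normalization on top.
print(embedding(ids_raw).shape, embedding(ids_norm).shape)
```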
An In-depth Analysis of the Effect of Lexical Normalization on the Dependency Parsing of Social Media
Existing natural language processing systems have often been designed with standard texts in mind. However, when these tools are used on the substantially different texts from social media, their performance drops dramatically. One solution is to translate social media data to standard language before processing; this is also called normalization. It is well known that this improves performance for many natural language processing tasks on social media data. However, little is known about which types of normalization replacements have the most effect. Furthermore, it is unknown what the weaknesses of existing lexical normalization systems are in an extrinsic setting. In this paper, we analyze the effect of manual as well as automatic lexical normalization on dependency parsing. Based on our analysis, we conclude that for most categories, automatic normalization scores close to manually annotated normalization, and that small annotation differences are important to take into consideration when exploiting normalization in a pipeline setup.
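As a rough illustration of the pipeline setup mentioned above, the sketch below normalizes a tweet's tokens before parsing and scores the predicted tree with labeled attachment score. The `normalize` and `parse` callables are hypothetical stand-ins for a real normalization system and dependency parser, not components from the paper.

```python
# A rough sketch of the pipeline setup mentioned above: lexically normalize the
# tokens of a tweet, parse the normalized tokens, and score the predicted tree
# with labeled attachment score (LAS). The `normalize` and `parse` callables
# are hypothetical stand-ins, not components described in the paper.
from typing import Callable, List, Tuple

Arc = Tuple[int, int, str]  # (head index, dependent index, dependency label)

def parse_with_normalization(
    tokens: List[str],
    normalize: Callable[[List[str]], List[str]],
    parse: Callable[[List[str]], List[Arc]],
) -> List[Arc]:
    # For simplicity this sketch assumes one-to-one replacements, so the
    # predicted arcs line up with the original tokens; real normalization
    # data also contains one-to-many and many-to-one replacements.
    normalized = normalize(tokens)
    assert len(normalized) == len(tokens)
    return parse(normalized)

def labeled_attachment_score(pred: List[Arc], gold: List[Arc]) -> float:
    """Fraction of tokens whose predicted head and label match the gold tree."""
    return len(set(pred) & set(gold)) / len(gold)
```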