Search CORE

15 research outputs found

AutoSense Model for Word Sense Induction

Author: Amplayo Reinald Kim
Hwang Seung-won
Song Min
Publication venue
Publication date: 22/11/2018
Field of study

Word sense induction (WSI), or the task of automatically discovering multiple senses or meanings of a word, has three main challenges: domain adaptability, novel sense detection, and sense granularity flexibility. While current latent variable models are known to solve the first two challenges, they are not flexible to different word sense granularities, which differ very much among words, from aardvark with one sense, to play with over 50 senses. Current models either require hyperparameter tuning or nonparametric induction of the number of senses, which we find both to be ineffective. Thus, we aim to eliminate these requirements and solve the sense granularity problem by proposing AutoSense, a latent variable model based on two observations: (1) senses are represented as a distribution over topics, and (2) senses generate pairings between the target word and its neighboring word. These observations alleviate the problem by (a) throwing garbage senses and (b) additionally inducing fine-grained word senses. Results show great improvements over the state-of-the-art models on popular WSI datasets. We also show that AutoSense is able to learn the appropriate sense granularity of a word. Finally, we apply AutoSense to the unsupervised author name disambiguation task where the sense granularity problem is more evident and show that AutoSense is evidently better than competing models. We share our data and code here: https://github.com/rktamplayo/AutoSense.Comment: AAAI 201

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Combining Neural Language Models for WordSense Induction

Author: Aleksashina Tatiana
Arefyev Nikolay
Sheludko Boris
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 23/06/2020
Field of study

Word sense induction (WSI) is the problem of grouping occurrences of an ambiguous word according to the expressed sense of this word. Recently a new approach to this task was proposed, which generates possible substitutes for the ambiguous word in a particular context using neural language models, and then clusters sparse bag-of-words vectors built from these substitutes. In this work, we apply this approach to the Russian language and improve it in two ways. First, we propose methods of combining left and right contexts, resulting in better substitutes generated. Second, instead of fixed number of clusters for all ambiguous words we propose a technique for selecting individual number of clusters for each word. Our approach established new state-of-the-art level, improving current best results of WSI for the Russian language on two RUSSE 2018 datasets by a large margin.Comment: International Conference on Analysis of Images, Social Networks and Texts AIST 2019: Analysis of Images, Social Networks and Texts, pp 105-12

arXiv.org e-Print Archive

Crossref

Recommended from our members

Pain and Stress Detection Using Wearable Sensors and Devices—A Review

Author: Abbod M
Chen J
Shieh J-S
Publication venue: 'MDPI AG'
Publication date: 03/02/2021
Field of study

© 2021 by the authors. Pain is a subjective feeling; it is a sensation that every human being must have experienced all their life. Yet, its mechanism and the way to immune to it is still a question to be answered. This review presents the mechanism and correlation of pain and stress, their assessment and detection approach with medical devices and wearable sensors. Various physiological signals (i.e., heart activity, brain activity, muscle activity, Electrodermal activity, respiratory, blood volume pulse, skin tempera-ture) and behavioral signals are organized for wearables sensors detection. By reviewing the wearable sensors used in the healthcare domain, we hope to find a way for wearable healthcare-monitoring system to be applied on pain and stress detection. Since pain leads to multiple consequences or symptoms such as muscle tension and depression that are stress related, there is a chance to find a new approach for chronic pain detection using daily life sensors or de-vices. Then by integrating modern computing techniques, there is a chance to handle pain and stress management issue.Ministry of Science and Technology (MOST) of Taiwan (grant number: MOST 107-2221-E-155-009-MY2)

Brunel University Research Archive

Investigations into the value of labeled and unlabeled data in biomedical entity recognition and word sense disambiguation

Author: Sheikhshabbafghi Golnar
Publication venue
Publication date: 31/03/2021
Field of study

Human annotations, especially in highly technical domains, are expensive and time consuming togather, and can also be erroneous. As a result, we never have sufficiently accurate data to train andevaluate supervised methods. In this thesis, we address this problem by taking a semi-supervised approach to biomedical namedentity recognition (NER), and by proposing an inventory-independent evaluation framework for supervised and unsupervised word sense disambiguation. Our contributions are as follows: We introduce a novel graph-based semi-supervised approach to named entity recognition(NER) and exploit pre-trained contextualized word embeddings in several biomedical NER tasks. We propose a new evaluation framework for word sense disambiguation that permits a fair comparison between supervised methods trained on different sense inventories as well as unsupervised methods without a fixed sense inventory

Simon Fraser University Institutional Repository

A proof-of-concept meaning discrimination experiment to compile a word-in-context dataset for adjectives – A graph-based distributional approach

Author: Héja Enikő
Ligeti-Nagy Noémi
Publication venue: 'Akademiai Kiado Zrt.'
Publication date: 01/01/2022
Field of study

Repository of the Academy's Library