8,423 research outputs found

Evaluation of taxonomic and neural embedding methods for calculating semantic similarity

Authors: Yang Dongqiang, Yin Yanqin
Publication venue: Cambridge University Press (CUP)
Publication date: 29/09/2022

Modelling semantic similarity plays a fundamental role in lexical semantic applications. A natural way of calculating semantic similarity is to access handcrafted semantic networks, but similarity can also be predicted in a distributional vector space. Similarity calculation continues to be a challenging task, even with the latest breakthroughs in deep neural language models. We first examined popular methodologies for measuring taxonomic similarity, including edge-counting, which employs only the semantic relations in a taxonomy, as well as more complex methods that estimate concept specificity. We further extrapolated three weighting factors in modelling taxonomic similarity. To study the distinct mechanisms of taxonomic and distributional similarity measures, we ran head-to-head comparisons of each measure with human similarity judgements from the perspectives of word frequency, polysemy degree and similarity intensity. Our findings suggest that, without fine-tuning the uniform distance, taxonomic similarity measures can depend on the shortest path length as a prime factor to predict semantic similarity; that, in contrast to distributional semantics, edge-counting is free from sense distribution bias in use and can measure word similarity both literally and metaphorically; and that the synergy of retrofitting neural embeddings with concept relations in similarity prediction may indicate a new trend of leveraging knowledge bases in transfer learning. It appears that a large gap still exists in computing semantic similarity across different ranges of word frequency, polysemy degree and similarity intensity.
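The edge-counting idea mentioned in this abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the toy taxonomy, the function names, and the choice of `1 / (1 + d)` to turn a shortest path length `d` into a score. It is not the measure evaluated in the paper and does not use WordNet.

```python
from collections import deque

# Hypothetical toy is-a taxonomy (child -> parent); not taken from WordNet.
TOY_TAXONOMY = {
    "dog": "canine",
    "wolf": "canine",
    "canine": "mammal",
    "cat": "feline",
    "feline": "mammal",
    "mammal": "animal",
    "bird": "animal",
}


def build_graph(child_to_parent):
    """Turn child -> parent links into an undirected adjacency map."""
    graph = {}
    for child, parent in child_to_parent.items():
        graph.setdefault(child, set()).add(parent)
        graph.setdefault(parent, set()).add(child)
    return graph


def shortest_path_length(graph, source, target):
    """Count edges on the shortest path using breadth-first search.

    Returns None when either concept is unknown or no path exists.
    """
    if source not in graph or target not in graph:
        return None
    if source == target:
        return 0
    seen = {source}
    queue = deque([(source, 0)])
    while queue:
        node, dist = queue.popleft()
        for neighbour in graph[node]:
            if neighbour == target:
                return dist + 1
            if neighbour not in seen:
                seen.add(neighbour)
                queue.append((neighbour, dist + 1))
    return None


def edge_counting_similarity(graph, a, b):
    """Assumed scoring: 1 / (1 + shortest path length); 0.0 if unconnected."""
    dist = shortest_path_length(graph, a, b)
    return 0.0 if dist is None else 1.0 / (1.0 + dist)


if __name__ == "__main__":
    graph = build_graph(TOY_TAXONOMY)
    for pair in [("dog", "wolf"), ("dog", "cat"), ("dog", "bird")]:
        print(pair, edge_counting_similarity(graph, *pair))
```

In this sketch every edge has the same uniform length, so no edge weights are tuned; a weighted variant would replace the breadth-first search with a weighted shortest-path search.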

arXiv.org e-Print Archive

Verb similarity on the taxonomy of WordNet

Authors: Powers David Martin, Yang Dongqiang
Publication venue: Masaryk University Press
Publication date: 01/01/2006

CiteSeerX

Flinders Academic Commons

Semantic Distance in WordNet: A Simplified and Improved Measure of Semantic Relatedness

Author: Scriver Aaron
Publication venue: University of Waterloo
Publication date: 01/01/2006

Measures of semantic distance have received a great deal of attention recently in the field of computational lexical semantics. Although techniques for approximating the semantic distance of two concepts have existed for several decades, the introduction of the WordNet lexical database and improvements in corpus analysis have enabled significant improvements in semantic distance measures. In this study we investigate a special kind of semantic distance, called semantic relatedness. Lexical semantic relatedness measures have proved to be useful for a number of applications, such as word sense disambiguation and real-word spelling error correction. Most relatedness measures rely on the observation that the shortest path between nodes in a semantic network provides a representation of the relationship between two concepts. The strength of relatedness is computed in terms of this path. This dissertation makes several significant contributions to the study of semantic relatedness. We describe a new measure that calculates semantic relatedness as a function of the shortest path in a semantic network. The proposed measure achieves better results than other standard measures and yet is much simpler than previous models. The proposed measure is shown to achieve a correlation of r = 0.897 with the judgments of human test subjects using a standard benchmark data set, representing the best performance reported in the literature. We also provide a general formal description for a class of semantic distance measures, namely those that compute semantic distance from the shortest path in a semantic network. Lastly, we suggest a new methodology for developing path-based semantic distance measures that would limit the possibility of unnecessary complexity in future measures.
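The correlation figure reported in this abstract compares a measure's scores with human ratings over the same word pairs. As a minimal sketch of how such a comparison can be computed, the function below implements the Pearson correlation coefficient r from its definition. The function name and the sample numbers are hypothetical and are not drawn from the dissertation or its benchmark.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation between two equal-length sequences of numbers."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length, at least 2 items")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sums of centred products and squares; the 1/n factors cancel in r.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


if __name__ == "__main__":
    # Made-up scores for five word pairs, for illustration only.
    measure_scores = [0.90, 0.75, 0.40, 0.20, 0.05]
    human_ratings = [3.8, 3.1, 1.9, 1.2, 0.3]
    print(round(pearson_r(measure_scores, human_ratings), 3))
```

A rank-based coefficient such as Spearman's is a common alternative when only the ordering of the pairs matters.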

University of Waterloo's Institutional Repository

A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method

Authors: A Wu, AL Barabasi, F Beil, G Erkan, Il-Yeol Song, Illhoi Yoo, J Ghosh, J Kleinberg, LAN Amaral, M Steinbach, MA Hearst, MEJ Newman, P Erdos, P Pantel, R Ferrer-Cancho, R Rada, RA Hanneman, S Salton, T Nomato, Xiaohua Hu, Y Zeng
Publication venue: BioMed Central
Publication date: 01/01/2007

Crossref

Springer - Publisher Connector

PubMed Central