Skip to main content
Article thumbnail
Location of Repository

Similarity of Semantic Relations

By Peter D. Turney

Abstract

There are at least two kinds of similarity. Relational similarity is correspondence between relations, in contrast with attributional similarity, which is correspondence between attributes. When two words have a high degree of attributional similarity, we call them synonyms. When two pairs of words have a high degree of relational similarity, we say that their relations are analogous. For example, the word pair mason:stone is analogous to the pair carpenter:wood. This paper introduces Latent Relational Analysis (LRA), a method for measuring relational similarity. LRA has potential applications in many areas, including information extraction, word sense disambiguation, and information retrieval. Recently the Vector Space Model (VSM) of information retrieval has been adapted to measuring relational similarity, achieving a score of 47% on a collection of 374 college-level multiple-choice word analogy questions. In the VSM approach, the relation between a pair of words is characterized by a vector of frequencies of predefined patterns in a large corpus. LRA extends the VSM approach in three ways: (1) the patterns are derived automatically from the corpus, (2) the Singular Value Decomposition (SVD) is used to smooth the frequency data, and (3) automatically generated synonyms are used to explore variations of the word pairs. LRA achieves 56% on the 374 analogy questions, statistically equivalent to the average human score of 57%. On the related problem of classifying semantic relations, LRA achieves similar gains over the VSM

Topics: Language, Computational Linguistics, Semantics, Machine Learning, Artificial Intelligence
Year: 2006
OAI identifier: oai:cogprints.org:5098
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://cogprints.org/5098/1/NR... (external link)
  • http://cogprints.org/5098/ (external link)
  • Suggested articles

    Citations

    1. (2000). 10 Real SATs. College Entrance Examination Board.
    2. (1986). An experimental study of factors important in document ranking.
    3. (1998). An overview of multitext. doi
    4. (1990). Analogical interpretation in context.
    5. (1995). and the Fluid Analogies Research Group. doi
    6. (1992). Automatic acquisition of hyponyms from large text corpora.
    7. (2002). Automatic labeling of semantic roles.
    8. (1990). Categorical Data Analysis.
    9. (1965). Cognition and Thought: An Information Processing Approach. doi
    10. (2002). Coupled clustering: A method for detecting structural correspondence.
    11. (1989). Development and application of ametric on semantic nets. doi
    12. (2002). Discovering word senses from text.
    13. (1990). Enhancing performance in latent semantic indexing (LSI) retrieval.
    14. (2003). Exploring noun-modifier semantic relations. doi
    15. (2003). Extended gloss overlaps as a measure of semantic relatedness.
    16. (1999). Finding parts in very large corpora.
    17. (1997). Inferring semantic similarity from distributional evidence: An analogy-based approach to word sense disambiguation.
    18. (1992). Large scale singular value computations.
    19. (1993). Latent semantic indexing (LSI) and TREC-2.
    20. (2000). Latent semantic space: Iterative scaling improves precision of inter-document similarity measurement.
    21. (2003). Learning semantic constraints for the automatic discovery of part-whole relations.
    22. (1998). Lexical chains as representations of context for the detection and correction of malapropisms.
    23. (1991). Lexical cohesion computed by thesaural relations as an indicator of the structure of text.
    24. (1996). Matrix Computations.
    25. (1995). Metaphor as an emergent property of machinereadable dictionaries.
    26. (2001). Metaphor is like analogy.
    27. (1980). Metaphors We Live By. doi
    28. (2004). Models for the semantic classification of noun phrases. doi
    29. (1999). Modern Information Retrieval.
    30. (1999). Probabilistic Latent Semantic Indexing.
    31. (2003). Roget’s thesaurus and semantic similarity.
    32. (1990). Semantic and associative priming in the cerebral hemispheres: Somewords do, some words don’t ... sometimes, some places. Brain and Language,
    33. (2001). Semantic distance in wordnet: An experimental, application-oriented evaluation of five measures.
    34. (1997). Semantic similarity based on corpus statistics and lexical taxonomy.
    35. (1998). Semi-automatic recognition of noun modifier relationships.
    36. (1990). Similarity involving attributes and relations: Judgments of similarity and difference are not inverses. doi
    37. (1990). Similarity of Semantic Relations Turney Indexing by latent semantic analysis. doi
    38. (1983). Structure-mapping: A theoretical framework for analogy.
    39. (2002). Testing debate. National Review Magazine,
    40. (1994). The cell transmission model: A dynamic representation of highway traffic consistent with the hydrodynamic theory. doi
    41. (2002). The computational modeling of analogy-making.
    42. (2005). The emperor’s new clothes: Undressing the new and unimproved sat. Gelf Magazine,
    43. (1989). The structure-mapping engine: Algorithm and examples.
    44. (1995). Using information content to evaluate semantic similarity in a taxonomy.
    45. (1998). Using latent semantic analysis to assess knowledge: Some technical considerations. doi
    46. (1995). Which method learns the most from data? Methodological issues in the analysis of comparative studies.
    47. (1998). WordNet: An Electronic Lexical Database.

    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.