Skip to main content
Article thumbnail
Location of Repository

Combining independent modules to solve multiple-choice synonym and analogy problems

By Peter Turney, Michael Littman, Jeffrey Bigham and Victor Shnayder

Abstract

Existing statistical approaches to natural language problems are very coarse approximations to the true complexity of language processing. As such, no single technique will be best for all problem instances. Many researchers are examining ensemble methods that combine the output of successful, separately developed modules to create more accurate solutions. This paper examines three merging rules for combining probability distributions: the well known mixture rule, the logarithmic rule, and a novel product rule. These rules were applied with state-of-the-art results to two problems commonly used to assess human mastery of lexical semantics -- synonym questions and analogy questions. All three merging rules result in ensembles that are more accurate than any of their component modules. The differences among the three rules are not statistically significant, but it is suggestive that the popular mixture rule is not the best rule for either of the two problems

Topics: Statistical Models, Language, Computational Linguistics, Semantics, Machine Learning
Year: 2003
OAI identifier: oai:cogprints.org:3163
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://cogprints.org/3163/1/ra... (external link)
  • http://cogprints.org/3163/ (external link)
  • Suggested articles

    Citations

    1. (2000). 10 Real SATs. College Entrance Examination Board,
    2. (1999). A brief introduction to boosting.
    3. (1997). A probabilistic approach to solving crossword puzzles. doi
    4. (1980). A solution to Plato’s problem: The latent semantic analysis theory of acquisition, induction and representation of knowledge. doi
    5. (1991). Adaptive mixtures of experts. doi
    6. (1998). Classi combination for improved lexical disambiguation.
    7. (2003). Frequency estimates for statistical word similarity measures.
    8. (1992). High-level perception, representation, and analogy: a critique of arti intelligence methodology.
    9. (2003). Learning analogies and semantic relations. doi
    10. (2003). Measuring praise and criticism: Inference of semantic orientation from association. doi
    11. (1995). Methods for combining experts’ probability assessments. doi
    12. (1992). Methods of combining multiple classi and their applications to handwriting recognition. doi
    13. (2001). Mining the web for synonyms: PMI-IR versus LSA on TOEFL.
    14. (2002). Modeling consensus: Classi combination for word sense disambiguation. doi
    15. (1999). Products of experts.
    16. Roget’s thesaurus and semantic similarity.
    17. (1998). Selecting weighting factors in logarithmic opinion pools.
    18. (2002). The computational modeling of analogy-making. doi
    19. (1998). WordNet: An Electronic Lexical Database. doi

    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.