
    Computing term translation probabilities with generalized latent semantic analysis

    Term translation probabilities have proved to be an effective method of semantic smoothing in the language modelling approach to information retrieval. In this paper, we use Generalized Latent Semantic Analysis (GLSA) to compute semantically motivated term and document vectors. The normalized cosine similarity between term vectors is used as the term translation probability in the language modelling framework. Our experiments demonstrate that GLSA-based term translation probabilities capture semantic relations between terms and improve performance on document classification.
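To make the idea concrete, here is a minimal sketch of how cosine similarities between term vectors might be turned into translation probabilities and used to smooth a document language model. The abstract does not give the exact normalization, so several choices here are assumptions: the toy vectors stand in for GLSA term vectors, negative similarities are clipped to zero, and each row is normalized to sum to one. The function names are hypothetical and not taken from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def translation_probabilities(term_vectors):
    """Return probs[t][w], an estimate of p(w | t), from term vectors.

    Assumption: negative similarities are clipped to zero and each row is
    divided by its sum. The paper may normalize differently.
    """
    terms = list(term_vectors)
    probs = {}
    for t in terms:
        sims = {w: max(0.0, cosine(term_vectors[t], term_vectors[w])) for w in terms}
        total = sum(sims.values())
        if total == 0.0:
            # Degenerate case (e.g. an all-zero vector): fall back to uniform.
            probs[t] = {w: 1.0 / len(terms) for w in terms}
        else:
            probs[t] = {w: s / total for w, s in sims.items()}
    return probs


def smoothed_document_model(doc_terms, probs):
    """p(w | d) = sum over t of p(w | t) * p_ml(t | d).

    Assumes every term in doc_terms has an entry in probs.
    """
    counts = {}
    for t in doc_terms:
        counts[t] = counts.get(t, 0) + 1
    n = len(doc_terms)
    return {w: sum(probs[t][w] * c / n for t, c in counts.items()) for w in probs}


# Toy two-dimensional vectors standing in for GLSA term vectors (illustrative only).
toy_vectors = {
    "car": [1.0, 0.1],
    "automobile": [0.9, 0.2],
    "banana": [0.0, 1.0],
}
toy_probs = translation_probabilities(toy_vectors)
toy_model = smoothed_document_model(["car", "car"], toy_probs)
```

In this toy setup, a document that contains only "car" still assigns probability mass to "automobile", which is the semantic smoothing effect the abstract describes.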