
Representing Context Information for Document Retrieval

By Maya Carrillo, Esaú Villatoro-Tello, A. López-López, Chris Eliasmith, Manuel Montes-y-Gómez and Luis Villaseñor-Pineda; Coordinación de Ciencias Computacionales


Abstract. The bag-of-words (BoW) representation, widely used in information retrieval (IR), represents documents and queries as word lists that express nothing about context. When we look for information, not everything is explicitly stated in a document, so context information is needed to understand its content. This paper proposes the use of bag of concepts (BoC) and holographic reduced representations (HRR) in IR. These representations go beyond BoW by incorporating context information into document representations. Both HRR and BoC are produced using a vector space methodology known as Random Indexing, and both allow additional knowledge from different sources to be expressed. Our experiments show that the representations are feasible and improve mean average precision by up to 7% compared with the traditional vector space model.
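
The abstract names the techniques without showing them, so the following is a minimal sketch of how they are commonly defined, not the authors' implementation. It assumes the usual formulation of Random Indexing (each document receives a sparse random index vector, and a term's context vector is the sum of the index vectors of the documents it occurs in), a BoC document vector built as the sum of its terms' context vectors, and HRR binding by circular convolution. The names `DIM`, `NONZERO`, `index_vector`, `term_context_vectors`, `bag_of_concepts` and `circular_convolution`, the toy documents, and the unweighted sums are all illustrative assumptions.

```python
import random

DIM = 64      # vector dimensionality (assumed; real systems use far more)
NONZERO = 4   # number of +1/-1 entries per index vector (assumed, kept even)


def index_vector(rng, dim=DIM, nonzero=NONZERO):
    """Sparse ternary index vector: half the nonzero entries +1, half -1."""
    vec = [0.0] * dim
    for i, pos in enumerate(rng.sample(range(dim), nonzero)):
        vec[pos] = 1.0 if i % 2 == 0 else -1.0
    return vec


def add(a, b):
    """Element-wise vector sum."""
    return [x + y for x, y in zip(a, b)]


def term_context_vectors(docs, rng):
    """Random Indexing: sum the index vectors of the documents containing each term."""
    doc_index = {doc_id: index_vector(rng) for doc_id in docs}
    context = {}
    for doc_id, tokens in docs.items():
        for term in set(tokens):
            context[term] = add(context.get(term, [0.0] * DIM), doc_index[doc_id])
    return context


def bag_of_concepts(tokens, context):
    """BoC sketch: unweighted sum of the context vectors of the known terms."""
    vec = [0.0] * DIM
    for term in tokens:
        if term in context:
            vec = add(vec, context[term])
    return vec


def circular_convolution(a, b):
    """HRR binding: result[k] = sum_j a[j] * b[(k - j) mod n]."""
    n = len(a)
    return [sum(a[j] * b[(k - j) % n] for j in range(n)) for k in range(n)]


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical toy collection, for illustration only.
    docs = {"d1": ["context", "retrieval"], "d2": ["context", "vector"]}
    context = term_context_vectors(docs, rng)
    query_vec = bag_of_concepts(["context", "retrieval"], context)
    # Bind a hypothetical role vector to a term's context vector.
    role = index_vector(rng)
    bound = circular_convolution(role, context["retrieval"])
    print(len(query_vec), len(bound))
```

The direct double loop in `circular_convolution` costs O(n²) per binding; it is used here only to keep the sketch dependency-free, and an FFT-based version would be the usual choice at realistic dimensionalities.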

Topics: Information Retrieval, Vector Model, Context Information, Random Indexing, Holographic Reduced Representation
Year: 2013
OAI identifier: oai:CiteSeerX.psu:
Provided by: CiteSeerX