research

Indexing of Reading Paths for a Structured Information Retrieval on the Web

Abstract

International audienceIn this paper, we present a hyperdocument model taking into account the essential aspects of information on the Web: content, composition (logical structure) and nonlinear reading (hypertext structure). We have developed a Structured Information Retrieval System (SIRS) based on this model. Its phases of indexing and querying are based on a “reading paths” point of view of the Web: a Web site is considered as a set of potential reading paths, instead of a set of atomic and flat pages. We have developed an specific algorithm to index the reading paths. We present some experiments aiming at evaluating the interest of our indexing process of reading paths

    Similar works

    Full text

    thumbnail-image

    Available Versions

    Last time updated on 01/04/2019
    Last time updated on 12/11/2016