Planning to Gather Information


The exponential growth of the Internet has produced a labyrinth of documents, databases and services. While almost any type of information is available somewhere, even expert users waste time and effort searching for appropriate information sources, and phrasing queries in the custom formats required by each site. To make matters worse, many queries can only be answered by combining information from several different sites. This paper describes Occam, a query planning algorithm that determines the best way to integrate data from different sources. As input, Occam takes a library of site descriptions and a user query. As output, Occam automatically generates one or more plans that encode alternative ways to gather the requested information. Occam has several important features: (1) it integrates both legacy systems and full relational databases with an efficient, domain-independent, query-planning algorithm, (2) it reasons about the capabilities of different information sources, (3) ..

Similar works

Full text



Last time updated on 22/10/2014

This paper was published in CiteSeerX.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.