Web based knowledge extraction and consolidation for automatic ontology instantiation

Alani, Harith; Hall, Wendy; Kim, Sanghee; Lewis, Paul H.; Millard, David E.; Shadbolt, Nigel; Weal, Mark J.

research

Web based knowledge extraction and consolidation for automatic ontology instantiation

Authors: Harith Alani
Wendy Hall
Sanghee Kim
Paul H. Lewis
David E. Millard
Nigel Shadbolt
Mark J. Weal
Publication date: 1 January 2003
Publisher

Abstract

The Web is probably the largest and richest information repository available today. Search engines are the common access routes to this valuable source. However, the role of these search engines is often limited to the retrieval of lists of potentially relevant documents. The burden of analysing the returned documents and identifying the knowledge of interest is therefore left to the user. The Artequakt system aims to deploy natural language tools to automatically ex-tract and consolidate knowledge from web documents and instantiate a given ontology, which dictates the type and form of knowledge to extract. Artequakt focuses on the domain of artists, and uses the harvested knowledge to gen-erate tailored biographies. This paper describes the latest developments of the system and discusses the problem of knowledge consolidation