Search CORE

3 research outputs found

Automatic Wrapper Induction from Hidden-Web Sources with Domain Knowledge ABSTRACT

Author: Avin Mittal
Daniel Muschick
Marc Tommasi
Pierre Senellart
Rémi Gilleron
Technische Universität Graz
Publication venue
Publication date: 01/01/2008
Field of study

We present an original approach to the automatic induction of wrappers for sources of the hidden Web that does not need any human supervision. This approach heavily relies on some domain knowledge, expressed in a predefined form, for a given domain of interest. There are two parts in the understanding of a given service of the hidden Web: understanding the structure of its input and the way its output is presented. This amounts to understanding the structure of a given form and to relate its fields to concepts of the domain of interest, and to understanding where and how resulting records are represented in an HTML result page. For the former problem, we use a combination of heuristics and of probing with domain instances; for the latter, we use a supervised machine learning technique adapted to tree-like information on an automatic, imperfect, and imprecise, annotation using the domain knowledge. The result of these two steps is the possibility to automatically wrap a form as a standard Web service with a WSDL description. We implemented such a system and show experiments that demonstrate the validity and potential of this approach

HAL-CentraleSupelec

CiteSeerX

HAL - Lille 3

INRIA a CCSD electronic archive server

HAL-Rennes 1

The renal patient seen by non-renal physicians: the kidney embedded in the ‘milieu intérieur’

Author: Almutary
Anding
Avin
Bagai
Bart
Bauer
Bernard
Binkley
Bischoff-Ferrari
Breidthardt
Brisco
Brüggemann
Bucur
Cheema
Chen
Cheng
Costanzo
Cruz-Jentoft
Cruz-Jentoft
Czer
Damman
Damman
Damman
Damman
Dave
Davison
Davison
de Souza
Frost
Girgis
Harvey
Hirai
Hirschfeld
Iimori
Ishani
Jamal
Jamal
Jamal
Jannot
Kanis
Kanis
Katagiri
Ketteler
Laurent
Lesogor
Lund
Manfredini
McMurray
Mercadante
Miller
Miller
Mittal
Mocroft
Moorthi
Mullens
Nielsen
Pagels
Paintin
Pfeffer
Pham
Proctor
Rooks
Schuit
Shaw
Sprague
Stengel
Vanholder
Vardeny
Verschueren
Wang
Weisbord
Wilson
Yanishi
Yenchek
Zhang
Publication venue: 'Oxford University Press (OUP)'
Publication date
Field of study

Crossref