Search CORE

20 research outputs found

Wrapping PDF Documents Exploiting Uncertain Knowledge

Author: A. Bruggemann-Klein
A. Laender
D. Freitag
I. Muslea
L. Zadeh
M. Wygralak
N. Ashish
S. Soderland
Publication venue
Publication date: 01/01/2006
Field of study

Archivio della ricerca - Università degli studi di Napoli Federico II

Wrapping PDF Documents Exploiting Uncertain Knowledge

Author: A. Bruggemann-Klein
A. Laender
D. Freitag
I. Muslea
L. Zadeh
M. Wygralak
N. Ashish
S. Soderland
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2006
Field of study

Active Learning with Misclassification Sampling Using Diverse Ensembles Enhanced by Unlabeled Instances

Author: C. Campbell
D.D. Lewis
H.S. Seung
I. Muslea
L. Hansen
N. Abe
N. Roy
P. Melville
T.G. Dietterich
Y. Freund
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

Information Extraction in Structured Documents using Tree Automata Induction

Author: C. Pair
C.-N. Hsu
D. Angluin
E. M. Gold
I. Muslea
J. Cowie
L. Valiant
M. Takahashi
S. Soderland
Y. Sakakibara
Y. Sakakibara
Publication venue
Publication date: 01/01/2002
Field of study

Information extraction (IE) addresses the problem of extracting speci c information from a collection of documents. Much of the previous work for IE from structured documents formatted in HTML or XML uses techniques for IE from strings, such as grammar and automata induction. However, HTML and XML documents have a tree structure

Institutional Repository Universiteit Antwerpen

Extracting Product Descriptions from Polish E-Commerce Websites Using Classification and Clustering

Author: C.A. Knoblock
C.H. Chang
C.H. Chang
D. Freitag
D. Pinto
I. Muslea
J. Han
L. Liu
V. Crescenzi
W.W. Cohen
Y. Zhai
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

Adapting Web information extraction knowledge via mining site-invariant and site-dependent features

Author: Ambite J.
Brin S.
Chawathe S.
Ciravegna F.
Crescenzi V.
Downey D.
Freitag D.
Ghani R.
Hsu C.
Kushmerick N.
Kushmerick N.
Muslea I.
Riloff E.
Srihari R.
Tak-Lam Wong
Wai Lam
Wong T. L.
Wong T. L.
Wong T. L.
Wong T. L.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Entropy-based automated wrapper generation for weblog data extraction

Author: A Laender
AK Elmagarmid
Alexandra I. Cristea
B Adelberg
B Liu
George Gkotsis
I Muslea
J Quinlan
K Giles
Karen Stepanyan
L Liu
L Yujian
M Oita
M Pennock
Mike Joy
N Kushmerick
R Baumgartner
S Ihara
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Effective and efficient microprocessor design space exploration using unlabeled design configurations

Author: Chapelle O.
Dagan I.
Dasgupta S.
Fujino A.
Hennessy J. L.
Huang S.-J.
Joachims T.
Joseph P.
Lewis D. D.
Li M.
Miller D. J.
Muslea I.
Wang W.
Wang W.
Wang Y.
Xu L.
Zhou D.
Zhou Z.-H.
Zhou Z.-H.
Zhu X.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Uses of selection strategies in both spectral and sample spaces for classifying hard and soft blueberry using near infrared data

Author: A Carkeet
C Li
C Yang
CC Chang
F Zhang
GA Leivavalenzuela
GP Moreda
H Li
H-D Li
IA Muslea
L Chen
L Chen
MH Hu
MH Hu
MH Hu
MH Hu
MH Hu
N Sinelli
RKH Galvão
S Fan
S Silva
T Fadiji
UL Opara
Y Freund
Y Jiang
Z Xiaobo
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Automatic information extraction from large websites

Author: Adelberg B.
Angluin D.
Arlotta L.
Ashish N.
Atzeni P.
Baumgartner R.
Chidlovskii B.
Crescenzi V.
Crescenzi V.
Crescenzi V.
Fernau H.
Freitag D.
Giansalvatore Mecca
Gold E. M.
Grumbach S.
Gupta A.
Hammer J.
Hong T. W.
Hsu C.
Huck G.
Kosala R.
Kushmerick N.
Lerman K.
Lerman K.
Liu L.
Muslea I.
Pitt L.
Radhakrishnan V.
Ribeiro-Neto B. A.
Sahuguet A.
Valter Crescenzi
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

core

core