4 research outputs found

Discovering linkage points over web data

Author: Arasu A., Bizer C., Burdick D., Christen P., Dhamanka R., Duan S., Euzenat J., Hassanzadeh O., Hernández M. A., Hutter F., Isele R., Kang J., Lenzerini M., Naumann F., Ngonga Ngomo A.-C., Rahm E., Robertson S., Salton G., Warren R. H., Zhang M.
Publication venue: 'VLDB Endowment'

Multi-column substring matching for database schema translation

Author: Robert H. Warren

We describe a method for discovering complex schema translations involving substrings from multiple database columns. The method does not require a training set of instances linked across databases and it is capable of dealing with both fixed- and variable-length field columns. We propose an iterative algorithm that deduces the correct sequence of concatenations of column substrings in order to translate from one database to another. We introduce the algorithm along with examples on common database data values and examine its performance on real-world and synthetic datasets.

CiteSeerX

Engineering truly automated data integration and translation systems

Author: Robert H. Warren
Publication venue: 'University of Waterloo'
Publication date: 10/12/2007

This thesis presents an automated, data-driven integration process for relational databases. Whereas previous integration methods assumed a large amount of user involvement as well as the availability of database meta-data, we make no use of meta-data and little end-user input. This is done using a novel join and translation finding algorithm that searches for the proper key/foreign key relationships while inferring the instance transformations from one database to another. Because we rely only on the relations that bind the attributes together, we make no use of the database schema information. A novel searching method allows us to search the database for relevant objects without requiring server-side indexes or cooperative databases.

University of Waterloo's Institutional Repository