Skip to main content
Article thumbnail
Location of Repository

A Service Late Binding Enabled Solution for Data Integration from Autonomous and Evolving Databases

By CHONG WANG

Abstract

Integrating data from autonomous, distributed and heterogeneous data sources to provide a unified vision is a common demand for many businesses. Since the data sources may evolve frequently to satisfy their own independent business needs, solutions which use hard coded queries to integrate participating databases may cause high maintenance costs when evolution occurs. Thus a new solution which can handle database evolution with lower maintenance effort is required.\ud \ud This thesis presents a new solution: Service Late binding Enabled Data Integration (SLEDI) which is set into a framework modeling the essential processes of the data integration activity. It integrates schematic heterogeneous relational databases with decreased maintenance costs for handling database evolution. An algorithm, named Information Provision Unit Describing (IPUD) is designed to describe each database as a set of Information Provision Units (IPUs). The IPUs are represented as Directed Acyclic Graph (DAG) structured data instead of hard coded queries, and further realized as data services. Hence the data integration is achieved through service invocations. Furthermore, a set of processes is defined to handle the database evolution through automatically identifying and modifying the IPUs which are affected by the evolution.\ud \ud An extensive evaluation based on a case study is presented. The result shows that the schematic heterogeneities defined in this thesis can be solved by IPUD except the relation isomorphism discrepancy. Ten out of thirteen types of schematic database evolution can be automatically handled by the evolution handling processes as long as the evolution is represented by the designed data model. The computational costs of the automatic evolution handling show a slow linear growth with the number of participating databases. Other characteristics addressed include SLEDI’s scalability, independence of application domain and databases model. The descriptive comparison with other data integration approaches shows that although the Data as a Service approach may result in lower performance under some circumstances, it supports better flexibility for integrating data from autonomous and evolving data sources.\u

Topics: Database evolution handling, Data as a Service, Service metadata, Service-based data integration
Year: 2010
OAI identifier: oai:etheses.dur.ac.uk:659
Provided by: Durham e-Theses

Suggested articles

Citations

  1. (2007). 1.2 Part 0: Primer (Second Edition),
  2. (2007). 1.2 Part 1: Messaging Framework (Second Edition), doi
  3. (2008). 2 Reasoner for Java, http://clarkparsia.com/pellet/, last accessed in
  4. (1986). A comparative analysis of methodologies for database schema integration, doi
  5. (1995). A framework for representing knowledge, Computation & intelligence: collected readings, ISBN:0-262-62101-0, Year of Publication:
  6. (2007). A Semantic Web Framework for Java, http://jena.sourceforge.net/, last accessed in
  7. (1986). A State Transition Model for Distributed Query Processing’ doi
  8. (1993). A translation approach to portable ontology specifications, doi
  9. (2006). Addressing 1.0 doi
  10. (2007). Adult mental health care in England, doi
  11. Alberto Mendelzon, Database Techniques for the World-Wide Web: A Survey, doi
  12. (2003). An Introduction to Database System,
  13. (1997). An Overview of Data Warehousing doi
  14. (2001). Answering queries using views: A survey. doi
  15. (2003). Applications of case study research, second edition,
  16. (2005). Awais Rashid, Günter Kniesel, Towards a taxonomy of software change: Research Articles, doi
  17. (2008). Case Study Research Design and Methods, Fourth Edition,
  18. (1999). changing focus on interoperability in information systems: from system, syntax, structure to semantics”, doi
  19. (2000). D.Budgen, P.Brereton, L.A.Macaulay, M.Munro, ‘Service-Based Software: The Future for Flexible Software’ doi
  20. (2002). Data integration: A theoretical perspective. doi
  21. (2002). Data warehousing and business intelligence for e-commerce, ISBN: 978-1-55860-713-2, Year of Publication: doi
  22. (1989). Database and Knowledge-Base Systems, Volumes I and II. doi
  23. (2007). Description Language (WSDL) Version 2.0 Part 0: Primer, doi
  24. (2007). Description Language (WSDL) Version 2.0 Part 1: Core Language, doi
  25. (2004). Description Language 1.0: RDF Schema, doi
  26. (1998). Description logic framework for information integration, doi
  27. (2004). Dynamic Data Integration Using Web Services’, doi
  28. (2008). Elias Theocharopoulos, OGSA-DAI: Middleware for Data Integration: Selected Applications, doi
  29. (2007). Evren Sirin, Naveen Srinivasan, Bringing Semantics to Web Services with OWL-S, doi
  30. (1986). Experimentation in software engineering, doi
  31. (1990). Federated database systems for managing distributed, heterogeneous and autonomous databases’ doi
  32. (1992). Federated databases and systems: part I --- a tutorial on their data sharing, doi
  33. (2006). Five Misunderstandings about Case-Study Research, Qualitative Inquiry, doi
  34. (2003). From SHIQ and RDF to OWL: The Making of a Web Ontology Language, doi
  35. (2000). Fundamentals of Data Warehouses, doi
  36. (2003). G.Glover, D.Barnes, “Software Evolution in an Interdisciplinary Environment”, doi
  37. (1996). Gang Zhou,A framework for supporting data integration using the materialized and virtual approaches, doi
  38. (1996). Generating data integration mediators that use materialization, doi
  39. (2005). Globus Toolkit 4.0, http://www.globus.org/toolkit/docs/4.0/key/index.html, last accessed in
  40. (1999). Growing systems in emergent organizations, doi
  41. (1997). Himanshu Gupta, Héctor García-Molina, Jennifer Widom, The WHIPS prototype for data warehouse creation and maintenance, doi
  42. (2007). http://www.omii.ac.uk/wiki/SoftwareOverview, last accessed in
  43. (2006). http://www.uddi.org/pubs/uddi_v3.htm, last accessed in
  44. (1998). IEEE standard for software maintenance, 1219-1998, doi
  45. Information integration using logical views. doi
  46. (1997). InfoSleuth: agent-based semantic integration of information in open and dynamic environments Full text, doi
  47. (1996). Joann J.Ordille, Querying Heterogeneous Information Sources Using Source Descriptions Source,
  48. (2002). Katia Sycara, Semantic Matching of Web Services Capabilities, ISWC doi
  49. (2002). Khaled El Emam, Jarrett Rosenberg, Preliminary guidelines for empirical research in software engineering, doi
  50. (2004). Knowledge Representation and Reasoning, ISBN:1558609326, Year of Publication: doi
  51. (2008). Language for RDF, doi
  52. (2000). Logic-based techniques in data integration, Logic-based artificial intelligence, ISBN: 0-7923-7224-7, Year of Publication: doi
  53. (1997). Managing semantic heterogeneity in databases: A theoretical perspective, doi
  54. (1998). Marie-Christine Rousset, Combining Horn rules and description logics in CARIN, doi
  55. (1996). Marie-Christine Rousset, The limits on combining recursive horn rules and description logics, doi
  56. Mental Health Minimum Data Set
  57. (2005). Michael N.Huhns. “Service-Oriented Computing” doi
  58. (1998). On the Decidability of Query Containment under Constraints, doi
  59. (1998). On the Role of Software Architectures in Runtime System Reconfiguration, doi
  60. (2004). Ontology Language Overview, W3C Recommendation 10 Feburary
  61. (2006). Ordille, Data integration: the teenage years,
  62. (2004). Part 0: Primer Second Edition,
  63. (2004). Part 1: Structures Second Edition,
  64. (2004). Part 2: Datatypes Second Edition,
  65. (1997). Query processing in the SIMS information mediator Source, Readings in agents, ISBN:1-55860-495-2, Year of Publication:
  66. (1996). Query-Answering Algorithms for Information Agents, doi
  67. (1997). Rewriting queries using views in description logics, doi
  68. (1978). Ripple effect analysis of software maintenance, doi
  69. (2007). Schahram Dustdar, Frank Leymann, “Service-Oriented Computing: State of the Art and Research Challenges”, Computer, ISSN:0018-9162, Year of Publication: doi
  70. (1996). Semantic and schematic similarities between database objects: a context-based approach, doi
  71. (2001). Semantic integration of heterogeneous information sources, doi
  72. (2005). Service-Oriented Architecture: Concepts, Technology, and Design, ISBN:0131858580, Year of Publication:
  73. (2006). Service-Oriented design and development methodology, doi
  74. (2005). Services Choreography Description Language Version 1.0,
  75. (1995). Shari Lawrence Pfleeger, Case Studies for Method 186 and Tool Evaluation, doi
  76. (2000). Software maintenance and evolution: a roadmap, doi
  77. (1980). Software Maintenance Management, doi
  78. (2003). The description logic handbook: theory, implementation, and applications, ISBN:0-521-78176-0, Pages: 5-8, Year of Publication: doi
  79. (2003). The description logic handbook: theory, implementation, and applications, ISBN:0-521-78176-0, Pages: 51-55, Year of Publication: doi
  80. (2005). The design and implementation of Grid database services in OGSA-DAI’, Concurrency and Computation: Practice and Experience, Volume 17, Issue 2-4, Year of Publication: doi
  81. (1975). The entity-relationship model: toward a unified view of data, doi
  82. (2004). The Future of Web Services Full text, doi
  83. (1998). The grid: blueprint for a new computing infrastructure’, ISBN:1-55860-475-8, Year of Publication:
  84. (2007). The Protege; Ontology Editor and Knowledge Acquisition System,
  85. (1990). The relational model for database management: version 2, ISBN:0-201-14192-2, Year of Publication:
  86. (1997). The TSIMMIS Approach to Mediation: Data Models and Languages, doi
  87. (2001). Tuecke, “The Anatomy of the Grid: Enabling Scalable Virtual Organizations”. doi
  88. (2004). Using Web Services Technologies to create an Information Broker: An Experience Report’, doi
  89. (2000). Véronique Lattès and Marie-Christine Rousset, The Use of CARIN Language and Algorithms for Information Integration: The PICSEL System, doi
  90. (2005). Web Services Platform Architecture:
  91. (1996). Wei-Min Shen, Query reformulation for dynamic information integration Source, doi
  92. (2007). Willem-Jan Heuvel, Service-Oriented architectures: approaches, technologies and research issues, doi
  93. (2007). XQuery 1.0 and XPath 2.0 Data Model (XDM) W3C,
  94. (2007). XQuery 1.0 and XPath 2.0 Formal Semantics W3C, doi

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.