학위논문 (석사)-- 서울대학교 대학원 : 전기·컴퓨터공학부, 2015. 8. 김형주.Today, most of the data on the web is stored in relational databases, which is called deep web. Semantic web is a movement to the next generation of the web, where all data are augmented with well-defined semantics and linked together in machine-readable format. RDB2RDF approaches have been proposed and standardized by W3C, which publishes relational data to semantic web by converting relational data into RDF formatted data. We propose a system that automatically transforms relational data into RDF data and creates OWL ontology based on the schema of database. Some approaches have been proposed, but most of them did not fully make use of schema information to extract rich semantics, nor did they experimented on large databases for performance. We utilize Hadoop framework in transformation process, which enables distributed system for scalability. We present mapping rules that implements augmented direct mapping to create local ontology with rich semantics. The results show that our system successfully transforms relational data into RDF data with OWL ontology, with satisfactory performance on large-sized databases.Abstract i
Introduction 3
Related Work 7
2.1 Semantic ETL Systems 7
2.2 Hadoop MapReduce 8
2.3 Mapping Approaches 9
Mapping Rules 14
3.1 General Rule 1 19
3.2 General Rule 2 20
3.3 General Rule 3 20
3.4 General Rule 4 21
3.5 General Rule 5 21
3.6 Constraint Rule 1 22
3.7 Constraint Rule 2 22
3.8 Constraint Rule 3 23
3.9 Constraint Rule 4 24
3.10 Constraint Rule 5 24
3.11 Constraint Rule 6 25
3.12 Discussion 26
Our Approach 28
4.1 Preprocessing 28
4.1.1 Schema Caching Method 30
4.1.2 Relational Data 32
4.2 Hadoop Algorithm 33
Experiment 36
5.1 Ontology Extraction 37
5.2 Performance 38
5.3 Scalability 41
Conclusion 42
Reference 44
Appendix 46Maste