Search CORE

2,388 research outputs found

Correction to: Schema Extraction for Deep Web Query Interfaces Using Heuristics Rules

Author: Chichang Jou
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

[[abstract]]The original copy of this article included incorrect data for“authors and affiliations”. A corrected version of the “authors and affiliations” data is provided below. The author would like to thank the editorial office and colleagues of Springer Science+Business Media, LLC for the assistance in correcting the error: Department of Information Management, Tamkang University, New Taipei City, Taiwan[[notice]]補正完

Tamkang University Institutional Repository

Ontology mapping: the state of the art

Author: Kalfoglou Yannis
Schorlemmer Marco
Publication venue
Publication date: 01/01/2003
Field of study

Ontology mapping is seen as a solution provider in today's landscape of ontology research. As the number of ontologies that are made publicly available and accessible on the Web increases steadily, so does the need for applications to use them. A single ontology is no longer enough to support the tasks envisaged by a distributed environment like the Semantic Web. Multiple ontologies need to be accessed from several applications. Mapping could provide a common layer from which several ontologies could be accessed and hence could exchange information in semantically sound manners. Developing such mapping has beeb the focus of a variety of works originating from diverse communities over a number of years. In this article we comprehensively review and present these works. We also provide insights on the pragmatics of ontology mapping and elaborate on a theoretical approach for defining ontology mapping

CiteSeerX

Southampton (e-Prints Soton)

Proceedings of the Workshop on the Wrapper Techniques for Legacy Systems

Author: Thiran Philippe
van den Heuvel Willem-Jan
Publication venue: Technische Universiteit Eindhoven
Publication date: 01/01/2004
Field of study

Repository of the University of Namur

Preface of the Proceedings of WRAP 2004

Author: Thiran Philippe
Van den Heuvel Willem-Jan
Publication venue: Technische Universiteit Eindhoven
Publication date: 01/01/2004
Field of study

Repository of the University of Namur

WRAP 2004 : first international workshop on wrapper techniques for legacy systems (In connection with the 11th working conference on reverse engineering) : workshop proceedings

Author
Publication venue: Technische Universiteit Eindhoven
Publication date: 01/01/2004
Field of study

Pure OAI Repository

WRAP 2004 : first international workshop on wrapper techniques for legacy systems (In connection with the 11th working conference on reverse engineering) : workshop proceedings

Author
Publication venue: Technische Universiteit Eindhoven
Publication date: 01/01/2004
Field of study

Pure OAI Repository

Proceedings of the International Workshop on Web Information Systems Modeling:WISM 2007

Author: Fransincar Flavius
Houben Geert-Jan
Thiran Philippe
Publication venue
Publication date: 01/01/2007
Field of study

Repository of the University of Namur

Natural Language Processing on Data Warehouses

Author: Maree Stiaan
Publication venue: Department of Statistical Sciences
Publication date: 27/10/2022
Field of study

The main problem addressed in this research was to use natural language to query data in a data warehouse. To this effect, two natural language processing models were developed and compared on a classic star-schema sales data warehouse with sales facts and date, location and item dimensions. Utterances are queries that people make with natural language, for example, What is the sales value for mountain bikes in Georgia for 1 July 2005?" The first model, the heuristics model, implemented an algorithm that steps through the sequence of utterance words and matches the longest number of consecutive words at the highest grain of the hierarchy. In contrast, the embedding model implemented the word2vec algorithm to create different kinds of vectors from the data warehouse. These vectors are aggregated and then the cosine similarity between vectors was used to identify concepts in the utterances that can be converted to a programming language. To understand question style, a survey was set up which then helped shape random utterances created to use for the evaluation of both methods. The first key insight and main premise for the embedding model to work is a three-step process of creating three types of vectors. The first step is to train vectors (word vectors) for each individual word in the data warehouse; this is called word embeddings. For instance, the word `bike' will have a vector. The next step is when the word vectors are averaged for each unique column value (column vectors) in the data warehouse, thus leaving an entry like `mountain bike' with one vector which is the average of the vectors for `mountain' and `bike'. Lastly, the utterance by the user is averaged (utterance vectors) by using the word vectors created in step one, and then, by using cosine similarity, the utterance vector is matched to the closest column vectors in order to identify data warehouse concepts in the utterance. The second key insight was to train word vectors firstly for location, then separately for item - in other words, per dimension (one set for location, and one set for item). Removing stop words was the third key insight, and the last key insight was to use Global Vectors to instantiate the training of the word vectors. The results of the evaluation of the models indicated that the embedding model was ten times faster than the heuristics model. In terms of accuracy, the embedding algorithm (95.6% accurate) also outperformed the heuristics model (70.1% accurate). The practical application of the research is that these models can be used as a component in a chatbot on data warehouses. Combined with a Structured Query Language query generation component, and building Application Programming Interfaces on top of it, this facilitates the quick and easy distribution of data; no knowledge of a programming language such as Structured Query Language is needed to query the data

Cape Town University OpenUCT

WISM'07 : 4th international workshop on web information systems modeling

Author
Publication venue: s.n.
Publication date: 01/01/2007
Field of study

Pure OAI Repository

WISM'07 : 4th international workshop on web information systems modeling

Author
Publication venue: s.n.
Publication date: 01/01/2007
Field of study

Pure OAI Repository