Search CORE

66 research outputs found

Ranking for Web Data Search Using On-The-Fly Data Integration

Author: Herzig Daniel Markus
Publication venue: KIT Scientific Publishing
Publication date: 30/07/2019
Field of study

Ranking - the algorithmic decision on how relevant an information artifact is for a given information need and the sorting of artifacts by their concluded relevancy - is an integral part of every search engine. In this book we investigate how structured Web data can be leveraged for ranking with the goal to improve the effectiveness of search. We propose new solutions for ranking using on-the-fly data integration and experimentally analyze and evaluate them against the latest baselines

Directory of Open Access Books (DOAB)

Model driven design and data integration in semantic web information systems

Author: Sluijs van der, K.A.M.
Publication venue: Technische Universiteit Eindhoven
Publication date: 01/01/2012
Field of study

The Web is quickly evolving in many ways. It has evolved from a Web of documents into a Web of applications in which a growing number of designers offer new and interactive Web applications with people all over the world. However, application design and implementation remain complex, error-prone and laborious. In parallel there is also an evolution from a Web of documents into a Web of `knowledge' as a growing number of data owners are sharing their data sources with a growing audience. This brings the potential new applications for these data sources, including scenarios in which these datasets are reused and integrated with other existing and new data sources. However, the heterogeneity of these data sources in syntax, semantics and structure represents a great challenge for application designers. The Semantic Web is a collection of standards and technologies that offer solutions for at least the syntactic and some structural issues. If offers semantic freedom and flexibility, but this leaves the issue of semantic interoperability. In this thesis we present Hera-S, an evolution of the Model Driven Web Engineering (MDWE) method Hera. MDWEs allow designers to create data centric applications using models instead of programming. Hera-S especially targets Semantic Web sources and provides a flexible method for designing personalized adaptive Web applications. Hera-S defines several models that together define the target Web application. Moreover we implemented a framework called Hydragen, which is able to execute the Hera-S models to run the desired Web application. Hera-S' core is the Application Model (AM) in which the main logic of the application is defined, i.e. defining the groups of data elements that form logical units or subunits, the personalization conditions, and the relationships between the units. Hera-S also uses a so-called Domain Model (DM) that describes the content and its structure. However, this DM is not Hera-S specific, but instead allows any Semantic Web source representation as its DM, as long as its content can be queried by the standardized Semantic Web query language SPARQL. The same holds for the User Model (UM). The UM can be used for personalization conditions, but also as a source of user-related content if necessary. In fact, the difference between DM and UM is conceptual as their implementation within Hydragen is the same. Hera-S also defines a presentation model (PM) which defines presentation details of elements like order and style. In order to help designers with building their Web applications we have introduced a toolset, Hera Studio, which allows to build the different models graphically. Hera Studio also provides some additional functionality like model checking and deployment of the models in Hydragen. Both Hera-S and its implementation Hydragen are designed to be flexible regarding the user of models. In order to achieve this Hydragen is a stateless engine that queries for relevant information from the models at every page request. This allows the models and data to be changed in the datastore during runtime. We show that one way to exploit this flexibility is by applying aspect-orientation to the AM. Aspect-orientation allows us to dynamically inject functionality that pervades the entire application. Another way to exploit Hera-S' flexibility is in reusing specialized components, e.g. for presentation generation. We present a configuration of Hydragen in which we replace our native presentation generation functionality by the AMACONT engine. AMACONT provides more extensive multi-level presentation generation and adaptation capabilities as well aspect-orientation and a form of semantic based adaptation. Hera-S was designed to allow the (re-)use of any (Semantic) Web datasource. It even opens up the possibility for data integration at the back end, by using an extendible storage layer in our database of choice Sesame. However, even though theoretically possible it still leaves much of the actual data integration issue. As this is a recurring issue in many domains, a broader challenge than for Hera-S design only, we decided to look at this issue in isolation. We present a framework called Relco which provides a language to express data transformation operations as well as a collection of techniques that can be used to (semi-)automatically find relationships between concepts in different ontologies. This is done with a combination of syntactic, semantic and collaboration techniques, which together provide strong clues for which concepts are most likely related. In order to prove the applicability of Relco we explore five application scenarios in different domains for which data integration is a central aspect. This includes a cultural heritage portal, Explorer, for which data from several datasources was integrated and was made available by a mapview, a timeline and a graph view. Explorer also allows users to provide metadata for objects via a tagging mechanism. Another application is SenSee: an electronic TV-guide and recommender. TV-guide data was integrated and enriched with semantically structured data from several sources. Recommendations are computed by exploiting the underlying semantic structure. ViTa was a project in which several techniques for tagging and searching educational videos were evaluated. This includes scenarios in which user tags are related with an ontology, or other tags, using the Relco framework. The MobiLife project targeted the facilitation of a new generation of mobile applications that would use context-based personalization. This can be done using a context-based user profiling platform that can also be used for user model data exchange between mobile applications using technologies like Relco. The final application scenario that is shown is from the GRAPPLE project which targeted the integration of adaptive technology into current learning management systems. A large part of this integration is achieved by using a user modeling component framework in which any application can store user model information, but which can also be used for the exchange of user model data

Repository TU/e

Pure OAI Repository

Recommended from our members

OptiqueVQS: A visual query system over ontologies for industry

Author: Arenas
Bevan
Bobed
Brunetti
Calvanese
Catarci
Catarci
Civili
Cuenca Grau
Damljanovic
Dey
Epstein
Giese
Glimm
Hogan
Kaufmann
Khalili
Kharlamov
Kharlamov
Kogalovsky
López
Marchionini
Nikolaou
Poggi
Qiu
Romero
Schraefel
Shneiderman
Skjæveland
Soylu
Soylu
Soylu
Soylu
Soylu
Soylu
Soylu
Spanos
Sutcliffe
ter Hofstede
Tran
Uren
Vega-Gorgojo
Vega-Gorgojo
Publication venue: 'IOS Press'
Publication date: 01/01/2017
Field of study

An important application of semantic technologies in industry has been the formalisation of information models using OWL 2 ontologies and the use of RDF for storing and exchanging application data. Moreover, legacy data can be virtualised as RDF using ontologies following the ontology-based data access (OBDA) approach. In all these applications, it is important to provide domain experts with query formulation tools for expressing their information needs in terms of queries over ontologies. In this work, we present such a tool, OptiqueVQS, which is designed based on our experience with OBDA applications in Statoil and Siemens and on best HCI practices for interdisciplinary engineering environments. OptiqueVQS implements a number of unique techniques distinguishing it from analogous query formulation systems. In particular, it exploits ontology projection techniques to enable graph-based navigation over an ontology during query construction. Secondly, while OptiqueVQS is primarily ontology driven, it exploits sampled data to enhance selection of data values for some data attributes. Finally, OptiqueVQS is built on well-grounded requirements, design rationale, and quality attributes. We evaluated OptiqueVQS with both domain experts and casual users and qualitatively compared our system against prominent visual systems for ontology-driven query formulation and exploration of semantic data. OptiqueVQS is available online and can be downloaded together with an example OBDA scenario

City Research Online

Crossref

NORA - Norwegian Open Research Archives

Strategies for Managing Linked Enterprise Data

Author: Galkin Mikhail
Publication venue: Universitäts- und Landesbibliothek Bonn
Publication date
Field of study

Data, information and knowledge become key assets of our 21st century economy. As a result, data and knowledge management become key tasks with regard to sustainable development and business success. Often, knowledge is not explicitly represented residing in the minds of people or scattered among a variety of data sources. Knowledge is inherently associated with semantics that conveys its meaning to a human or machine agent. The Linked Data concept facilitates the semantic integration of heterogeneous data sources. However, we still lack an effective knowledge integration strategy applicable to enterprise scenarios, which balances between large amounts of data stored in legacy information systems and data lakes as well as tailored domain specific ontologies that formally describe real-world concepts. In this thesis we investigate strategies for managing linked enterprise data analyzing how actionable knowledge can be derived from enterprise data leveraging knowledge graphs. Actionable knowledge provides valuable insights, supports decision makers with clear interpretable arguments, and keeps its inference processes explainable. The benefits of employing actionable knowledge and its coherent management strategy span from a holistic semantic representation layer of enterprise data, i.e., representing numerous data sources as one, consistent, and integrated knowledge source, to unified interaction mechanisms with other systems that are able to effectively and efficiently leverage such an actionable knowledge. Several challenges have to be addressed on different conceptual levels pursuing this goal, i.e., means for representing knowledge, semantic data integration of raw data sources and subsequent knowledge extraction, communication interfaces, and implementation. In order to tackle those challenges we present the concept of Enterprise Knowledge Graphs (EKGs), describe their characteristics and advantages compared to existing approaches. We study each challenge with regard to using EKGs and demonstrate their efficiency. In particular, EKGs are able to reduce the semantic data integration effort when processing large-scale heterogeneous datasets. Then, having built a consistent logical integration layer with heterogeneity behind the scenes, EKGs unify query processing and enable effective communication interfaces for other enterprise systems. The achieved results allow us to conclude that strategies for managing linked enterprise data based on EKGs exhibit reasonable performance, comply with enterprise requirements, and ensure integrated data and knowledge management throughout its life cycle

bonndoc – Der Publikationsserver der Universität Bonn

Recommended from our members

Ontology-based end-user visual query formulation: Why, what, who, how, and which?

Author: A Cali
A D’Ulizia
A Gomez-Perez
A Harth
A Jimeno-Yepes
A Katifori
A McAfee
A Segev
A Soylu
A Soylu
A Soylu
A Soylu
A Soylu
AHM Hofstede Ter
Ahmet Soylu
AK Dey
AS Dadzie
B Glimm
B Henderson-Sellers
B Shneiderman
B Shneiderman
BC Grau
BR Gaines
C Beshers
C Bettini
C Bizer
C Bobed
C Civili
C Martinez-Cruz
D Braga
D Damljanovic
D Howe
DE Spanos
Dmitriy Zheleznyakov
E Kapetanios
E Kaufmann
EF Codd
EF Codd
EF Codd
Ernesto Jimenez-Ruiz
Evgeny Kharlamov
F Benzi
F Fonseca
F Ham van
G Allen
G Lindgaard
G Marchionini
G Marchionini
G Tummarello
GL Lohse
H Kondylakis
H Storrle
HJ Levesque
Ian Horrocks
J Claussen
J Coutaz
J Gersh
J Kawash
J Mackinlay
J Minker
J Nielsen
J Nielsen
JA Gallud
JA Konstan
JF Sequeda
JM Brunetti
K Munir
K Siau
K Zheng
KL Siau
KY Whang
L Certo
L Cinque
LJ Campbell
M Angelaccioa
M Erwig
M Giese
M Kifer
M Latapy
M Salehie
M Turk
MA Hearst
Martin Giese
MC Schraefel
ML Wilson
MM Burnett
MM Zloof
MR Kogalovsky
MYM Yen
N Bevan
NH Balkir
O Kolomiyets
P Besnard
P Ingwersen
PD Bruza
PK Chen
PK Robertson
R Baeza-Yates
R Cassino
R Stevens
R Studer
RG Epstein
RM Friedhoff
RN Cuff
RW White
S Krivov
S Lederman
S Madden
S Philippi
S Spiekermann
T Berners-Lee
T Catarci
T Catarci
T Eiter
T Halpin
T Tran
TR Gruber
V Lopez
V Uren
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Value creation in an organisation is a time-sensitive and data-intensive process, yet it is often delayed and bounded by the reliance on IT experts extracting data for domain experts. Hence, there is a need for providing people who are not professional developers with the flexibility to pose relatively complex and ad hoc queries in an easy and intuitive way. In this respect, visual methods for query formulation undertake the challenge of making querying independent of users’ technical skills and the knowledge of the underlying textual query language and the structure of data. An ontology is more promising than the logical schema of the underlying data for guiding users in formulating queries, since it provides a richer vocabulary closer to the users’ understanding. However, on the one hand, today the most of world’s enterprise data reside in relational databases rather than triple stores, and on the other, visual query formulation has become more compelling due to ever-increasing data size and complexity—known as Big Data. This article presents and argues for ontology-based visual query formulation for end-users; discusses its feasibility in terms of ontology-based data access, which virtualises legacy relational databases as RDF, and the dimensions of Big Data; presents key conceptual aspects and dimensions, challenges, and requirements; and reviews, categorises, and discusses notable approaches and systems

City Research Online

Crossref

NORA - Norwegian Open Research Archives

Managing and Consuming Completeness Information for RDF Data Sources

Author: Darari Fariz
Publication venue
Publication date: 20/06/2017
Field of study

The ever increasing amount of Semantic Web data gives rise to the question: How complete is the data? Though generally data on the Semantic Web is incomplete, many parts of data are indeed complete, such as the children of Barack Obama and the crew of Apollo 11. This thesis aims to study how to manage and consume completeness information about Semantic Web data. In particular, we first discuss how completeness information can guarantee the completeness of query answering. Next, we propose optimization techniques of completeness reasoning and conduct experimental evaluations to show the feasibility of our approaches. We also provide a technique to check the soundness of queries with negation via reduction to query completeness checking. We further enrich completeness information with timestamps, enabling query answers to be checked up to when they are complete. We then introduce two demonstrators, i.e., CORNER and COOL-WD, to show how our completeness framework can be realized. Finally, we investigate an automated method to generate completeness statements from text on the Web via relation cardinality extraction

Qucosa

HSSS - Hochschulschriftenserver der SLUB

Technische Universität Dresden: Qucosa

Ranking for Web Data Search Using On-The-Fly Data Integration

Author: Herzig Daniel Markus
Publication venue: KIT Scientific Publishing, Karlsruhe
Publication date: 01/01/2014
Field of study

KITopen

Directory of Open Access Books (DOAB)

Storing and querying evolving knowledge graphs on the web

Author: Taelman Ruben
Publication venue: Universiteit Gent. Faculteit Ingenieurswetenschappen en Architectuur
Publication date: 01/01/2020
Field of study

Ghent University Academic Bibliography

Hybrid coding taxonomy for clinical search harmonization in Safe Havens

Author: Michael André Pinto Domingues
Publication venue
Publication date: 17/09/2020
Field of study

Repositório Aberto da Universidade do Porto

Data integration in the rail domain

Author: Morris Christopher Robert
Publication venue
Publication date: 01/07/2018
Field of study

The exchange of information is crucial to the operation of railways; starting with the distribution of timetables, information must constantly be exchanged in any railway network. The slow evolution of the information environment within the rail industry has resulted in the existence of a diverse range of systems, only able to exchange information essential to railway operations. Were the cost of data integration reduced, then further cost reductions and improvements to customer service would follow as barriers to the adoption of other technologies are removed. The need for data integration has already been studied extensively and has been included in the UK industry's rail technical strategy however, despite it's identification as a key technique for improving integration, uptake of ontology remains limited. This thesis considers techniques to reduce barriers to the take up of ontology in the UK rail industry, and presents a case study in which these techniques are applied. Amongst the key barriers to uptake identified are a lack of software engineers with ontology experience, and the diverse information environment within the rail domain. Techniques to overcomes these barriers using software based tools are considered, and example tools produced which aid the overcoming of these barriers. The case study presented is of a degraded mode signalling system, drawing data from a range of diverse sources, integrated using an ontology. Tools created to improve data integration are employed in this commercial project, successfully combing signalling data with (simulated) train positioning data

University of Birmingham Research Archive, E-theses Repository