Search CORE

8,937 research outputs found

Combining quantifications for flexible query result ranking

Author: Billiet Christophe
De Tré Guy
Publication venue: IEEE Xplore
Publication date: 01/01/2015
Field of study

Databases contain data and database systems governing such databases are often intended to allow a user to query these data. On one hand, these data may be subject to imperfections, on the other hand, users may employ imperfect query preference specifications to query such databases. All of these imperfections lead to each query answer being accompanied by a collection of quantifications indicating how well (part of) a group of data complies with (part of) the user's query. A fundamental question is how to present the user with the query answers complying best to his or her query preferences. The work presented in this paper first determines the difficulties to overcome in reaching such presentation. Mainly, a useful presentation needs the ranking of the query answers based on the aforementioned quantifications, but it seems advisable to not combine quantifications with different interpretations. Thus, the work presented in this paper continues to introduce and examine a novel technique to determine a query answer ranking. Finally, a few aspects of this technique, among which its computational efficiency, are discussed

Crossref

Ghent University Academic Bibliography

Qualitative Effects of Knowledge Rules in Probabilistic Data Integration

Author: Keijzer A. de
Keulen M. van
Publication venue: Centre for Telematics and Information Technology, University of Twente
Publication date: 01/01/2008
Field of study

One of the problems in data integration is data overlap: the fact that different data sources have data on the same real world entities. Much development time in data integration projects is devoted to entity resolution. Often advanced similarity measurement techniques are used to remove semantic duplicates from the integration result or solve other semantic conflicts, but it proofs impossible to get rid of all semantic problems in data integration. An often-used rule of thumb states that about 90% of the development effort is devoted to solving the remaining 10% hard cases. In an attempt to significantly decrease human effort at data integration time, we have proposed an approach that stores any remaining semantic uncertainty and conflicts in a probabilistic database enabling it to already be meaningfully used. The main development effort in our approach is devoted to defining and tuning knowledge rules and thresholds. Rules and thresholds directly impact the size and quality of the integration result. We measure integration quality indirectly by measuring the quality of answers to queries on the integrated data set in an information retrieval-like way. The main contribution of this report is an experimental investigation of the effects and sensitivity of rule definition and threshold tuning on the integration quality. This proves that our approach indeed reduces development effort — and not merely shifts the effort to rule definition and threshold tuning — by showing that setting rough safe thresholds and defining only a few rules suffices to produce a ‘good enough’ integration that can be meaningfully used

CiteSeerX

University of Twente Research Information

Data Model and Query Constructs for Versatile Web Query Languages

Author: Bry François
Furche Tim
Linse Benedikt
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2006
Field of study

As the Semantic Web is gaining momentum, the need for truly versatile query languages becomes increasingly apparent. A Web query language is called versatile if it can access in the same query program data in different formats (e.g. XML and RDF). Most query languages are not versatile: they have not been specifically designed to cope with both worlds, providing a uniform language and common constructs to query and transform data in various formats. Moreover, most of them do not provide a flexible data model that is powerful enough to naturally convey both Semantic Web data formats (especially RDF and Topic Maps) and XML. This article highlights challenges related to the data model and language constructs for querying both standard Web and Semantic Web data with an emphasis on facilitating sophisticated reasoning. It is shown that Xcerpt’s data model and querying constructs are particularly well-suited for the Semantic Web, but that some adjustments of the Xcerpt syntax allow for even more effective and natural querying of RDF and Topic Maps

CiteSeerX

Crossref

Open Access LMU

Survey over Existing Query and Transformation Languages

Author: Bolzer Oliver
Bry François
Furche Tim
Horrocks Ian
Kraus Michael
Orsini Renzo
Schaffert Sebastian
Publication venue
Publication date: 01/01/2004
Field of study

A widely acknowledged obstacle for realizing the vision of the Semantic Web is the inability of many current Semantic Web approaches to cope with data available in such diverging representation formalisms as XML, RDF, or Topic Maps. A common query language is the first step to allow transparent access to data in any of these formats. To further the understanding of the requirements and approaches proposed for query languages in the conventional as well as the Semantic Web, this report surveys a large number of query languages for accessing XML, RDF, or Topic Maps. This is the first systematic survey to consider query languages from all these areas. From the detailed survey of these query languages, a common classification scheme is derived that is useful for understanding and differentiating languages within and among all three areas

CiteSeerX

Open Access LMU

Mediated data integration and transformation for web service-based software architectures

Author: Pahl Claus
Zhu Yaoling
Publication venue
Publication date: 06/12/2006
Field of study

Service-oriented architecture using XML-based web services has been widely accepted by many organisations as the standard infrastructure to integrate heterogeneous and autonomous data sources. As a result, many Web service providers are built up on top of the data sources to share the data by supporting provided and required interfaces and methods of data access in a unified manner. In the context of data integration, problems arise when Web services are assembled to deliver an integrated view of data, adaptable to the specific needs of individual clients and providers. Traditional approaches of data integration and transformation are not suitable to automate the construction of connectors dedicated to connect selected Web services to render integrated and tailored views of data. We propose a declarative approach that addresses the oftenneglected data integration and adaptivity aspects of serviceoriented architecture

Irish Universities

DCU Online Research Access Service

Four Lessons in Versatility or How Query Languages Adapt to the Web

Author: A. Bonifati
A. Gelder van
A. Polleres
A. Polleres
A.C. Klug
B. Adida
B. Cooper
B. Jenner
D. Olteanu
D. Olteanu
D. Recordon
D.D. Chamberlin
D.R. Fulkerson
E. Augurusa
F. Bry
F. Bry
F. Bry
F. Wei
F. Weigel
G. Gottlob
G. Karvounarakis
H. Björklund
H. Garcia-Molina
H. Meuss
H. Meuss
H. Przymusinska
H. Tamaki
H. Wang
H.V. Jagadish
J. Bailey
J. Euzenat
J. Pérez
J. Pérez
J. Pérez
J.D. Ullman
J.J. Carroll
J.V.D. Bussche
K. Kochut
K.A. Ross
K.R. Apt
K.S. Booth
L. Cabibbo
M. Habib
M. Kay
M. Marx
M. Marx
N. Bruno
N. Walsh
P. Boncz
P. Buneman
P. Cholak
P. O’Neil
P.G. Kolaitis
P.P. Schneider
R. Agrawal
R. Fagin
R. Goldman
R. Hull
R. Khare
R. Khare
R. Schenkel
S. Abiteboul
S. Abiteboul
S. Abiteboul
S. Al-Khalifa
S. Berger
S. Groppe
S. Trißl
T. Chen
T. Furche
T. Grust
T. Schwentick
T.C. Przymusinski
U. Assmann
W. Akhtar
W. Chen
W.L. Hsu
W.L. Hsu
Z. Chen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Exposing not only human-centered information, but machine-processable data on the Web is one of the commonalities of recent Web trends. It has enabled a new kind of applications and businesses where the data is used in ways not foreseen by the data providers. Yet this exposition has fractured the Web into islands of data, each in different Web formats: Some providers choose XML, others RDF, again others JSON or OWL, for their data, even in similar domains. This fracturing stifles innovation as application builders have to cope not only with one Web stack (e.g., XML technology) but with several ones, each of considerable complexity. With Xcerpt we have developed a rule- and pattern based query language that aims to give shield application builders from much of this complexity: In a single query language XML and RDF data can be accessed, processed, combined, and re-published. Though the need for combined access to XML and RDF data has been recognized in previous work (including the W3C’s GRDDL), our approach differs in four main aspects: (1) We provide a single language (rather than two separate or embedded languages), thus minimizing the conceptual overhead of dealing with disparate data formats. (2) Both the declarative (logic-based) and the operational semantics are unified in that they apply for querying XML and RDF in the same way. (3) We show that the resulting query language can be implemented reusing traditional database technology, if desirable. Nevertheless, we also give a unified evaluation approach based on interval labelings of graphs that is at least as fast as existing approaches for tree-shaped XML data, yet provides linear time and space querying also for many RDF graphs. We believe that Web query languages are the right tool for declarative data access in Web applications and that Xcerpt is a significant step towards a more convenient, yet highly efficient data access in a “Web of Data”

CiteSeerX

Crossref

Open Access LMU

Twelve Theses on Reactive Rules for the Web

Author: F. Bry
F. Bry
F. Bry
G.F. Knolmayer
J. Bailey
J. Bailey
M. Winslett
S. Preibusch
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2006
Field of study

Reactivity, the ability to detect and react to events, is an essential functionality in many information systems. In particular, Web systems such as online marketplaces, adaptive (e.g., recommender) systems, and Web services, react to events such as Web page updates or data posted to a server. This article investigates issues of relevance in designing high-level programming languages dedicated to reactivity on the Web. It presents twelve theses on features desirable for a language of reactive rules tuned to programming Web and Semantic Web applications

CiteSeerX

Crossref

Open Access LMU

Taming Data Explosion in Probabilistic Information Integration

Author: Keijzer Ander de
Keulen Maurice van
Li Yiping
Publication venue: University of Twente, Centre for Telematics and Information Technology
Publication date: 01/01/2006
Field of study

Data integration has been a challenging problem for decades. In an ambient environment, where many autonomous devices have their own information sources and network connectivity is ad hoc and peer-to-peer, it even becomes a serious bottleneck. To enable devices to exchange information without the need for interaction with a user at data integration time and without the need for extensive semantic annotations, a probabilistic approach seems rather promising. It simply teaches the device how to cope with the uncertainty occurring during data integration. Unfortunately, without any kind of world knowledge, almost everything becomes uncertain, hence maintaining all possibilities produces huge integrated information sources. In this paper, we claim that only very simple and generic rules are enough world knowledge to drastically reduce the amount of uncertainty, hence to tame the data explosion to a manageable size

CiteSeerX

University of Twente Research Information

On various forms of bipolarity in flexible querying

Author: Dubois Didier
Prade Henri
Publication venue: 'Atlantis Press'
Publication date: 01/01/2013
Field of study

International audienceThe paper discusses the modeling of “if possible" in requirements of the form “A and if possible B". We distinguish between two types of understanding: either i) A and B are requirements of the same nature and are viewed as constraints with different levels of priority, or ii) they are of different nature (only A induces constraint(s) and B is only used for breaking ties among items that are equally satisfying A). We indicate that the two views are related to different types of bipolarity, and discuss them in relation with possibilistic logic. The disjunctive dual of the first view (“A or at least B") is then presented in this logical setting. We also briefly mention the idea of an extension of the second view where B may refer both to bonus conditions or malus conditions that may increase or decrease respectively the interest in an item satisfying A

Scientific Publications of the University of Toulouse II Le Mirail

OPUS - University of Technology Sydney

Open Archive Toulouse Archive Ouverte

CHORUS Deliverable 2.1: State of the Art on Multimedia Search Engines

Author: Boujemaa Nozha
Compañó Ramón
Dosch Christoph
Geurts Joost
Karlgren Jussi
King Paul
Kompatsiaris Yiannis
Köhler Joachim
Le Moine Jean-Yves
Ortgies Robert
Point Jean-Charles
Rotenberg Boris
Rudström Åsa
Sebe Nicu
Publication venue: Chorus Project Consortium
Publication date: 01/01/2007
Field of study

Based on the information provided by European projects and national initiatives related to multimedia search as well as domains experts that participated in the CHORUS Think-thanks and workshops, this document reports on the state of the art related to multimedia content search from, a technical, and socio-economic perspective. The technical perspective includes an up to date view on content based indexing and retrieval technologies, multimedia search in the context of mobile devices and peer-to-peer networks, and an overview of current evaluation and benchmark inititiatives to measure the performance of multimedia search engines. From a socio-economic perspective we inventorize the impact and legal consequences of these technical advances and point out future directions of research

RISE – Research Institutes of Sweden

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Swedish Institute of Computer Science Publications Database

Software institutes' Online Digital Archive