Search CORE

152 research outputs found

When Can Matrix Query Languages Discern Matrices?

Author: Geerts Floris
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 23rd International Conference on Database Theory (ICDT 2020)
Publication date: 01/01/2020
Field of study

We investigate when two graphs, represented by their adjacency matrices, can be distinguished by means of sentences formed in MATLANG, a matrix query language which supports a number of elementary linear algebra operators. When undirected graphs are concerned, and hence the adjacency matrices are real and symmetric, precise characterisations are in place when two graphs (i.e., their adjacency matrices) can be distinguished. Turning to directed graphs, one has to deal with asymmetric adjacency matrices. This complicates matters. Indeed, it requires to understand the more general problem of when two arbitrary matrices can be distinguished in MATLANG. We provide characterisations of the distinguishing power of MATLANG on real and complex matrices, and on adjacency matrices of directed graphs in particular. The proof techniques are a combination of insights from the symmetric matrix case and results from linear algebra and linear control theory

Dagstuhl Research Online Publication Server

Institutional Repository Universiteit Antwerpen

A Uniform Dependency Language for Improving Data Quality

Author: Fan Wenfei
Geerts Floris
Publication venue
Publication date: 01/01/2011
Field of study

Edinburgh Research Explorer

Institutional Repository Universiteit Antwerpen

On the Expressive Power of Linear Algebra on Graphs

Author: Geerts Floris
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 22nd International Conference on Database Theory (ICDT 2019)
Publication date: 01/01/2019
Field of study

Most graph query languages are rooted in logic. By contrast, in this paper we consider graph query languages rooted in linear algebra. More specifically, we consider MATLANG, a matrix query language recently introduced, in which some basic linear algebra functionality is supported. We investigate the problem of characterising equivalence of graphs, represented by their adjacency matrices, for various fragments of MATLANG. A complete picture is painted of the impact of the linear algebra operations in MATLANG on their ability to distinguish graphs

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

Institutional Repository Universiteit Antwerpen

Semandaq: a data quality system based on conditional functional dependencies

Author: Fan Wenfei
Geerts Floris
Jia Xibei
Publication venue
Publication date: 01/01/2008
Field of study

Edinburgh Research Explorer

Institutional Repository Universiteit Antwerpen

A revival of integrity constraints for data cleaning

Author: Fan Wenfei
Geerts Floris
Jia Xibei
Publication venue
Publication date: 01/01/2008
Field of study

Integrity constraints, a.k.a . data dependencies, are being widely used for improving the quality of schema . Recently constraints have enjoyed a revival for improving the quality of data . The tutorial aims to provide an overview of recent advances in constraint-based data cleaning. </jats:p

Crossref

Edinburgh Research Explorer

Institutional Repository Universiteit Antwerpen

Making Queries Tractable on Big Data with Preprocessing

Author: Fan Wenfei
Geerts Floris
Neven Frank
Publication venue
Publication date: 01/01/2013
Field of study

A query class is traditionally considered tractable if there exists a polynomial-time (PTIME) algorithm to answer its queries. When it comes to big data, however, PTIME al-gorithms often become infeasible in practice. A traditional and effective approach to coping with this is to preprocess data off-line, so that queries in the class can be subsequently evaluated on the data efficiently. This paper aims to pro-vide a formal foundation for this approach in terms of com-putational complexity. (1) We propose a set of Π-tractable queries, denoted by ΠT0Q, to characterize classes of queries that can be answered in parallel poly-logarithmic time (NC) after PTIME preprocessing. (2) We show that several natu-ral query classes are Π-tractable and are feasible on big data. (3) We also study a set ΠTQ of query classes that can be ef-fectively converted to Π-tractable queries by re-factorizing its data and queries for preprocessing. We introduce a form of NC reductions to characterize such conversions. (4) We show that a natural query class is complete for ΠTQ. (5) We also show that ΠT0Q ⊂ P unless P = NC, i.e., the set ΠT0Q of all Π-tractable queries is properly contained in the set P of all PTIME queries. Nonetheless, ΠTQ = P, i.e., all PTIME query classes can be made Π-tractable via proper re-factorizations. This work is a step towards understanding the tractability of queries in the context of big data. 1

CiteSeerX

Edinburgh Research Explorer