A SAT-based System for Consistent Query Answering
An inconsistent database is a database that violates one or more integrity
constraints, such as functional dependencies. Consistent Query Answering is a
rigorous and principled approach to the semantics of queries posed against
inconsistent databases. The consistent answers to a query on an inconsistent
database are the intersection of the answers to the query on every repair, i.e.,
on every consistent database that differs from the given inconsistent one in a
minimal way. Computing the consistent answers of a fixed conjunctive query on a
given inconsistent database can be a coNP-hard problem, even though every fixed
conjunctive query is efficiently computable on a given consistent database.
We designed, implemented, and evaluated CAvSAT, a SAT-based system for
consistent query answering. CAvSAT leverages a set of natural reductions from
the complement of consistent query answering to SAT and to Weighted MaxSAT. The
system is capable of handling unions of conjunctive queries and arbitrary
denial constraints, which include functional dependencies as a special case. We
report results from experiments evaluating CAvSAT on both synthetic and
real-world databases. These results provide evidence that a SAT-based approach
can give rise to a comprehensive and scalable system for consistent query
answering.
Comment: 25 pages including appendix, to appear in the 22nd International
Conference on Theory and Applications of Satisfiability Testing.
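To make the reduction concrete, here is a minimal sketch, not the authors' CAvSAT implementation, of how the complement of consistent query answering for a single key constraint can be encoded as a SAT instance, assuming the python-sat package; the relation, facts, and candidate answers are toy assumptions.
```python
# Minimal sketch (not CAvSAT itself) of reducing the complement of consistent
# query answering to SAT, assuming the `python-sat` package.
# Toy instance: relation R(key, val) with key-violating facts
#   f1 = R(a, 1), f2 = R(a, 2)   (conflict: same key, different values)
#   f3 = R(b, 1)                 (conflict-free)
# Query q(y) :- R(x, y); candidate answer y = 1 has witnesses {f1} and {f3}.
from pysat.solvers import Glucose3

FACT_VARS = {"f1": 1, "f2": 2, "f3": 3}  # x_f true iff fact f is kept in the repair

def is_consistent_answer(witness_sets):
    """The candidate is a consistent answer iff NO repair kills all of its
    witnesses, i.e., iff the formula below is unsatisfiable."""
    with Glucose3() as solver:
        # Repair constraints: keep exactly one fact per key-equal group.
        solver.add_clause([1, 2])      # at least one of f1, f2
        solver.add_clause([-1, -2])    # at most one of f1, f2
        solver.add_clause([3])         # f3 is conflict-free, so every repair keeps it
        # Complement of the answer: every witness loses at least one fact.
        for witness in witness_sets:
            solver.add_clause([-FACT_VARS[f] for f in witness])
        return not solver.solve()

print(is_consistent_answer([{"f1"}, {"f3"}]))  # True:  y = 1 holds in every repair
print(is_consistent_answer([{"f2"}]))          # False: repair {f1, f3} drops y = 2
```
For arbitrary denial constraints, where repairs are maximal consistent subsets rather than one-fact-per-group choices, the same idea would rely on Weighted MaxSAT instead of the exactly-one clauses above.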
Efficient Top-k Cloud Services Query Processing Using Trust and QoS
Reinforcement Learning for Data Preparation with Active Reward Learning
Data cleaning and data preparation have been long-standing challenges in data science to avoid incorrect results, biases, and misleading conclusions obtained from “dirty” data. For a given dataset and data analytics task, a plethora of data preprocessing techniques and alternative data cleaning strategies are available, but they may lead to dramatically different outputs with unequal result quality. For adequate data preparation, users generally do not know where to start or which methods to use. Most current work can be classified into two categories: 1) new data cleaning algorithms specific to certain types of data anomalies, usually considered in isolation and without a “pipeline vision” of the entire data preprocessing strategy; 2) automated machine learning (AutoML) approaches that can optimize the hyperparameters of a considered ML model with a list of default preprocessing methods. We argue that more effort should be devoted to a principled and adaptive data preparation approach that helps and learns from the user to select the optimal sequence of data preparation tasks and obtain the best quality of the final result. In this paper, we extend Learn2Clean, a method based on Q-Learning, a model-free reinforcement learning technique that selects, for a given dataset, a given ML model, and a pre-selected quality performance metric, the optimal sequence of tasks for preprocessing the data such that the quality metric is maximized. We discuss new results of Learn2Clean for semi-automating data preparation with “the human in the loop” using active reward learning and Q-learning.
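As an illustration of the selection mechanism, the following is a minimal tabular Q-learning sketch, not the Learn2Clean implementation; the pipeline steps and the quality_of reward are hypothetical stand-ins for the paper's preprocessing methods and the user- or model-supplied quality metric.
```python
# Minimal sketch of Q-learning that picks an order of data-preparation tasks so
# that a quality metric is maximized. PIPELINE_STEPS and quality_of are
# hypothetical placeholders, not taken from Learn2Clean.
import random
from collections import defaultdict

PIPELINE_STEPS = ["impute", "dedupe", "normalize", "outlier_filter"]

def quality_of(applied_steps):
    # Placeholder reward: in Learn2Clean this would be the chosen quality metric
    # of the downstream ML model (possibly refined by active reward learning).
    bonus = 0.2 if applied_steps[:1] == ("impute",) else 0.0
    return 0.1 * len(set(applied_steps)) + bonus

def train(episodes=2000, alpha=0.1, gamma=0.9, epsilon=0.2):
    Q = defaultdict(float)                      # Q[(state, action)]
    for _ in range(episodes):
        state, score = (), quality_of(())
        while len(state) < len(PIPELINE_STEPS):
            remaining = [s for s in PIPELINE_STEPS if s not in state]
            action = (random.choice(remaining) if random.random() < epsilon
                      else max(remaining, key=lambda a: Q[(state, a)]))
            next_state = state + (action,)
            new_score = quality_of(next_state)
            reward = new_score - score          # improvement of the quality metric
            future = max((Q[(next_state, a)] for a in PIPELINE_STEPS
                          if a not in next_state), default=0.0)
            Q[(state, action)] += alpha * (reward + gamma * future - Q[(state, action)])
            state, score = next_state, new_score
    return Q

Q = train()
state = ()
while len(state) < len(PIPELINE_STEPS):        # greedily read off the learned order
    best = max((s for s in PIPELINE_STEPS if s not in state), key=lambda a: Q[(state, a)])
    state += (best,)
print("learned preparation order:", state)
```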
Answering the Min-Cost Quality-Aware Query on Multi-Sources in Sensor-Cloud Systems
In sensor-based systems, the data of an object is often provided by multiple sources. Since the data quality of these sources might differ, it is necessary, when querying the observations, to carefully select the sources so that high-quality data is accessed. A solution is to perform a quality evaluation in the cloud and select a set of high-quality, low-cost data sources (i.e., sensors or small sensor networks) that can answer the queries. This paper studies the min-cost quality-aware query problem, which aims to find high-quality results from multiple sources at minimized cost. A measurement of query result quality is provided, and two methods for answering the min-cost quality-aware query are proposed. How to obtain a reasonable parameter setting is also discussed. Experiments on real-life data verify that the proposed techniques are efficient and effective.
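For illustration, below is a minimal greedy sketch of choosing low-cost, high-quality sources until a quality threshold is met; the fusion model, the example source list, and the threshold are assumptions and not the paper's actual methods or data.
```python
# Minimal sketch (not the paper's algorithm) of a greedy answer to a min-cost
# quality-aware query: pick sources whose combined quality reaches a threshold
# at low total cost. The fusion model 1 - prod(1 - q_i) is an assumption.
from math import prod

SOURCES = {                 # source -> (estimated quality, access cost)
    "sensor_a": (0.90, 5.0),
    "sensor_b": (0.70, 1.0),
    "sensor_c": (0.60, 0.8),
    "net_d":    (0.95, 9.0),
}

def combined_quality(selected):
    return 1.0 - prod(1.0 - SOURCES[s][0] for s in selected) if selected else 0.0

def min_cost_selection(threshold):
    chosen, rest = [], set(SOURCES)
    while combined_quality(chosen) < threshold and rest:
        # Pick the source with the best quality gain per unit of cost.
        def gain_per_cost(s):
            return (combined_quality(chosen + [s]) - combined_quality(chosen)) / SOURCES[s][1]
        best = max(rest, key=gain_per_cost)
        chosen.append(best)
        rest.remove(best)
    return chosen, sum(SOURCES[s][1] for s in chosen)

sources, cost = min_cost_selection(threshold=0.95)
print(sources, round(combined_quality(sources), 3), cost)
```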
Extending Graph Pattern Matching with Regular Expressions
Graph pattern matching, which is to compute the set M(Q, G) of matches of Q in G for a given pattern graph Q and data graph G, has been increasingly used in emerging applications, e.g., social network analysis. As the matching semantics is typically defined in terms of subgraph isomorphism, two key issues arise: the semantics is often too rigid to identify meaningful matches, and the problem is intractable, which calls for efficient matching methods. Motivated by this, this paper extends the matching semantics with regular expressions and investigates the top-k graph pattern matching problem. (1) We introduce regular patterns, which revise traditional pattern graphs by incorporating regular expressions, and extend the traditional matching semantics by allowing an edge to map to a regular path. With this extension, more meaningful matches can be captured. (2) We propose a relevance function, defined in terms of tightness of connectivity, for ranking matches. Based on this ranking function, we introduce the top-k graph pattern matching problem, denoted by TopK. (3) We show that TopK is intractable. Despite the hardness, we develop an algorithm with an early termination property, i.e., it finds the top-k matches without identifying the entire match set. (4) Using real-life and synthetic data, we experimentally verify that our top-k matching algorithms are effective and outperform traditional counterparts.
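To illustrate the edge-to-regular-path mapping on which the extended semantics relies, here is a minimal sketch, not the paper's TopK algorithm; the toy data graph, label regular expression, and hop bound are assumptions, and path length is used only as a rough proxy for tightness of connectivity.
```python
# Minimal sketch of matching one regular-pattern edge: map a pattern edge
# labeled by a regular expression to a data-graph path whose edge-label
# sequence matches it, preferring short (tightly connected) paths.
import re
from collections import deque

# Toy data graph: node -> list of (neighbor, edge label).
GRAPH = {
    "ann":  [("bob", "follows"), ("carl", "likes")],
    "bob":  [("carl", "follows")],
    "carl": [("dana", "follows")],
    "dana": [],
}

def shortest_regular_path(src, dst, label_regex, max_hops=4):
    """BFS over (node, label string); returns the fewest hops whose concatenated
    labels match `label_regex`, or None. Shorter paths mean higher relevance."""
    pattern = re.compile(label_regex)
    queue = deque([(src, "", 0)])
    while queue:
        node, labels, hops = queue.popleft()
        if node == dst and hops > 0 and pattern.fullmatch(labels):
            return hops
        if hops < max_hops:
            for nxt, lab in GRAPH[node]:
                queue.append((nxt, labels + lab + " ", hops + 1))
    return None

# Pattern edge ann -> dana labeled (follows )+ : dana is reachable from ann via a
# chain of "follows" edges; the 3-hop path is the tightest match for this edge.
print(shortest_regular_path("ann", "dana", r"(follows )+"))
```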