Search CORE

37 research outputs found

Provenance Circuits for Trees and Treelike Instances (Extended Version)

Author: B Kimelfeld
HL Bodlaender
M Bojańczyk
R Pichler
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Query evaluation in monadic second-order logic (MSO) is tractable on trees and treelike instances, even though it is hard for arbitrary instances. This tractability result has been extended to several tasks related to query evaluation, such as counting query results [3] or performing query evaluation on probabilistic trees [10]. These are two examples of the more general problem of computing augmented query output, that is referred to as provenance. This article presents a provenance framework for trees and treelike instances, by describing a linear-time construction of a circuit provenance representation for MSO queries. We show how this provenance can be connected to the usual definitions of semiring provenance on relational instances [20], even though we compute it in an unusual way, using tree automata; we do so via intrinsic definitions of provenance for general semirings, independent of the operational details of query evaluation. We show applications of this provenance to capture existing counting and probabilistic results on trees and treelike instances, and give novel consequences for probability evaluation.Comment: 48 pages. Presented at ICALP'1

arXiv.org e-Print Archive

HAL - Lille 3

CiteSeerX

Crossref

INRIA a CCSD electronic archive server

Optimizing and parallelizing ranked enumeration

Author: Cohen S.
Hristidis V.
Kacholia V.
Kimelfeld B.
Kimelfeld B.
Publication venue: 'VLDB Endowment'
Publication date
Field of study

Crossref

Modeling and querying probabilistic XML data

Author: Benny Kimelfeld
Cohen S.
Kimelfeld B.
Nierman A.
Ré C.
Yehoshua Sagiv
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

Optimal top-k query evaluation for weighted business processes

Author: Jones T.
Kemeny J.
Kimelfeld B.
Publication venue: 'VLDB Endowment'
Publication date
Field of study

Crossref

Probabilistic XML via Markov Chains

Author: Kimelfeld B.
Stockmeyer L.
Thomas W.
Publication venue: 'VLDB Endowment'
Publication date
Field of study

Crossref

Uncertain XML Documents Classification Using Extreme Learning Machine

Author: B. Kimelfeld
B. Kimelfeld
B. Ning
B. Ning
G.B. Huang
J. Li
J. Liu
L. Chang
P. Senellart
S. Abiteboul
Y. Sun
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Crossref

Spatial voting with incomplete voter information

Author: Brill Markus
Imber A.
Israel J.
Kimelfeld B.
Shachnai H.
Publication venue: AAAI Digital Library
Publication date: 24/03/2024
Field of study

When selecting committees based on preferences of voters, a variety of different criteria can be considered. Two natural objectives are maximizing the utilitarian welfare (the sum of voters' utilities) and coverage (the number of represented voters) of the selected committee. Previous work has studied the impact on utilitarian welfare and coverage when requiring the committee to satisfy minimal requirements such as justified representation or weak proportionality. In this paper, we consider the impact of imposing much more demanding proportionality axioms. We identify a class of voting rules that achieve strong guarantees on utilitarian welfare and coverage when combined with appropriate completions. This class is defined via a weakening of priceability and contains prominent rules such as the Method of Equal Shares. We show that committees selected by these rules (i) can be completed to achieve optimal coverage and (ii) can be completed to achieve an asymptotically optimal approximation to the utilitarian welfare if they additionally satisfy EJR+. Answering an open question of Elkind et al. (2022), we use the Greedy Justified Candidate Rule to obtain the best possible utilitarian guarantee subject to proportionality. We also consider completion methods suggested in the participatory budgeting literature and other objectives besides welfare and coverage

Warwick Research Archives Portal Repository

Declarative probabilistic programming with Datalog

Author: Barany V
Kimelfeld b
Olteanu D
Ten Cate B
Vagena Z
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2017
Field of study

Probabilistic programming languages are used for developing statistical models. They typically consist of two components: a specification of a stochastic process (the prior), and a specification of observations that restrict the probability space to a conditional subspace (the posterior). Use cases of such formalisms include the development of algorithms in machine learning and artificial intelligence. In this article we establish a probabilistic-programming extension of Datalog that, on the one hand, allows for defining a rich family of statistical models, and on the other hand retains the fundamental properties of declarativity. Our proposed extension provides mechanisms to include common numerical probability functions; in particular, conclusions of rules may contain values drawn from such functions. The semantics of a program is a probability distribution over the possible outcomes of the input database with respect to the program. Observations are naturally incorporated by means of integrity constraints over the extensional and intensional relations. The resulting semantics is robust under different chases and invariant to rewritings that preserve logical equivalence.</p

Oxford University Research Archive

Declarative probabilistic programming with Datalog

Author: Barany V
Kimelfeld b
Olteanu DA
Ten Cate B
Vagena Z
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 27/10/2017
Field of study

Oxford University Research Archive

Declarative probabilistic programming with datalog

Author: Barany V
Cate BT
Kimelfeld B
Olteanu D
Vagena Z
Publication venue: Dagstuhl Publishing
Publication date: 01/01/2016
Field of study

Probabilistic programming languages are used for developing statistical models, and they typically consist of two components: a specification of a stochastic process (the prior), and a specification of observations that restrict the probability space to a conditional subspace (the posterior). Use cases of such formalisms include the development of algorithms in machine learning and artificial intelligence. We propose and investigate an extension of Datalog for specifying statistical models, and establish a declarative probabilistic-programming paradigm over databases. Our proposed extension provides convenient mechanisms to include common numerical probability functions; in particular, conclusions of rules may contain values drawn from such functions. The semantics of a program is a probability distribution over the possible outcomes of the input database with respect to the program. Observations are naturally incorporated by means of integrity constraints over the extensional and intensional relations. The resulting semantics is robust under different chases and invariant to rewritings that preserve logical equivalence

Crossref

Dagstuhl Research Online Publication Server

Oxford University Research Archive