On the Complexity of Mining Itemsets from the Crowd Using Taxonomies
We study the problem of frequent itemset mining in domains where data is not
recorded in a conventional database but only exists in human knowledge. We
provide examples of such scenarios, and present a crowdsourcing model for them.
The model uses the crowd as an oracle to find out whether an itemset is
frequent or not, and relies on a known taxonomy of the item domain to guide the
search for frequent itemsets. In the spirit of data mining with oracles, we
analyze the complexity of this problem in terms of (i) crowd complexity, that
measures the number of crowd questions required to identify the frequent
itemsets; and (ii) computational complexity, that measures the computational
effort required to choose the questions. We provide lower and upper complexity
bounds in terms of the size and structure of the input taxonomy, as well as the
size of a concise description of the output itemsets. We also provide
constructive algorithms that achieve the upper bounds, and consider more
efficient variants for practical situations.
Comment: 18 pages, 2 figures. To be published to ICDT'13. Added missing
acknowledgement
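The oracle model in this abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: it handles single items only (not itemsets), assumes the taxonomy is a tree, and assumes that an infrequent category has no frequent descendants. The names `mine_frequent`, `taxonomy`, and `is_frequent` are hypothetical, and the crowd is simulated by a lookup in a fixed set.

```python
from typing import Callable, Dict, List, Tuple


def mine_frequent(
    taxonomy: Dict[str, List[str]],
    root: str,
    is_frequent: Callable[[str], bool],
) -> Tuple[List[str], int]:
    """Top-down search over a tree-shaped taxonomy.

    Returns the frequent items found and the number of oracle questions
    asked (a stand-in for crowd complexity).
    """
    frequent: List[str] = []
    questions = 0
    stack = [root]
    while stack:
        item = stack.pop()
        questions += 1  # one crowd question per visited item
        if is_frequent(item):
            frequent.append(item)
            # Only the children of a frequent item can still be frequent.
            stack.extend(taxonomy.get(item, []))
        # Assumption: an infrequent item has no frequent descendants,
        # so its whole subtree is skipped without further questions.
    return frequent, questions


# Hypothetical taxonomy (parent -> children) and simulated crowd answers.
taxonomy = {
    "sport": ["ball game", "water sport"],
    "ball game": ["tennis", "soccer"],
    "water sport": ["surfing", "diving"],
}
truth = {"sport", "ball game", "soccer"}

frequent, questions = mine_frequent(taxonomy, "sport", lambda x: x in truth)
```

In this toy run the search asks 5 questions for a taxonomy of 7 items, because the subtree under "water sport" is never queried.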
Grothendieck Rings of Theories of Modules
The model-theoretic Grothendieck ring of a first order structure, as defined
by Krajíček and Scanlon, captures some combinatorial properties of the
definable subsets of finite powers of the structure. In this paper we compute
the Grothendieck ring of a right module over an arbitrary unital ring.
As a corollary we prove a conjecture of Prest that this Grothendieck ring
is non-trivial whenever the module is non-zero. The main proof uses
various techniques from the homology theory of simplicial complexes.
Comment: 42 pages
Ramsey numbers for partially-ordered sets
We present a refinement of Ramsey numbers by considering graphs with a
partial ordering on their vertices. This is a natural extension of the ordered
Ramsey numbers. We formalize situations in which we can use arbitrary families
of partially-ordered sets to form host graphs for Ramsey problems. We explore
connections to well-studied Turán-type problems in partially-ordered sets,
particularly those in the Boolean lattice. We find a strong difference between
Ramsey numbers on the Boolean lattice and ordered Ramsey numbers when the
partial ordering on the graphs has large antichains.
Comment: 18 pages, 3 figures, 1 table
Poset Ramsey number $R(P,Q_n)$. III. N-shaped poset
Given partially ordered sets (posets) $(P, \leq_P)$ and $(P', \leq_{P'})$, we
say that $P'$ contains a copy of $P$ if for some injective function
$f\colon P \to P'$ and for any $A, B \in P$, $A \leq_P B$ if and only if
$f(A) \leq_{P'} f(B)$. For any posets $P$ and $Q$, the poset Ramsey number
$R(P,Q)$ is the least positive integer $N$ such that no matter how the elements
of an $N$-dimensional Boolean lattice are colored in blue and red, there is
either a copy of $P$ with all blue elements or a copy of $Q$ with all red
elements.
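The definition of a copy can be checked by brute force on small cases. The sketch below is only an illustration under stated assumptions: elements of the Boolean lattice are encoded as bitmasks ordered by set inclusion, the helper names `subset_leq`, `contains_copy`, and `n_leq` are hypothetical, and the N-shaped poset uses illustrative labels A, B, C, D with A < C, B < D, B < C. The search tries every injective map, so it is feasible only for very small posets.

```python
from itertools import permutations
from typing import Callable, Hashable, Sequence


def subset_leq(a: int, b: int) -> bool:
    """Order of the Boolean lattice: bitmask a is a subset of bitmask b."""
    return a & b == a


def contains_copy(
    p_elems: Sequence[Hashable],
    p_leq: Callable[[Hashable, Hashable], bool],
    host: Sequence[Hashable],
    host_leq: Callable[[Hashable, Hashable], bool],
) -> bool:
    """Return True if some injective map f from P into the host satisfies
    x <= y in P if and only if f(x) <= f(y) in the host."""
    for image in permutations(host, len(p_elems)):
        f = dict(zip(p_elems, image))
        if all(
            p_leq(x, y) == host_leq(f[x], f[y])
            for x in p_elems
            for y in p_elems
        ):
            return True
    return False


# N-shaped poset with illustrative labels: A < C, B < D, B < C.
N_ELEMS = ["A", "B", "C", "D"]
N_STRICT = {("A", "C"), ("B", "D"), ("B", "C")}


def n_leq(x: str, y: str) -> bool:
    return x == y or (x, y) in N_STRICT


q2 = list(range(2 ** 2))  # 2-dimensional Boolean lattice
q3 = list(range(2 ** 3))  # 3-dimensional Boolean lattice

in_q2 = contains_copy(N_ELEMS, n_leq, q2, subset_leq)
in_q3 = contains_copy(N_ELEMS, n_leq, q3, subset_leq)
```

Here `in_q2` is False, since the empty set is comparable to every element of the 2-dimensional lattice while every element of the N-shaped poset is incomparable to some other element. `in_q3` is True, witnessed for example by A = {1}, B = {2}, C = {1,2}, D = {2,3}.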
We focus on the poset Ramsey number $R(P,Q_n)$ for a fixed poset $P$ and an
$n$-dimensional Boolean lattice $Q_n$, as $n$ grows large. It is known that
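$n + c_1(P) \leq R(P,Q_n) \leq c_2(P)\,n$, for positive constants $c_1$ and $c_2$.
However, there is no poset $P$ known for which $R(P,Q_n) > (1+\epsilon)n$, for
$\epsilon > 0$. This paper is devoted to a new method for finding upper bounds
on $R(P,Q_n)$ using a duality between copies of $Q_n$ and sets of elements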
that cover them, referred to as blockers. We prove several properties of
blockers and their direct relation to the Ramsey numbers. Using these
properties we show that $R(\mathcal{N},Q_n) = n + \Theta(n/\log n)$, for a poset
$\mathcal{N}$ with four elements $A$, $B$, $C$, and $D$, such that $A < C$, $B < D$,
$B < C$, and the remaining pairs of elements are incomparable.
Comment: 19 pages, 6 figures