1 research outputs found
Making problems tractable on big data via preprocessing with polylog-size output
To provide a dichotomy between those queries that can be made feasible on big
data after appropriate preprocessing and those for which preprocessing does not
help, Fan et al. developed the -tractability theory. This theory
provides a formal foundation for understanding the tractability of query
classes in the context of big data. Along this line, we introduce a novel
notion of -tractability in this paper. Inspired by some technologies
used to deal big data, we place a restriction on preprocessing function, which
limits the function to produce a relatively small database as output, at most
polylog-size of the input database. At the same time, we bound the redundancy
information when re-factorizing data and queries for preprocessing. These
changes aim to make our theory more closely linked to practice. We set two
complexity classes to denote the classes of Boolean queries that are
-tractable themselves and that can be made -tractable,
respectively. Based on a new factorization in our complexity classes, we
investigate two reductions, which differ from whether allowing re-factorizing
data and query parts. We verify the transitive and compatible properties of the
reductions and analysis the complete problems and sizes of the complexity
classes. We conclude that all PTIME classes of Boolean queries can be made
-tractable, similar to that of the -tractability theory. With
a little surprise, we prove that the set of all -tractable queries is
strictly smaller than that of all -tractable queries, and thus the set
of -tractable queries is properly contained in that of PTIME queries.
In this way, we attain a new complexity class inside the complexity class of
PTIME queries.Comment: 16 pages, 2 figure