Search CORE

1,123 research outputs found

Learning Unions of $\omega(1)$ -Dimensional Rectangles

Author: Aizenstein
Alp Atıcı
Beimel
Bruck
Chen
Chen
Freund
Freund
Hajnal
Jackson
Khardon
Klivans
Krause
Kushilevitz
Maass
Pisier
Rocco A. Servedio
Schapire
Servedio
Publication venue: 'Elsevier BV'
Publication date: 26/06/2007
Field of study

We consider the problem of learning unions of rectangles over the domain

[b]^n

, in the uniform distribution membership query learning setting, where both b and n are "large". We obtain poly

(n, \log b)

-time algorithms for the following classes: - poly

(n \log b)

-way Majority of

O(\frac{\log(n \log b)} {\log \log(n \log b)})

-dimensional rectangles. - Union of poly

(\log(n \log b))

many

O(\frac{\log^2 (n \log b)} {(\log \log(n \log b) \log \log \log (n \log b))^2})

-dimensional rectangles. - poly

(n \log b)

-way Majority of poly

(n \log b)

-Or of disjoint

O(\frac{\log(n \log b)} {\log \log(n \log b)})

-dimensional rectangles. Our main algorithmic tool is an extension of Jackson's boosting- and Fourier-based Harmonic Sieve algorithm [Jackson 1997] to the domain

[b]^n

, building on work of [Akavia, Goldwasser, Safra 2003]. Other ingredients used to obtain the results stated above are techniques from exact learning [Beimel, Kushilevitz 1998] and ideas from recent work on learning augmented

AC^{0}

circuits [Jackson, Klivans, Servedio 2002] and on representing Boolean functions as thresholds of parities [Klivans, Servedio 2001].Comment: 25 pages. Some corrections. Recipient of E. M. Gold award ALT 2006. To appear in Journal of Theoretical Computer Scienc

arXiv.org e-Print Archive

CiteSeerX

Elsevier - Publisher Connector

Crossref

Complexity of Equivalence and Learning for Multiplicity Tree Automata

Author: A. Beimel
A. Habrard
A.R. Klivans
D. Angluin
E. Allender
H. Seidl
S. Bozapalidis
Publication venue
Publication date: 01/01/2014
Field of study

We consider the complexity of equivalence and learning for multiplicity tree automata, i.e., weighted tree automata over a field. We first show that the equivalence problem is logspace equivalent to polynomial identity testing, the complexity of which is a longstanding open problem. Secondly, we derive lower bounds on the number of queries needed to learn multiplicity tree automata in Angluin's exact learning model, over both arbitrary and fixed fields. Habrard and Oncina (2006) give an exact learning algorithm for multiplicity tree automata, in which the number of queries is proportional to the size of the target automaton and the size of a largest counterexample, represented as a tree, that is returned by the Teacher. However, the smallest tree-counterexample may be exponential in the size of the target automaton. Thus the above algorithm does not run in time polynomial in the size of the target automaton, and has query complexity exponential in the lower bound. Assuming a Teacher that returns minimal DAG representations of counterexamples, we give a new exact learning algorithm whose query complexity is quadratic in the target automaton size, almost matching the lower bound, and improving the best previously-known algorithm by an exponential factor

arXiv.org e-Print Archive

CiteSeerX

Crossref

Oxford University Research Archive

Inferring Symbolic Automata

Author: Fisman Dana
Frenkel Hadar
Zilles Sandra
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 30th EACSL Annual Conference on Computer Science Logic (CSL 2022)
Publication date: 01/01/2022
Field of study

We study the learnability of symbolic finite state automata, a model shown useful in many applications in software verification. The state-of-the-art literature on this topic follows the query learning paradigm, and so far all obtained results are positive. We provide a necessary condition for efficient learnability of SFAs in this paradigm, from which we obtain the first negative result. The main focus of our work lies in the learnability of SFAs under the paradigm of identification in the limit using polynomial time and data. We provide a necessary condition and a sufficient condition for efficient learnability of SFAs in this paradigm, from which we derive a positive and a negative result

Dagstuhl Research Online Publication Server

Active Learning with Multiple Views

Author: Knoblock C. A.
Minton S.
Muslea I.
Publication venue: 'AI Access Foundation'
Publication date: 05/10/2011
Field of study

Active learners alleviate the burden of labeling large amounts of data by detecting and asking the user to label only the most informative examples in the domain. We focus here on active learning for multi-view domains, in which there are several disjoint subsets of features (views), each of which is sufficient to learn the target concept. In this paper we make several contributions. First, we introduce Co-Testing, which is the first approach to multi-view active learning. Second, we extend the multi-view learning framework by also exploiting weak views, which are adequate only for learning a concept that is more general/specific than the target concept. Finally, we empirically show that Co-Testing outperforms existing active learners on a variety of real world domains such as wrapper induction, Web page classification, advertisement removal, and discourse tree parsing

arXiv.org e-Print Archive

Crossref

Learning from different teachers and imperfect queries

Author: Kriķis Mārtiņš
Publication venue
Publication date: 01/01/1998
Field of study

E-resource repository of the University of Latvia

Inferring Symbolic Automata

Author: Fisman Dana
Frenkel Hadar
Zilles Sandra
Publication venue
Publication date: 01/01/2022
Field of study

arXiv.org e-Print Archive

CISPA – Helmholtz-Zentrum für Informationssicherheit

Episciences.org

Dagstuhl Research Online Publication Server

Four Lessons in Versatility or How Query Languages Adapt to the Web

Author: A. Bonifati
A. Gelder van
A. Polleres
A. Polleres
A.C. Klug
B. Adida
B. Cooper
B. Jenner
D. Olteanu
D. Olteanu
D. Recordon
D.D. Chamberlin
D.R. Fulkerson
E. Augurusa
F. Bry
F. Bry
F. Bry
F. Wei
F. Weigel
G. Gottlob
G. Karvounarakis
H. Björklund
H. Garcia-Molina
H. Meuss
H. Meuss
H. Przymusinska
H. Tamaki
H. Wang
H.V. Jagadish
J. Bailey
J. Euzenat
J. Pérez
J. Pérez
J. Pérez
J.D. Ullman
J.J. Carroll
J.V.D. Bussche
K. Kochut
K.A. Ross
K.R. Apt
K.S. Booth
L. Cabibbo
M. Habib
M. Kay
M. Marx
M. Marx
N. Bruno
N. Walsh
P. Boncz
P. Buneman
P. Cholak
P. O’Neil
P.G. Kolaitis
P.P. Schneider
R. Agrawal
R. Fagin
R. Goldman
R. Hull
R. Khare
R. Khare
R. Schenkel
S. Abiteboul
S. Abiteboul
S. Abiteboul
S. Al-Khalifa
S. Berger
S. Groppe
S. Trißl
T. Chen
T. Furche
T. Grust
T. Schwentick
T.C. Przymusinski
U. Assmann
W. Akhtar
W. Chen
W.L. Hsu
W.L. Hsu
Z. Chen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Exposing not only human-centered information, but machine-processable data on the Web is one of the commonalities of recent Web trends. It has enabled a new kind of applications and businesses where the data is used in ways not foreseen by the data providers. Yet this exposition has fractured the Web into islands of data, each in different Web formats: Some providers choose XML, others RDF, again others JSON or OWL, for their data, even in similar domains. This fracturing stifles innovation as application builders have to cope not only with one Web stack (e.g., XML technology) but with several ones, each of considerable complexity. With Xcerpt we have developed a rule- and pattern based query language that aims to give shield application builders from much of this complexity: In a single query language XML and RDF data can be accessed, processed, combined, and re-published. Though the need for combined access to XML and RDF data has been recognized in previous work (including the W3C’s GRDDL), our approach differs in four main aspects: (1) We provide a single language (rather than two separate or embedded languages), thus minimizing the conceptual overhead of dealing with disparate data formats. (2) Both the declarative (logic-based) and the operational semantics are unified in that they apply for querying XML and RDF in the same way. (3) We show that the resulting query language can be implemented reusing traditional database technology, if desirable. Nevertheless, we also give a unified evaluation approach based on interval labelings of graphs that is at least as fast as existing approaches for tree-shaped XML data, yet provides linear time and space querying also for many RDF graphs. We believe that Web query languages are the right tool for declarative data access in Web applications and that Xcerpt is a significant step towards a more convenient, yet highly efficient data access in a “Web of Data”

CiteSeerX

Crossref

Open Access LMU

Characterising Modal Formulas with Examples

Author: Cate Balder ten
Koudijs Raoul
Publication venue
Publication date: 13/04/2023
Field of study

We study the existence of finite characterisations for modal formulas. A finite characterisation of a modal formula

\varphi

is a finite collection of positive and negative examples that distinguishes

\varphi

from every other, non-equivalent modal formula, where an example is a finite pointed Kripke structure. This definition can be restricted to specific frame classes and to fragments of the modal language: a modal fragment

L

admits finite characterisations with respect to a frame class

F

if every formula

\varphi\in L

has a finite characterisation with respect to

L

consting of examples that are based on frames in

F

. Finite characterisations are useful for illustration, interactive specification, and debugging of formal specifications, and their existence is a precondition for exact learnability with membership queries. We show that the full modal language admits finite characterisations with respect to a frame class

F

only when the modal logic of

F

is locally tabular. We then study which modal fragments, freely generated by some set of connectives, admit finite characterisations. Our main result is that the positive modal language without the truth-constants

\top

and

\bot

admits finite characterisations w.r.t. the class of all frames. This result is essentially optimal: finite characterizability fails when the language is extended with the truth constant

\bot

or with all but very limited forms of negation.Comment: Expanded version of material from Raoul Koudijs's MSc thesis (2022

arXiv.org e-Print Archive