We discuss the problem of learning a deterministic finite automaton (DFA)
from a confidence oracle. That is, we are given access to an oracle Q with
incomplete knowledge of some target language L over an alphabet Ξ£; the
oracle maps a string xβΞ£β to a score in the interval [β1,1]
indicating its confidence that the string is in the language. The
interpretation is that the sign of the score signifies whether xβL, while
the magnitude β£Q(x)β£ represents the oracle's confidence. Our goal is to learn
a DFA representation of the oracle that preserves the information that it is
confident in. The learned DFA should closely match the oracle wherever it is
highly confident, but it need not do this when the oracle is less sure of
itself