
    The complexity of the Multiple Pattern Matching Problem for random strings

    We generalise a multiple string pattern matching algorithm, recently proposed by Fredriksson and Grabowski [J. Discr. Alg. 7, 2009], to deal with arbitrary dictionaries on an alphabet of size $s$. If $r_m$ is the number of words of length $m$ in the dictionary, and $\phi(r) = \max_m \ln(s\, m\, r_m)/m$, the complexity rate for the string characters to be read by this algorithm is at most $\kappa_{\mathrm{UB}}\, \phi(r)$ for some constant $\kappa_{\mathrm{UB}}$. On the other hand, we generalise the classical lower bound of Yao [SIAM J. Comput. 8, 1979], for the problem with a single pattern, to deal with arbitrary dictionaries, and determine it to be at least $\kappa_{\mathrm{LB}}\, \phi(r)$. This proves the optimality of the algorithm, improving and correcting previous claims.

    Comment: 25 pages, 4 figures
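    The rate $\phi(r)$ is straightforward to evaluate for a concrete dictionary. A minimal Python sketch, using only the definition quoted in the abstract; the sample dictionary, alphabet size, and function name are illustrative assumptions:

        import math
        from collections import Counter

        def phi(dictionary, s):
            """Complexity rate phi(r) = max over m of ln(s * m * r_m) / m,
            where r_m is the number of dictionary words of length m and
            s is the alphabet size."""
            r = Counter(len(w) for w in dictionary)
            return max(math.log(s * m * r_m) / m for m, r_m in r.items())

        # Illustrative dictionary over the 4-letter alphabet {a, c, g, t}
        words = ["acgt", "tta", "gattaca", "cgc"]
        print(phi(words, s=4))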

    Approximation errors of online sparsification criteria

    Many machine learning frameworks, such as resource-allocating networks, kernel-based methods, Gaussian processes, and radial-basis-function networks, require a sparsification scheme in order to address the online learning paradigm. For this purpose, several online sparsification criteria have been proposed to restrict the model definition to a subset of samples. The best-known criterion is the (linear) approximation criterion, which discards any sample that can be well represented by the already contributing samples, an operation with excessive computational complexity. Several computationally efficient sparsification criteria have been introduced in the literature, such as the distance, the coherence and the Babel criteria. In this paper, we provide a framework that connects these sparsification criteria to the issue of approximating samples, by deriving theoretical bounds on the approximation errors. Moreover, we investigate the error of approximating any feature, by proposing upper bounds on the approximation error for each of the aforementioned sparsification criteria. Two classes of features are described in detail: the empirical mean and the principal axes in kernel principal component analysis.

    Comment: 10 pages
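    To make the coherence criterion concrete: a new sample is admitted to the dictionary only if its maximal kernel correlation with every current dictionary element stays below a threshold. A minimal sketch with a Gaussian kernel; the threshold mu0, the bandwidth sigma, and the function names are illustrative, not the paper's notation:

        import numpy as np

        def gaussian_kernel(x, y, sigma=1.0):
            # Unit-norm kernel: kappa(x, x) = 1
            return np.exp(-np.linalg.norm(x - y) ** 2 / (2 * sigma ** 2))

        def coherence_sparsification(stream, mu0=0.5, sigma=1.0):
            """Keep a sample only if its coherence (maximal absolute kernel
            value) with the current dictionary stays below mu0."""
            dictionary = []
            for x in stream:
                if all(abs(gaussian_kernel(x, d, sigma)) <= mu0 for d in dictionary):
                    dictionary.append(x)
            return dictionary

        rng = np.random.default_rng(0)
        samples = rng.normal(size=(200, 2))
        D = coherence_sparsification(samples, mu0=0.5)
        print(len(D), "of", len(samples), "samples retained")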

    Analyzing sparse dictionaries for online learning with kernels

    Many signal processing and machine learning methods share essentially the same linear-in-the-parameter model, with as many parameters as available samples, as in kernel-based machines. Sparse approximation is essential in many disciplines, with new challenges emerging in online learning with kernels. To this end, several sparsity measures have been proposed in the literature to quantify sparse dictionaries and to construct relevant ones, the most prominent being the distance, the approximation, the coherence and the Babel measures. In this paper, we analyze sparse dictionaries based on these measures. By conducting an eigenvalue analysis, we show that these sparsity measures share many properties, including the linear independence condition and inducing a well-posed optimization problem. Furthermore, we prove that there exists a quasi-isometry between the parameter (i.e., dual) space and the dictionary's induced feature space.

    Comment: 10 pages
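    The eigenvalue analysis can be illustrated on the dictionary's Gram matrix: with a unit-norm kernel, Gershgorin's theorem confines the eigenvalues to [1 - mu1, 1 + mu1], where mu1 is the Babel measure, so mu1 < 1 forces linear independence and a well-conditioned problem. A minimal sketch under these standard definitions; the Gaussian kernel and helper names are assumptions for illustration:

        import numpy as np

        def gram(X, sigma=1.0):
            # Gaussian Gram matrix; diagonal entries equal 1 (unit-norm atoms)
            sq = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
            return np.exp(-sq / (2 * sigma ** 2))

        def coherence_and_babel(K):
            """Coherence: largest off-diagonal |K_ij|.
            Babel: largest off-diagonal absolute row sum."""
            A = np.abs(K - np.eye(len(K)))
            return A.max(), A.sum(axis=1).max()

        X = np.random.default_rng(1).normal(size=(10, 3))
        K = gram(X)
        mu, babel = coherence_and_babel(K)
        eigs = np.linalg.eigvalsh(K)
        # Gershgorin: all eigenvalues lie in [1 - babel, 1 + babel], so
        # babel < 1 guarantees K is positive definite (independent atoms)
        print(mu, babel, eigs.min(), eigs.max())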

    Meta learning of bounds on the Bayes classifier error

    Meta learning uses information from base learners (e.g. classifiers or estimators) as well as information about the learning problem to improve upon the performance of a single base learner. For example, the Bayes error rate of a given feature space, if known, can be used to aid in choosing a classifier, as well as in feature selection and model selection for the base classifiers and the meta classifier. Recent work in the field of f-divergence functional estimation has led to the development of simple and rapidly converging estimators that can be used to estimate various bounds on the Bayes error. We estimate multiple bounds on the Bayes error using an estimator that applies meta learning to slowly converging plug-in estimators to obtain the parametric convergence rate. We compare the estimated bounds empirically on simulated data and then estimate the tighter bounds on features extracted from an image patch analysis of sunspot continuum and magnetogram images.

    Comment: 6 pages, 3 figures, to appear in the proceedings of the 2015 IEEE Signal Processing and SP Education Workshop
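    One classical pair of such bounds comes from the Bhattacharyya coefficient BC = integral of sqrt(p0 * p1): for equal priors, (1 - sqrt(1 - BC^2)) / 2 <= err <= BC / 2. The sketch below estimates BC by Monte Carlo for two known densities, whereas the paper estimates such f-divergence functionals nonparametrically from samples; the densities and function name here are illustrative assumptions:

        import numpy as np
        from scipy.stats import norm

        def bhattacharyya_bounds(p0, p1, n=100_000, rng=None):
            """Monte Carlo estimate of BC = E_{x~p0}[sqrt(p1(x)/p0(x))],
            then the classical Bayes-error bounds for equal priors:
                (1 - sqrt(1 - BC^2)) / 2  <=  err  <=  BC / 2
            """
            rng = rng or np.random.default_rng(0)
            x = p0.rvs(size=n, random_state=rng)
            bc = np.mean(np.sqrt(p1.pdf(x) / p0.pdf(x)))
            lower = 0.5 * (1 - np.sqrt(max(0.0, 1 - bc ** 2)))
            return lower, 0.5 * bc

        # Two unit-variance Gaussians one standard deviation apart;
        # the true Bayes error here is Phi(-0.5) ~ 0.309
        lo, hi = bhattacharyya_bounds(norm(0, 1), norm(1, 1))
        print(f"Bayes error in [{lo:.3f}, {hi:.3f}]")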