11 research outputs found

    Position Models and Language Modeling

    Full text link
    In statistical language modelling the classic model is the n-gram. This model is not able, however, to capture long-term dependencies, i.e. dependencies longer than n. An alternative to this model is the probabilistic automaton. Unfortunately, preliminary experiments on the use of this model in language modelling show that it is not yet competitive, partly because it tries to model dependencies that are too long. We propose here to improve the use of this model by restricting the dependency length to a more reasonable value. Experiments show a 45% reduction in the perplexity obtained on the Wall Street Journal language modeling task.
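
    As a point of reference for the perplexity figure above, here is a minimal sketch (not the paper's system) of a bigram language model with add-one smoothing, together with the perplexity measure the abstract reports on; the toy corpus is invented for illustration.

```python
# Minimal bigram language model with add-one smoothing (illustrative only).
from collections import defaultdict
import math

def train_bigram(sentences):
    """Count unigram and bigram occurrences over tokenized sentences."""
    unigrams, bigrams, vocab = defaultdict(int), defaultdict(int), set()
    for tokens in sentences:
        padded = ["<s>"] + tokens + ["</s>"]
        vocab.update(padded)
        for w1, w2 in zip(padded, padded[1:]):
            unigrams[w1] += 1
            bigrams[(w1, w2)] += 1
    return unigrams, bigrams, vocab

def perplexity(sentences, unigrams, bigrams, vocab):
    """Perplexity = exp(-average log-probability per predicted token)."""
    log_prob, n_tokens = 0.0, 0
    v = len(vocab)
    for tokens in sentences:
        padded = ["<s>"] + tokens + ["</s>"]
        for w1, w2 in zip(padded, padded[1:]):
            # Add-one (Laplace) smoothing keeps unseen bigrams non-zero.
            p = (bigrams.get((w1, w2), 0) + 1) / (unigrams.get(w1, 0) + v)
            log_prob += math.log(p)
            n_tokens += 1
    return math.exp(-log_prob / n_tokens)

train = [["the", "cat", "sat"], ["the", "dog", "sat"]]
u, b, v = train_bigram(train)
print(perplexity([["the", "cat", "sat"]], u, b, v))  # lower is better
```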

    Efficient Pruning of Probabilistic Automata

    No full text
    Applications of probabilistic grammatical inference are limited by time and space constraints. In statistical language modeling, for example, large corpora are now available and lead to automata with millions of states. We propose in this article a method for pruning automata (when restricted to tree-based structures) which is not only efficient (sub-quadratic) but also dramatically reduces the size of the automaton with little impact on the underlying distribution. Results are evaluated on a language modeling task.
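
    A hedged sketch of the general idea, not the authors' algorithm: on a tree-shaped automaton (here a probabilistic prefix tree), pruning by a count threshold takes a single pass over the tree, and the removed counts bound the probability mass the reduction can move. The class names and toy data are invented for the example.

```python
# Count-threshold pruning of a probabilistic prefix tree (illustrative only).
class Node:
    def __init__(self):
        self.count = 0      # how many training strings pass through here
        self.children = {}  # symbol -> Node

def insert(root, string):
    """Add one training string to the tree, incrementing counts."""
    node = root
    node.count += 1
    for sym in string:
        node = node.children.setdefault(sym, Node())
        node.count += 1

def prune(node, threshold):
    """Remove subtrees seen fewer than `threshold` times.
    One pass over the tree, so linear in its size."""
    for sym in list(node.children):
        child = node.children[sym]
        if child.count < threshold:
            del node.children[sym]  # this mass folds back into the parent
        else:
            prune(child, threshold)

root = Node()
for s in ["ab", "ab", "ac", "abd"]:
    insert(root, s)
prune(root, threshold=2)
print(sorted(root.children))  # rare branches ('c', 'd' suffixes) removed
```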

    Use of Grammatical Inference in Natural Speech Recognition

    No full text
    This paper presents the application of stochastic grammatical inference to speech recognition. In speech recognition, processing of the acoustic signal produces a set of words which are combined to build sentences. Language models are then used to guide the speech recognition application toward the most pertinent combination. Up to now, statistical language models have been used. We suggest using stochastic formal grammars instead of statistical models. These stochastic grammars will be built by machine learning algorithms. We first show that unaided grammatical inference cannot be used for speech recognition. We then show that smoothing is necessary and demonstrate the gain that one can obtain by using a basic smoothing. We finally put forward a smoothing technique dedicated to stochastic formal grammars.
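
    To illustrate what a "basic smoothing" of the kind compared against here can look like, below is a minimal sketch, assuming simple linear interpolation with a uniform backoff over the probabilities of inferred grammar rules; the rule names and the mixing weight are invented for the example.

```python
# Linear-interpolation smoothing of rule probabilities (illustrative only).
def smoothed_prob(counts, event, vocab_size, lam=0.9):
    """P(event) = lam * maximum-likelihood estimate + (1 - lam) * uniform.
    Unseen events keep a non-zero probability via the uniform backoff."""
    total = sum(counts.values())
    ml = counts.get(event, 0) / total if total else 0.0
    return lam * ml + (1 - lam) / vocab_size

rule_counts = {"S->NP VP": 8, "S->VP": 2}
print(smoothed_prob(rule_counts, "S->NP VP", vocab_size=50))  # seen rule
print(smoothed_prob(rule_counts, "S->PP", vocab_size=50))     # unseen rule
```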

    A Markovian approach to the induction of regular string distributions

    No full text
    We propose in this paper a novel approach to the induction of the structure of Hidden Markov Models (HMMs). The notion of partially observable Markov models (POMMs) is introduced. POMMs form a particular case of HMMs where any state emits a single letter with probability one, but several states can emit the same letter. It is shown that any HMM can be represented by an equivalent POMM. The proposed induction algorithm aims at finding a POMM fitting a sample drawn from an unknown target POMM. The induced model is built to fit the dynamics of the target machine observed in the sample. A POMM is seen as a lumped process of a Markov chain and the induced POMM is constructed to best approximate the stationary distribution and the mean first passage times (MFPT) observed in the sample. The induction relies on iterative state splitting from an initial maximum likelihood model. The transition probabilities of the updated model are found by solving an optimization problem to minimize the difference between the observed MFPT and their values computed in the induced model.
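
    The two statistics the induced POMM is fitted to are standard Markov-chain quantities. Below is a hedged sketch of how both can be computed directly from a transition matrix; this is a textbook computation, not the paper's induction algorithm, and the example chain is invented.

```python
# Stationary distribution and mean first passage times of a Markov chain.
import numpy as np

def stationary(P):
    """Left eigenvector of P for eigenvalue 1, normalized to sum to 1."""
    vals, vecs = np.linalg.eig(P.T)
    v = np.real(vecs[:, np.argmin(np.abs(vals - 1.0))])
    return v / v.sum()

def mfpt(P):
    """M[i, j] = expected number of steps to first reach j from i.
    Solves M[i, j] = 1 + sum_{k != j} P[i, k] * M[k, j] for each target j.
    Diagonal entries are left at zero (return times are not computed)."""
    n = P.shape[0]
    M = np.zeros((n, n))
    for j in range(n):
        idx = [i for i in range(n) if i != j]
        A = np.eye(n - 1) - P[np.ix_(idx, idx)]
        M[idx, j] = np.linalg.solve(A, np.ones(n - 1))
    return M

P = np.array([[0.9, 0.1],
              [0.5, 0.5]])
print(stationary(P))  # ~[0.833, 0.167]
print(mfpt(P))        # e.g. ~10 steps from state 0 to state 1
```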

    Approximate Reduction of Finite Automata for Attack Detection in High-Speed Networks

    No full text
    We consider the problem of approximate reduction of non-deterministic automata that appear in hardware-accelerated network intrusion detection systems (NIDSes). We define the error distance of a reduced automaton from the original one as the probability of packets being incorrectly classified by the reduced automaton (with respect to the probabilistic distribution of packets in the network traffic). We use this notion to design an approximate reduction procedure that achieves a great size reduction (much beyond the state-of-the-art language-preserving techniques) with a controlled and small error. We have implemented our approach and evaluated it on use cases from Snort, a popular NIDS. Our results provide experimental evidence that the method can be highly efficient in practice, allowing NIDSes to follow the rapid growth in the speed of networks.
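
    A hedged sketch of the core idea only, not the paper's actual procedure: weight each state by the probability that traffic reaches it, drop low-weight states, and use the dropped weight as an upper bound on the misclassification error. All state names and probabilities below are invented.

```python
# Probability-guided state pruning for an NFA (illustrative only).
def reduce_automaton(states, transitions, reach_prob, epsilon):
    """Keep states whose reachability probability is at least epsilon.
    `reach_prob` maps state -> probability under the traffic model;
    the total dropped weight bounds the classification error."""
    kept = {s for s in states if reach_prob[s] >= epsilon}
    error_bound = sum(reach_prob[s] for s in states if s not in kept)
    reduced = {(p, a, q) for (p, a, q) in transitions
               if p in kept and q in kept}
    return kept, reduced, error_bound

states = {"q0", "q1", "q2", "q3"}
transitions = {("q0", "a", "q1"), ("q1", "b", "q2"), ("q1", "c", "q3")}
reach_prob = {"q0": 1.0, "q1": 0.6, "q2": 0.55, "q3": 0.01}
kept, reduced, err = reduce_automaton(states, transitions, reach_prob, 0.05)
print(kept, err)  # q3 dropped; error bounded by 0.01
```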