323 research outputs found

Memory-based morphological analysis

Author: Daelemans W.
van den Bosch A.
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/1999

Tilburg University Repository

Improving sequence segmentation learning by predicting trigrams

Author: Daelemans W.
van den Bosch A.
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2005

Tilburg University Repository

Meta-Learning for Phonemic Annotation of Corpora

Author: Daelemans W.
Gillis S.
Hoste V.
Tjong Kim Sang E.F.
van den Bosch A.
Weigand H.
Publication date: 01/01/2000

We apply rule induction, classifier combination and meta-learning (stacked classifiers) to the problem of bootstrapping high-accuracy automatic annotation of corpora with pronunciation information. The task we address in this paper consists of generating phonemic representations reflecting the Flemish and Dutch pronunciations of a word on the basis of its orthographic representation (which in turn is based on the actual speech recordings). We compare several possible approaches to the text-to-pronunciation mapping task: memory-based learning, transformation-based learning, rule induction, maximum entropy modeling, combination of classifiers in stacked learning, and stacking of meta-learners. We are interested both in optimal accuracy and in obtaining insight into the linguistic regularities involved. As far as accuracy is concerned, an already high accuracy level for single classifiers (93% for Celex and 86% for Fonilex at word level) is boosted significantly, with additional error reductions of 31% and 38% respectively using combination of classifiers, and a further 5% using combination of meta-learners, bringing overall word-level accuracy to 96% for the Dutch variant and 92% for the Flemish variant. We also show that the application of machine learning methods indeed leads to increased insight into the linguistic regularities determining the variation between the two pronunciation variants studied.
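The accuracy figures in this abstract are stated as relative error reductions, which can be easy to misread as absolute gains. The sketch below shows the arithmetic for the classifier-combination step only, under the assumption that "error reduction" means a relative reduction of the word-level error rate. The function name `apply_error_reduction` is hypothetical, and because the abstract reports rounded percentages, the results are approximate and need not match the reported final figures exactly.

```python
def apply_error_reduction(accuracy: float, reduction: float) -> float:
    """Return the accuracy after the error rate shrinks by a relative fraction.

    Assumption: `reduction` is relative to the current error rate,
    e.g. 0.31 means the error rate drops to 69% of its previous value.
    """
    error = 1.0 - accuracy
    return 1.0 - error * (1.0 - reduction)


# Rounded single-classifier word-level accuracies quoted in the abstract.
celex_single = 0.93    # Dutch variant (Celex)
fonilex_single = 0.86  # Flemish variant (Fonilex)

# Classifier combination: 31% and 38% relative error reduction respectively.
celex_combined = apply_error_reduction(celex_single, 0.31)      # about 0.9517
fonilex_combined = apply_error_reduction(fonilex_single, 0.38)  # about 0.9132

print(f"Celex after combination:   {celex_combined:.4f}")
print(f"Fonilex after combination: {fonilex_combined:.4f}")
```

With the rounded inputs, combination alone accounts for most of the gap between the single-classifier accuracies and the final reported 96% and 92%; the remaining improvement is attributed in the abstract to the combination of meta-learners.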

arXiv.org e-Print Archive

CiteSeerX

Ghent University Academic Bibliography

Archivsystem Ask23

Institutional Repository Universiteit Antwerpen

Tilburg University Repository

Constraint Satisfaction Inference: Non-probabilistic Global Inference for Sequence Labelling

Author: Canisius S.V.M.
Daelemans W.
van den Bosch A.
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2006

Tilburg University Repository

Forgetting exceptions is harmful in language learning

Author: Daelemans W.
van den Bosch A.
Zavrel J.
Publication date: 01/01/1999

Tilburg University Repository

IGTree: Using trees for compression and classification in lazy learning algorithms

Author: Daelemans W.
van den Bosch A.
Weijters T.
Publication date: 01/01/1997

Tilburg University Repository

Discrete versus Probabilistic Sequence Classifiers for Domain-specific Entity Chunking

Author: Canisius S.V.M.
Daelemans W.
van den Bosch A.
Publication venue: Belgisch Nederlandse Ver. voor Kunstmatige Intelligentie
Publication date: 01/01/2006

Tilburg University Repository

Internal podalic version of second twin: Improving feet identification using a simulation model.

Author: Ceccaldi P.F.
Daelemans C.
Desseauve D.
Farin A.
Jauvion IBM
Publication date: 01/08/2022

Podalic version and breech extraction require high obstetrical expertise. Identifying fetal extremities is the first crucial step for trainees. When this skill is not sufficiently developed, the inter-twin delivery interval increases and the whole manoeuvre can even be jeopardized. We present a model for simulating and training this specific skill, using an obstetrical mannequin and 3D-printed hands and feet. Five feet and five hands (five rights and five lefts of each one) were printed in 3D after initial ultrasound acquisition of a near-term fetus. Each foot and hand was individually placed in a condom filled with 100 cc of water and closed with a knot. A Sophie's Mum Birth Simulator Version 4.0 from MODEL-med was placed on the edge of the table. Each hand and foot was inserted into the pelvic mannequin. The students' skills were evaluated using this model. A significant reduction in the overall mean time to extract the first foot and all the feet was observed at a three-month interval. This model is an option for training and assessing a crucial skill for version and breech extraction.

Serveur académique lausannois

Information for Authors

Author: Bosch A. van den
Busser G.J.
Canisius S.V.M.
Daelemans W.
Publication venue: Інститут проблем штучного інтелекту МОН України та НАН України
Publication date: 01/01/2007

We describe TADPOLE, a modular memory-based morphosyntactic tagger and dependency parser for Dutch. Though primarily aimed at being accurate, the design of the system is also driven by optimizing speed and memory usage, using a trie-based approximation of k-nearest neighbor classification as the basis of each module. We evaluate its three main modules: a part-of-speech tagger, a morphological analyzer, and a dependency parser, trained on manually annotated material available for Dutch – the parser is additionally trained on automatically parsed data. A global analysis of the system shows that it is able to process text in linear time, at an estimated rate close to 2,500 words per second, while maintaining sufficient accuracy.
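The idea of a trie-based approximation of nearest-neighbor classification can be illustrated with a toy sketch. It is not TADPOLE's implementation. It assumes features are already ordered from most to least important (an IGTree-style tree would order them by information gain, which is skipped here), stores training instances in a trie, and answers a query with the majority class of the deepest node whose path matches the query. The names `Node`, `build`, and `classify` and the training data are hypothetical.

```python
from collections import Counter


class Node:
    """Trie node holding class counts for all instances that pass through it."""

    def __init__(self):
        self.children = {}
        self.counts = Counter()

    def default_class(self):
        # Majority class among the training instances below this node.
        return self.counts.most_common(1)[0][0]


def build(instances):
    """Build a trie from (features, label) pairs; features are pre-ordered."""
    root = Node()
    for features, label in instances:
        node = root
        node.counts[label] += 1
        for value in features:
            node = node.children.setdefault(value, Node())
            node.counts[label] += 1
    return root


def classify(root, features):
    """Follow matching feature values as far as possible, then back off
    to the majority class of the last node reached."""
    node = root
    for value in features:
        child = node.children.get(value)
        if child is None:
            break
        node = child
    return node.default_class()


# Hypothetical toy data: (focus letter, next letter) -> phoneme-like class.
training = [
    (("c", "a"), "k"),
    (("c", "o"), "k"),
    (("c", "h"), "x"),
    (("c", "e"), "s"),
]
tree = build(training)

print(classify(tree, ("c", "h")))  # exact match in the trie
print(classify(tree, ("c", "u")))  # unseen second feature: backs off to node "c"
print(classify(tree, ("z", "z")))  # unseen first feature: backs off to the root
```

Each lookup visits at most one node per feature, so classification cost depends on the number of features rather than on the number of stored instances, which is the kind of trade-off between speed and exact nearest-neighbor search that the abstract alludes to.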

Наукова електронна бібліотека періодичних видань НАН України (Vernadsky National Library of Ukraine)

Institutional Repository Universiteit Antwerpen

Utrecht University Repository

Tilburg University Repository