Combined optimization of feature selection and algorithm parameters in machine learning of language

Daelemans, Walter; De Meulder, Fien; Hoste, Veronique; Naudts, Bart

research

Combined optimization of feature selection and algorithm parameters in machine learning of language

Authors: Walter Daelemans
Fien De Meulder
Veronique Hoste
Bart Naudts
Publication date: 1 January 2003
Publisher: 'Springer Science and Business Media LLC'
Doi

Abstract

Comparative machine learning experiments have become an important methodology in empirical approaches to natural language processing (i) to investigate which machine learning algorithms have the 'right bias' to solve specific natural language processing tasks, and (ii) to investigate which sources of information add to accuracy in a learning approach. Using automatic word sense disambiguation as an example task, we show that with the methodology currently used in comparative machine learning experiments, the results may often not be reliable because of the role of and interaction between feature selection and algorithm parameter optimization. We propose genetic algorithms as a practical approach to achieve both higher accuracy within a single approach, and more reliable comparisons

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Crossref

info:doi/10.1007%2F978-3-540-3...

Last time updated on 01/04/2019

Ghent University Academic Bibliography

oai:archive.ugent.be:598083

Last time updated on 12/11/2016