Search CORE

155 research outputs found

A comparison of ID3 and backpropagation for English text-to-speech mapping

Author: Bakiri G.
Dietterich T.G.
Hild Hermann
Publication venue
Publication date: 02/08/2007
Field of study

KITopen

A comparison of ID3 and backpropagation for English text-to-speech mapping

Author: C. R. Rosenberg
D. E. Rumelhart
D. Klatt
G. Bakiri
G. L. Martin
Ghulum Bakiri
Hermann Hild
J. L. McClelland
J. M. Lucassen
J. Mingers
J. R. Quinlan
J. R. Quinlan
J. R. Quinlan
J. R. Quinlan
K. J. Lang
L. Breiman
T. G. Dietterich
T. G. Dietterich
T. G. Dietterich
T. J. Sejnowski
Thomas G. Dietterich
W. Buntine
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Recommended from our members

A comparison of ID3 and backpropagation for English text-to-speech mapping

Author: Bakiri Ghulum
Dietterich Thomas G.
Hild Hermann
Publication venue: Oregon State University. Department of Computer Science
Publication date
Field of study

The performance of the error backpropagation (BP) and 1D3 learning algorithms was com- pared on the task of mapping English text to phonemes and stresses. Under the distributed output code developed by Sejnowski and Rosenberg, it is shown that BP consistently out- performs ID3 on this task by several percentage points. Three hypotheses explaining this difference were explored: (a) ID3 is overfitting the training data, (b) BP is able to share hidden units across several output units and hence can learn the output units better, and (c) BP captures statistical information that 1D3 does not. We conclude that only hypothesis (c) is correct. By augmenting ID3 with a simple statistical learning procedure, the performance of BP can be approached but not matched. More complex statistical procedures can improve the performance of both BP and 1D3 substantially. A study of the residual errors suggests that there is still substantial room for improvement in learning methods for text-to-speech mapping

ScholarsArchive@OSU

Self-growing neural network architecture using crisp and fuzzy entropy

Author: Cios Krzysztof J.
Publication venue
Publication date
Field of study

The paper briefly describes the self-growing neural network algorithm, CID2, which makes decision trees equivalent to hidden layers of a neural network. The algorithm generates a feedforward architecture using crisp and fuzzy entropy measures. The results of a real-life recognition problem of distinguishing defects in a glass ribbon and of a benchmark problem of differentiating two spirals are shown and discussed

NASA Technical Reports Server

Recommended from our members

An overview of some recent machine learning research and applications

Author: Metfessel Brent A.
Paul
Publication venue: eScholarship, University of California
Publication date: 27/09/1990
Field of study

eScholarship - University of California

A comparative study of ID3 and backpropagation for English text-to-speech mapping

Author: Bakiri G.
Dietterich T.G.
Hild Hermann
Publication venue
Publication date: 02/08/2007
Field of study

KITopen

Visual pathways from the perspective of cost functions and multi-task deep neural networks

Author: Bohte Sander M.
de Haan Edward H. F.
Losch Max M.
Ramakrishnan Kandan
Scholte H. Steven
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

Vision research has been shaped by the seminal insight that we can understand the higher-tier visual cortex from the perspective of multiple functional pathways with different goals. In this paper, we try to give a computational account of the functional organization of this system by reasoning from the perspective of multi-task deep neural networks. Machine learning has shown that tasks become easier to solve when they are decomposed into subtasks with their own cost function. We hypothesize that the visual system optimizes multiple cost functions of unrelated tasks and this causes the emergence of a ventral pathway dedicated to vision for perception, and a dorsal pathway dedicated to vision for action. To evaluate the functional organization in multi-task deep neural networks, we propose a method that measures the contribution of a unit towards each task, applying it to two networks that have been trained on either two related or two unrelated tasks, using an identical stimulus set. Results show that the network trained on the unrelated tasks shows a decreasing degree of feature representation sharing towards higher-tier layers while the network trained on related tasks uniformly shows high degree of sharing. We conjecture that the method we propose can be used to analyze the anatomical and functional organization of the visual system and beyond. We predict that the degree to which tasks are related is a good descriptor of the degree to which they share downstream cortical-units.Comment: 16 pages, 5 figure

arXiv.org e-Print Archive

CWI's Institutional Repository

Recommended from our members

A comparative study of ID3 and backpropagation for English text-to-speech mapping

Author: Bakiri Ghulum
Dietterich Thomas G.
Hild Hermann
Publication venue: Oregon State University. Department of Computer Science
Publication date
Field of study

The performance of the error backpropagation (BP) and ID3 learning algorithms was compared on the task of mapping English text to phonemes and stresses. Under the distributed output code developed by Sejnowski and Rosenberg, it is shown that BP consistently out-performs 1D3 on this task by several percentage points. Three hypotheses explaining this difference were explored: (a) ID3 is overfitting the training data, (b) BP is able to share hidden units across several output units and hence can learn the output units better, and (c) BP captures statistical information that ID3 does not. We conclude that only hypothesis (c) is correct. By augmenting ID3 with a simple statistical learning procedure, the performance of BP can be approached but not matched. More complex statistical procedures can improve the performance of both BP and ID3 substantially. A study of the residual errors suggests that there is still substantial room for improvement in learning methods for text-to-speech mapping

ScholarsArchive@OSU

Do not forget: Full memory in memory-based learning of word pronunciation

Author: Bosch Antal van den
Daelemans Walter
Publication venue
Publication date: 01/01/1998
Field of study

Memory-based learning, keeping full memory of learning material, appears a viable approach to learning NLP tasks, and is often superior in generalisation accuracy to eager learning approaches that abstract from learning material. Here we investigate three partial memory-based learning approaches which remove from memory specific task instance types estimated to be exceptional. The three approaches each implement one heuristic function for estimating exceptionality of instance types: (i) typicality, (ii) class prediction strength, and (iii) friendly-neighbourhood size. Experiments are performed with the memory-based learning algorithm IB1-IG trained on English word pronunciation. We find that removing instance types with low prediction strength (ii) is the only tested method which does not seriously harm generalisation accuracy. We conclude that keeping full memory of types rather than tokens, and excluding minority ambiguities appear to be the only performance-preserving optimisations of memory-based learning.Comment: uses conll98, epsf, and ipamacs (WSU IPA

arXiv.org e-Print Archive

CiteSeerX

Institutional Repository Universiteit Antwerpen

Tilburg University Repository

Comparative Experiments on Disambiguating Word Senses: An Illustration of the Role of Bias in Machine Learning

Author: Mooney Raymond J.
Publication venue
Publication date: 01/01/1996
Field of study

This paper describes an experimental comparison of seven different learning algorithms on the problem of learning to disambiguate the meaning of a word from context. The algorithms tested include statistical, neural-network, decision-tree, rule-based, and case-based classification techniques. The specific problem tested involves disambiguating six senses of the word ``line'' using the words in the current and proceeding sentence as context. The statistical and neural-network methods perform the best on this particular problem and we discuss a potential reason for this observed difference. We also discuss the role of bias in machine learning and its importance in explaining performance differences observed on specific problems.Comment: 10 page

arXiv.org e-Print Archive

CiteSeerX