
7 research outputs found

RECONOCIMIENTO DE COMANDOS DE VOZ USANDO LA TRANSFORMADA WAVELET Y MÁQUINAS DE VECTORES DE SOPORTE [Voice command recognition using the wavelet transform and support vector machines]

Author: IBARGÜEN FRANCISCO J.
MARÍN JORGE I.
MUÑOZ PABLO A.
Publication venue: 'Universidad Tecnologica de Pereira - UTP'
Publication date: 17/08/2006

This article presents a comparative analysis of an artificial neural network classifier and a support vector machine for a voice command recognition application whose feature extraction is based on wavelet packets. To evaluate the performance of both systems, tests were run while varying the extraction scheme, the architecture of the artificial neural network, and the kernel function of the support vector machines. Both classification schemes are found to perform similarly, with accuracy rates above 96%.

Revistas UTP

Clasificación de voces normales y patológicas empleando la transformada wavelet [Classification of normal and pathological voices using the wavelet transform]

Author: Rodríguez Villalobos Reinaldo
Publication venue: Universidad Tecnológica de Bolívar
Publication date: 01/01/2006

This work presents the development of a methodology for a system that classifies normal and pathological voices, using the wavelet transform as the analysis tool because it is well suited to non-stationary signals such as speech, whose spectral content varies over time. Multilayer perceptron neural networks, trained with backpropagation and the Levenberg-Marquardt optimization method, are proposed for the classification stage. The experimental framework for each stage of the proposed methodology is described; with it, satisfactory results were obtained in classifying normal and pathological voices. The algorithms and the graphical interface, developed on the MATLAB 5.3 platform, are also presented in detail. Includes bibliography.

Universidad Tecnológica de Bolívar: Repositorio Digital

Interactive speech-driven facial animation

Author: Hodgkinson Warren
Publication date: 18/07/2008

One of the fastest-developing areas in the entertainment industry is digital animation. Television programmes and movies frequently use 3D animations to enhance or replace actors and scenery. With the increase in computing power, research is also being done on applying these animations interactively. Two of the biggest obstacles to the success of these undertakings are control (manipulating the models) and realism. This text describes many ways to improve control and realism so that interactive animation becomes possible. Specifically, lip-synchronisation (driven by human speech) and various modeling and rendering techniques are discussed. A prototype showing that interactive animation is feasible is also described. Mr. A. Hardy Prof. S. von Solm

University of Johannesburg Institutional Repository

The Use Of Wavelet Transforms In Phoneme Recognition

Author: Beng T. Tan
Minyue Fu
Phillip Dermody

This study investigates the usefulness of wavelet transforms in phoneme recognition. Both discrete wavelet transforms (DWT) and sampled continuous wavelet transforms (SCWT) are tested. The wavelet transform is used as part of the front-end processor, which extracts feature vectors for a speaker-independent HMM-based phoneme recognizer. The results are evaluated on a portion of the TIMIT corpus consisting of 30293 phoneme tokens for training and 14489 phoneme tokens for testing. The test results suggest that SCWT gives a considerably better recognition rate than DWT. On the other hand, the improvement of SCWT over Mel-scale cepstral coefficients appears to be marginal.

1. INTRODUCTION

The wavelet transform (WT) theory provides an alternative tool for short-time analysis of quasi-stationary signals, such as speech, as opposed to the traditional short-time Fourier transform (STFT). WT has been applied widely in different speech analysis problems [16, 8, 9, 7, 3]. Scalograms produced by WT an..

CiteSeerX