Skip to main content
Article thumbnail
Location of Repository

Modelling Prosodic Dynamics for Speaker Recognition

By Zdeněk Jančík

Abstract

Most current automatic speaker recognition system extract speaker-depend features by looking at short-term spectral information. This approach ignores long-term information. I explored approach that use the fundamental frequency and energy trajectories for each speaker. This approach models prosody dynamics on single fonemes or syllables. It is known from literature that prosodic systems do not work as well the acoustic one but it improve the system when fusing. I verified this assumption by fusing my results with state of the art acoustic system from BUT. Data from standard evaluation campaigns organized by National Institute of Standarts and Technology are used for all experiments

Topics: jazykový model; speaker validation; language model; n-gram; energy; pitch; ověření mluvčího; prosody; speaker identification; speaker recognition; prosodie; energie; rozpoznání mluvčího; bigram; identifikace mluvčího; základní tón
Publisher: Vysoké učení technické v Brně. Fakulta informačních technologií
Year: 2008
OAI identifier: oai:invenio.nusl.cz:235929
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://www.nusl.cz/ntk/nusl-23... (external link)
  • Suggested articles


    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.