Search CORE

8 research outputs found

Using MATLAB for Analysis of TRAP System

Author: Grezl F.
Karafiat M.
Publication venue: Společnost pro radioelektronické inženýrství
Publication date: 01/12/2003
Field of study

This article describes a Matlab function for reading and processing file outputs from a structure of classifiers. These classifiers - neural nets - are used in speech recognition based on temporal trajectories (TRAP) of energy in frequency bands. This nonstandard approach is introduced and the program is presented. The utility of resulting figures is enhanced by the possibility of reading and displaying results from all critical bands at once. Resulting analyses are more focused on the reliability of classifiers than on the basic accuracy measure. The first uses colors and their depth to display both cues, reliability and accuracy, in one informative picture. Others are focused on more precise measures, where it is possible to precisely define classifier mistakes

Directory of Open Access Journals

Digital library of Brno University of Technology

Distributed Speech Recognition

Author: Burget L.
Grezl F.
Jain P.
Motlicek P.
Publication venue: Společnost pro radioelektronické inženýrství
Publication date: 01/12/2002
Field of study

This article discusses possibilities of integrating speech technology into wireless technology, allowing voice input for wireless devices. Distributed speech recognition concept and activities related to its standardization are presented. First ETSI DSR MFCC based standard is described. Work on its extension to improve robustness resulting in new standard is also presented

Directory of Open Access Journals

Digital library of Brno University of Technology

Audio Surveillance through Known Event Classiﬁcation

Author: F. Grezl
J. Cernocky
Publication venue: Spolecnost pro radioelektronicke inzenyrstvi
Publication date: 01/12/2009
Field of study

The way of audio surveillance through known event classiﬁcation is presented introducing simple yet efﬁcient framework. The use of the proposed system for unknown event detection is also suggested and evaluated. Further, a speciﬁc audio event is detected with use of audio classiﬁcation, which helps the detection to focus on a signal of speciﬁc behavior. Thus it is shown that the system can be used in several applications

Directory of Open Access Journals

Transcribing meetings with the AMIDA systems

Author: Burget L.
Dines J.
El Hannani A.
Garner P.N.
Grezl F.
Hain T.
Huijbregts M.
Karafiat M.
Lincoln M.
Wan V.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2012
Field of study

In this paper, we give an overview of the AMIDA systems for transcription of conference and lecture room meetings. The systems were developed for participation in the Rich Transcription evaluations conducted by the National Institute for Standards and Technology in the years 2007 and 2009 and can process close talking and far field microphone recordings. The paper first discusses fundamental properties of meeting data with special focus on the AMI/AMIDA corpora. This is followed by a description and analysis of improved processing and modeling, with focus on techniques specifically addressing meeting transcription issues such as multi-room recordings or domain variability. In 2007 and 2009, two different strategies of systems building were followed. While in 2007 we used our traditional style system design based on cross adaptation, the 2009 systems were constructed semi-automatically, supported by improved decoders and a new method for system representation. Overall these changes gave a 6%-13% relative reduction in word error rate compared to our 2007 results while at the same time requiring less training material and reducing the real-time factor by five times. The meeting transcription systems are available at www.webasr.org

Infoscience - École polytechnique fédérale de Lausanne

Crossref

White Rose Research Online