
30 research outputs found

A Method for Improving Memory Efficiency in Systems for Keyword Spotting in Speech Signals

Author: Bykov M. M.
Konate K.
Kovtun V. V.
Биков М. М.
Быков Н. М.
Ковтун В. В.
Конате К.
Publication venue: ВНТУ
Publication date: 01/01/2012

A method is developed for improving the efficiency of an associative memory structure in a system for keyword spotting in speech signals. It rests on two proposed principles: storing only the non-matching parts of the reference templates in individual associative memory cells, and taking into account the potential feasibility of an a priori specified division of keywords into classes in the template memory. A mathematical justification is given for the optimal choice of the associative feature.

Repository of Vinnytsia National Technical University

Fast Keyword Spotting in Telephone Speech

Author: Nouza J.
Silovsky J.
Publication venue: Společnost pro radioelektronické inženýrství
Publication date: 01/01/2009

In the paper, we present a system designed for detecting keywords in telephone speech. We focus not only on achieving high accuracy but also on very short processing time. The keyword spotting system can run in three modes: a) an off-line mode requiring less than 0.1xRT, b) an on-line mode with minimum (2 s) latency, and c) a repeated spotting mode, in which pre-computed values allow for additional acceleration. Its performance is evaluated on recordings of Czech spontaneous telephone speech using rather large and complex keyword lists.

Directory of Open Access Journals

DSpace@TUL

Digital library of Brno University of Technology

Very Fast Keyword Spotting System with Real Time Factor below 0.01

Author: J Foote
J Málek
J Nouza
X Zhou
Publication venue: Springer Science and Business Media LLC
Publication date: 21/07/2020

In the paper we present an architecture of a keyword spotting (KWS) system that is based on modern neural networks, yields good performance on various types of speech data and can run very fast. We focus mainly on the last aspect and propose optimizations for all the steps required in a KWS design: signal processing and likelihood computation, Viterbi decoding, spot candidate detection and confidence calculation. We present time- and memory-efficient modelling by bidirectional feedforward sequential memory networks (an alternative to recurrent nets), using either standard triphones or so-called quasi-monophones, and an entirely forward decoding of speech frames (with minimal need for look-back). Several variants of the proposed scheme are evaluated on 3 large Czech datasets (broadcast, internet and telephone, 17 hours in total) and their performance is compared by Detection Error Tradeoff (DET) diagrams and real-time (RT) factors. We demonstrate that the complete system can run in a single pass with an RT factor close to 0.001 if all optimizations (including a GPU for likelihood computation) are applied. Comment: 11 pages, 3 figures.

arXiv.org e-Print Archive

Crossref

Keyword Spotting Experiments on Spoken News Recordings Using Phone-Based Recognition Techniques

Author: Gosztolya Gábor
Tóth László
Publication venue: Szegedi Tudományegyetem
Publication date: 01/01/2010

To make speech databases searchable, they must be annotated with textual labels. The obvious solution would be to produce a word-level transcript with a large-vocabulary speech recognizer. Recognizers, however, work with a closed vocabulary, so search terms that matter to us (proper names, named entities) may be impossible to find simply because they are absent from the recognizer's vocabulary. In this paper we compare solutions that perform the preliminary indexing at the phone level only and can therefore search for arbitrary query terms (phone sequences). The retrieval accuracy of the examined methods promises to be usable in practice, thanks to the inherently high phone recognition accuracy. In terms of running time, however, even the fastest method proves much slower than would be expected of such an application. Sophisticated indexing techniques will therefore be needed in future work.

SZTE Publicatio Repozitórium - SZTE - Repository of Publications

Supporting Engagement and Floor Control in Hybrid Meetings

Author: Hofs D.H.W.
Hondorp G.H.W.
Nijholt Antinus
op den Akker Harm
op den Akker Hendrikus J.A.
Zwiers Jakob
Publication venue: Springer
Publication date: 14/07/2009

University of Twente Research Information

Fast and Accurate Keyword Spotting System

Author: Lenčéš Marián
Publication venue: Vysoké učení technické v Brně. Fakulta informačních technologií
Publication date: 01/01/2016

This bachelor's thesis deals with fast and accurate detection of keywords in audio recordings. The aim was to study the possibilities of word detection and to create several types of language models, which were then compared with each other. We focus here on the detection of keywords in audio recordings of spoken English.

Digital library of Brno University of Technology

National Repository of Grey Literature