Search CORE

130 research outputs found

End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations

Author: Cernocky Jan
Saraclar Murat
Yusuf Bolaji
Publication venue
Publication date: 15/08/2023
Field of study

Conventional keyword search systems operate on automatic speech recognition (ASR) outputs, which causes them to have a complex indexing and search pipeline. This has led to interest in ASR-free approaches to simplify the search procedure. We recently proposed a neural ASR-free keyword search model which achieves competitive performance while maintaining an efficient and simplified pipeline, where queries and documents are encoded with a pair of recurrent neural network encoders and the encodings are combined with a dot-product. In this article, we extend this work with multilingual pretraining and detailed analysis of the model. Our experiments show that the proposed multilingual training significantly improves the model performance and that despite not matching a strong ASR-based conventional keyword search system for short queries and queries comprising in-vocabulary words, the proposed model outperforms the ASR-based system for long queries and queries that do not appear in the training data.Comment: Accepted by IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), 202

arXiv.org e-Print Archive

Pronunciation modelling for conversational speech recognition: a status report from WS97

Author: Byrne B.
Finke Michael
Khudanpur S.
Mcdonough J.
Nock H.
Riley M.
Saraclar M.
Woolers C.
Zavaliagkos G.
Publication venue
Publication date: 02/08/2007
Field of study

KITopen

Spoken term detection ALBAYZIN 2014 evaluation: overview, systems, results, and discussion

Author: A Abad
A Cardenal-Lopez
A Cardenal-López
A Jansen
A Jansen
A Martin
A Moreno
A Moreno
A Moreno-Sandoval
A Stolcke
Alejandro Coucheiro-Limeres
AM Azmi
Antonio Cardenal
Antonio Miguel
B Logan
B Logan
B Ma
B Taras
B Zhang
C Ni
C Parada
Carmen Garcia-Mateo
CJ Chen
D Can
D Karakos
D Povey
D Vergyri
D Vergyri
Doroteo T. Toledano
F Metze
F Metze
GJF Jones
H Joho
H Joho
H Su
H-Y Lee
H-Y Lee
HVD Heuvel
I Szöke
I Szöke
I-F Chen
I-F Chen
J Chiu
J Chiu
J Chiu
J Garofolo
J Li
J Mamou
J Mamou
J Pinto
J Tejedor
J Tejedor
J Trmal
J van Hout
Javier Tejedor
JG Fiscus
Julia Olcoz
Julian David Echeverry-Correa
K Iwata
K Thambiratmann
KM Knill
KM Knill
L Docío-Fernández
L Mangu
Laura Docio-Fernandez
LJ Rodríguez-Fuentes
M Bisani
M Cai
M Ma
M Saraclar
M Wollmer
M Zelenák
MJF Gales
MS Seigel
N Rajput
NF Chen
NF Chen
P Yu
Paula Lopez-Otero
R Justo
S Nakagawa
SP Rath
T Ng
T Ohno
T Sakai
V Mitra
V-B Le
X Anguera
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

The electronic version of this article is the complete one and can be found online at: http://dx.doi.org/10.1186/s13636-015-0063-8Spoken term detection (STD) aims at retrieving data from a speech repository given a textual representation of the search term. Nowadays, it is receiving much interest due to the large volume of multimedia information. STD differs from automatic speech recognition (ASR) in that ASR is interested in all the terms/words that appear in the speech data, whereas STD focuses on a selected list of search terms that must be detected within the speech data. This paper presents the systems submitted to the STD ALBAYZIN 2014 evaluation, held as a part of the ALBAYZIN 2014 evaluation campaign within the context of the IberSPEECH 2014 conference. This is the first STD evaluation that deals with Spanish language. The evaluation consists of retrieving the speech files that contain the search terms, indicating their start and end times within the appropriate speech file, along with a score value that reflects the confidence given to the detection of the search term. The evaluation is conducted on a Spanish spontaneous speech database, which comprises a set of talks from workshops and amounts to about 7 h of speech. We present the database, the evaluation metrics, the systems submitted to the evaluation, the results, and a detailed discussion. Four different research groups took part in the evaluation. Evaluation results show reasonable performance for moderate out-of-vocabulary term rate. This paper compares the systems submitted to the evaluation and makes a deep analysis based on some search term properties (term length, in-vocabulary/out-of-vocabulary terms, single-word/multi-word terms, and in-language/foreign terms).This work has been partly supported by project CMC-V2 (TEC2012-37585-C02-01) from the Spanish Ministry of Economy and Competitiveness. This research was also funded by the European Regional Development Fund, the Galician Regional Government (GRC2014/024, “Consolidation of Research Units: AtlantTIC Project” CN2012/160)

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

Springer - Publisher Connector

Repositorio Universidad de Zaragoza

Biblos-e Archivo

Can environment or allergy explain international variation in prevalence of wheeze in childhood?

Author: Aarts F. J. H.
Abramidze T.
Abu Huij S.
Addo-Yobo E.
Agolli S.
Aguirre Rodriguez J.
Ait-Khaled N.
Anderson H. R.
Annesi-Maesano I.
Annus T.
Arthur P.
Asher I.
Bagrade L.
Barghuthy F.
Barry D.
Batlles Garrido J.
Beasley R.
Bjorksten B.
Bolle R.
Bonillo Perales A.
Braback L.
Braback L.
Brunekreef B.
Brunekreef B.
Buchele G.
Casno G.
Castejon Robles S.
Chen Y. Z.
Chico M.
Clausen M.
Cooper P. J.
Corbo G. (ORCID:0000-0002-8104-4659)
Crane J.
Crane J.
Daza Torres M.
de Meer G.
de Pereira M. U.
De Sario M.
Dentler C.
Di Domenicantonio R.
Doekes G.
Dolidze N.
El-Sharif N.
Ellwood P.
Escribano Montaner A.
Flohr C.
Foliaki S.
Forastiere F.
Forastiere F.
Forastiere F.
Garcia Hernandez G.
Garcia-Marcos L.
Garcia-Marcos L.
Geyik P.
Gonzalez Gil I.
Gonzalez Jimenez Y.
Gotua M.
Grabocka E.
Gratziou C.
Guillen Perez J. J.
Gurakuqi A.
Hatziagorou E.
Jaensch A.
Jaensch A.
Jansen-van Vliet P. H. N.
Janssen N. A. H.
Jones M. H.
Karsanidze L.
Katsardis C.
Kaur B.
Keil U.
Keil U.
Keil U.
Khubchandani R. P.
Kiladze M.
Kirvassilis F.
Kjellman M.
Kocabas C.
Kuyucu S.
Kvachadze I.
Lai C. K. W.
Lai C. K. W.
Leupold W.
Llopis Gonzalez A.
Losilla Maldonado A.
Luna Paredes C.
Lund E.
Mai X. -M.
Mallol J.
Mantri S.
Martinez Gimeno A.
Martinez Torres A.
Mathur R. S.
Mitchell E.
Momblan deCabo J.
Montefort S.
Morales Suarez-Varela M. M.
Moro Rodriguez A. L.
Nemery B.
Nilsson L.
Novikova I.
Nystad W.
Odhiambo J.
Papadopoulou A.
Pearce N.
Perucci C. A.
Pinana Lopez A.
Pistelli R. (ORCID:0000-0003-3776-2482)
Pitrez P. M.
Priftanji A.
Priftis K.
Qlebo M.
Riikjarv M. -A.
Robertson C.
Rubi Ruiz T.
Ruelius A. -K.
Rukhadze M.
Rzehak P.
Sackesen C.
Sammarro S.
Sandin A.
Saraclar Y.
Schram D.
Sebre D.
Serra M. G.
Shah J. R.
Shkurti A.
Shyti K.
Simenati J.
Stein R. T.
Stewart A.
Strachan D. P.
Strachan D. P.
Strachan D. P.
Sumbuloglu V.
Svabe V.
Tallon Guerola M.
Tsanakas J.
Tuncer A.
van Hage M.
von Mutius E.
von Mutius E.
von Mutius E.
Weiland S. K.
Weiland S. K.
Weiland S. K.
Weiland S. K.
Weinmayr G.
Weinmayr G.
Weinmayr G.
Wickens K.
Williams H.
Wong G. W. K.
Zhong N. S.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Asthma prevalence in children varies substantially around the world, but the contribution of known risk factors to this international variation is uncertain. The International Study of Asthma and Allergies in Childhood (ISAAC) Phase Two studied 8–12 year old children in 30 centres worldwide with parent-completed symptom and risk factor questionnaires and aeroallergen skin prick testing. We used multilevel logistic regression modelling to investigate the effect of adjustment for individual and ecological risk factors on the between-centre variation in prevalence of recent wheeze. Adjustment for single individual-level risk factors changed the centre-level variation from a reduction of up to 8.4% (and 8.5% for atopy) to an increase of up to 6.8%. Modelling the 11 most influential environmental factors among all children simultaneously, the centre-level variation changed little overall (2.4% increase). Modelling only factors that decreased the variance, the 6 most influential factors (synthetic and feather quilt, mother’s smoking, heating stoves, dampness and foam pillows) in combination resulted in a 21% reduction in variance. Ecological (centre-level) risk factors generally explained higher proportions of the variation than did individual risk factors. Single environmental factors and aeroallergen sensitisation measured at the individual (child) level did not explain much of the between-centre variation in wheeze prevalence

Crossref

LSHTM Research Online

PubliCatt

St George's Online Research Archive

The pitfall of the echocardiography in congenital heart disease

Author: SARACLAR M
Publication venue: 'Japan Society of Ultrasonics in Medicine'
Publication date: 01/01/2009
Field of study

Crossref