Search CORE

18 research outputs found

5IDER: Unified Query Rewriting for Steering, Intent Carryover, Disfluencies, Entity Carryover and Repair

Author: Akbacak Murat
Li Site
Lu Jiarui
Moniz Joel Ruben Antony
Tseng Bo-Hsiang
Yu Hong
Zhu Xueyun
Publication venue
Publication date: 02/06/2023
Field of study

Providing voice assistants the ability to navigate multi-turn conversations is a challenging problem. Handling multi-turn interactions requires the system to understand various conversational use-cases, such as steering, intent carryover, disfluencies, entity carryover, and repair. The complexity of this problem is compounded by the fact that these use-cases mix with each other, often appearing simultaneously in natural language. This work proposes a non-autoregressive query rewriting architecture that can handle not only the five aforementioned tasks, but also complex compositions of these use-cases. We show that our proposed model has competitive single task performance compared to the baseline approach, and even outperforms a fine-tuned T5 model in use-case compositions, despite being 15 times smaller in parameters and 25 times faster in latency.Comment: Interspeech 202

arXiv.org e-Print Archive

Результаты численного решения уравнения Лапласа для фигур вращения с формами от - 0,0025 до - 0,2500

Author: Akbacak Murat
Franco Horacio
Frandsen Michael
Hasan Savsa
Jameel Huda
Kathol Andreas
Khadivi Shahram
Lei Xin
Mandal Arindam
Mansour Saab
Precoda Kristin
Richey Colleen
Vergyri Dimitra
Wang Wen
Yang Mei
Zheng Jing
Publication venue: Томский политехнический университет
Publication date: 01/01/1965
Field of study

Electronic archive of Tomsk Polytechnic University

Detecting Nonnative Speech Using Speaker Recognition Approaches

Author: Andreas Stolcke
Elizabeth Shriberg
Luciana Ferrer
Murat Akbacak
Nicolas Scheffer
Sachin Kajarekar
Publication venue
Publication date
Field of study

Detecting whether a talker is speaking his native language is useful for speaker recognition, speech recognition, and intelligence applications. We study the problem of detecting nonnative speakers of American English, using two standard speech corpora. We apply approaches effective in speaker verification to this task, including systems based on MLLR, phone N-gram, prosodic, and word Ngram features. Results show equal error rates between 12 % and 20%, depending on the system, test data, and choice of training data. Asymmetries in performance are most likely explained by differences in native language distributions in the corpora. Model combination yields substantial improvements over individual models, with the best result being around 8.6 % EER. While phone Ngrams are widely used in related tasks (e.g., language and dialect ID), we find that it is the least effective model in combination; MLLR, prosody, and word N-gram systems play stronger roles. Overall, results suggest that individual systems and system combinations found useful for speaker ID also offer promise for nonnativeness detection, and that further efforts are warranted in this area. 1

CiteSeerX