Search CORE

47,226 research outputs found

English Conversational Telephone Speech Recognition by Humans and Machines

Author: Audhkhasi Kartik
Cui Xiaodong
Dimitriadis Dimitrios
Hall Phil
Kurata Gakuto
Lim Lynn-Li
Picheny Michael
Ramabhadran Bhuvana
Roomi Bergul
Saon George
Sercu Tom
Thomas Samuel
Publication venue
Publication date: 06/03/2017
Field of study

One of the most difficult speech recognition tasks is accurate recognition of human to human communication. Advances in deep learning over the last few years have produced major speech recognition improvements on the representative Switchboard conversational corpus. Word error rates that just a few years ago were 14% have dropped to 8.0%, then 6.6% and most recently 5.8%, and are now believed to be within striking range of human performance. This then raises two issues - what IS human performance, and how far down can we still drive speech recognition error rates? A recent paper by Microsoft suggests that we have already achieved human performance. In trying to verify this statement, we performed an independent set of human performance measurements on two conversational tasks and found that human performance may be considerably better than what was earlier reported, giving the community a significantly harder goal to achieve. We also report on our own efforts in this area, presenting a set of acoustic and language modeling techniques that lowered the word error rate of our own English conversational telephone LVCSR system to the level of 5.5%/10.3% on the Switchboard/CallHome subsets of the Hub5 2000 evaluation, which - at least at the writing of this paper - is a new performance milestone (albeit not at what we measure to be human performance!). On the acoustic side, we use a score fusion of three models: one LSTM with multiple feature inputs, a second LSTM trained with speaker-adversarial multi-task learning and a third residual net (ResNet) with 25 convolutional layers and time-dilated convolutions. On the language modeling side, we use word and character LSTMs and convolutional WaveNet-style language models

arXiv.org e-Print Archive

Crossref

High-performance computing and networking. Report of the HPCN Embedded Systems Industrial Working Group. III/6071/94-EN, April 1994

Author
Publication venue
Publication date: 01/01/1994
Field of study

Archive of European Integration

TechNews digests: Jan - Mar 2010

Author
Publication venue: British Educational Communications and Technology Agency (BECTA)
Publication date: 01/01/2010
Field of study

TechNews is a technology, news and analysis service aimed at anyone in the education sector keen to stay informed about technology developments, trends and issues. TechNews focuses on emerging technologies and other technology news. TechNews service : digests september 2004 till May 2010 Analysis pieces and News combined publish every 2 to 3 month

Digital Education Resource Archive

Evaluating indoor positioning systems in a shopping mall : the lessons learned from the IPIN 2018 competition

Author: Ali Muhammad Usman
Ben-Moshe Boaz
Chien Ying-Ren
Cho Eunyoung
Ding Zhenxing
Fang Shih-Hau
Hacohen Shlomi
Han Jaeseung
Hur Soojung
Jeong Hyeongyo
Jun Sungwoo
Knauth Stefan
Kronenwett Nikolai
Kuang Jian
Landa Vlad
Landau Yael
Lee Changeun
Lee Keumryeol
Lee Soyeon
Lee Yonghyun
Li Xianghong
Li Yu
Lu Chuanhua
Lungenstrass Tomas
Marbel Revital
Martin Mendoza-Silva German
Niu Xiaoji
Opiela Miroslav
Ortiz Miguel
Pablo Morales Juan
Park Chan Gook
Park Changjun
Park Sangjoon
Park So Young
Park Yongwan
Perez-Navarro Antoni
Perul Johan
Pipelidis Georgios
Plets David
Ramon Jimenez Antonio
Renaudin Valerie
Rew Jehyeok
Seco Fernando
Shimada Atsushi
Shvalb Nir
Taniguchi Rin-Ichiro
Thomas Diego
Torres-Sospedra Joaquin
Trogh Jens
Tsao Yu
Tsiamitros Nikolaos
Uchiyama Hideaki
Vladimirov Blagovest
Wei Dongyan
Xu Feng
Yang Shi-Shen
Ye Feng
Ye Shih-Jyun
Zhang Wenchao
Zhang Ying
Zheng Xingyu
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2019
Field of study

The Indoor Positioning and Indoor Navigation (IPIN) conference holds an annual competition in which indoor localization systems from different research groups worldwide are evaluated empirically. The objective of this competition is to establish a systematic evaluation methodology with rigorous metrics both for real-time (on-site) and post-processing (off-site) situations, in a realistic environment unfamiliar to the prototype developers. For the IPIN 2018 conference, this competition was held on September 22nd, 2018, in Atlantis, a large shopping mall in Nantes (France). Four competition tracks (two on-site and two off-site) were designed. They consisted of several 1 km routes traversing several floors of the mall. Along these paths, 180 points were topographically surveyed with a 10 cm accuracy, to serve as ground truth landmarks, combining theodolite measurements, differential global navigation satellite system (GNSS) and 3D scanner systems. 34 teams effectively competed. The accuracy score corresponds to the third quartile (75th percentile) of an error metric that combines the horizontal positioning error and the floor detection. The best results for the on-site tracks showed an accuracy score of 11.70 m (Track 1) and 5.50 m (Track 2), while the best results for the off-site tracks showed an accuracy score of 0.90 m (Track 3) and 1.30 m (Track 4). These results showed that it is possible to obtain high accuracy indoor positioning solutions in large, realistic environments using wearable light-weight sensors without deploying any beacon. This paper describes the organization work of the tracks, analyzes the methodology used to quantify the results, reviews the lessons learned from the competition and discusses its future

Ghent University Academic Bibliography

Digital.CSIC

Proceedings of the 20th BCS HCI Group conference Volume Two

Author: Fields Bob
Healey Patrick
Nickerson Louise Valgerdur
Stockman Tony
Publication venue
Publication date: 30/12/2013
Field of study

Queen Mary Research Online