Morph-to-word transduction for accurate and efficient automatic speech recognition and keyword search
Authors
MJF Gales
KM Knill
A Ragni
D Saunders
J Vasilakes
P Zahemszky
Publication date
19 June 2017
Publisher
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Abstract
© 2017 IEEE. Word units are a popular choice in statistical language modelling. For inflective and agglutinative languages this choice may result in a high out-of-vocabulary rate. Subword units, such as morphs, provide an interesting alternative to words. These units can be derived in an unsupervised fashion and empirically show lower out-of-vocabulary rates. This paper proposes a morph-to-word transduction to convert morph sequences into word sequences. This enables powerful word language models to be applied. In addition, it is expected that techniques such as pruning, confusion network decoding, keyword search and many others may benefit from word-level rather than morph-level decision making. However, word or morph systems alone may not achieve optimal performance in tasks such as keyword search, so a combination is typically employed. This paper proposes a single index approach that enables word, morph and phone searches to be performed over a single morph index. Experiments are conducted on IARPA Babel program languages, including the surprise languages of the OpenKWS 2015 and 2016 competitions.
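As a rough illustration of what converting a morph sequence into a word sequence involves, the sketch below joins morphs into words deterministically. It is not the transduction proposed in the paper, whose details are not given in the abstract. The function name `morphs_to_words` and the trailing `+` continuation marker are hypothetical conventions chosen only for this example.

```python
from typing import Iterable, List

# Assumption for illustration only: a morph ending in "+" continues into
# the next morph; a morph without the marker ends the current word.
CONT = "+"


def morphs_to_words(morphs: Iterable[str], cont: str = CONT) -> List[str]:
    """Join a marked morph sequence into a word sequence."""
    words: List[str] = []
    buffer = ""  # morphs collected so far for the word being built
    for morph in morphs:
        if morph.endswith(cont):
            # Word-internal morph: strip the marker and keep collecting.
            buffer += morph[: -len(cont)]
        else:
            # Word-final morph: emit the completed word.
            words.append(buffer + morph)
            buffer = ""
    if buffer:
        # Flush a dangling partial word if the sequence ends mid-word.
        words.append(buffer)
    return words


print(morphs_to_words(["talo+", "ssa", "on", "kissa"]))
```

Once morph output is mapped to words in this way, word-level components such as a word language model or a word-level keyword search can operate on the result, which is the motivation the abstract gives for the transduction.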
Available Versions
White Rose Research Online
oai:eprints.whiterose.ac.uk:15...
Last time updated on 19/11/2019
White Rose Research Online
oai:eprints.whiterose.ac.uk:15...
Last time updated on 02/02/2021
Crossref
info:doi/10.1109%2Ficassp.2017...
Last time updated on 05/08/2021
Apollo (Cambridge)
oai:www.repository.cam.ac.uk:1...
Last time updated on 24/03/2018