3 research outputs found

    Improving Pronunciation Inference using N-Best List, Acoustics and Orthography

    No full text

    Acoustic-based improving pronunciation inference using n-best list, acoustics and orthography

    No full text
    In this paper, we tackle the problem of pronunciation inference and Out-of-Vocabulary (OOV) enrollment in Automatic Speech Recognition (ASR) applications. We combine linguistic and acoustic information of the OOV word using its spelling and a single instance of its utterance to derive an appropriate phonetic baseform. The novelty of the approach is in its employment of an orthography-driven n-best hypothesis and rescoring strategy of the pronunciation alternatives. We make use of decision trees and heuristic tree search to construct and score the n-best hypotheses space. We use acoustic alignment likelihood and phone transition cost to leverage the empirical evidence and phonotactic priors to rescore the hypotheses and refine the baseforms. Index Terms β€” n-best list, Out-of-Vocabulary, letter-tosound rules, pronunciation modeling, automatic pronunciation learning 1
    corecore