1 research outputs found

    Unified Frame and Segment Based Models for Automatic Speech Recognition

    No full text
    In this paper, we propose an analytically tractable framework that integrates the frame and segment based acoustic modeling techniques. We combine the two approaches by jointly modeling their respective hidden Markov processes. Since the joint process is based on the same mathematical framework, conventional search and training techniques, such as Viterbi and EM algorithms, can be directly applied. It also allows the score from either model to contribute to the training and decoding of the other, reaching a jointly optimal decision. We conducted two series of experiments to verify our hypotheses. In the phone-pair classification experiments, our segment models show a 24 % error reduction over state-of-the-art HMM-based system. The superior quality of segment models contributes to an 8.2 % reduction in word error rates for the unified system on the WSJ dictation task. 1
    corecore