3 research outputs found

    Matching inconsistently spelled names in automatic speech recognizer output for information retrieval

    No full text

    Matching Inconsistently Spelled Names in Automatic Speech Recognizer Output for Information Retrieval

    No full text
    Many proper names are spelled inconsistently in speech recognizer output, posing a problem for applications where locating mentions of named entities is critical. We model the distortion in the spelling of a name due to the speech recognizer as the effect of a noisy channel. The models follow the framework of the IBM translation models. The model is trained using a parallel text of closed caption and automatic speech recognition output. We also test a string edit distance based method. The effectiveness of these models is evaluated on a name query retrieval task. Our methods result in a 60 % improvement in F1. We also demonstrate why the problem has not been critical in TREC and TDT tasks.
    corecore