3 research outputs found
Noise Robust Dialogue Act Recognition for Task-oriented Dialogues
νμλ
Όλ¬Έ (μμ¬)-- μμΈλνκ΅ λνμ : μ κΈ°Β·μ»΄ν¨ν°κ³΅νλΆ, 2015. 8. μ΄μꡬ.λν μμ€ν
κ³Ό μ΄λ©μΌ, κ²μκΈ μμ½ μμ€ν
ꡬμΆμ μμ΄ λν μλ λΆλ₯λ μ€μν μν μ νλ€. μ΄λ κ°κ°μ μμ€ν
λ€μ΄ λ°ν, λ©μΌ, κ²μκΈ ννμ λ°μ΄ν°μ λνμ¬ λν μλλ₯Ό λΆλ₯νκ³ μ΄ μ 보λ₯Ό νμ μμ
μ μ
λ ₯μΌλ‘ μ¬μ©νκΈ° λλ¬Έμ΄λ€. κ·Έλμ λν μλ λΆλ₯ μ±λ₯μ΄ ν΄λΉ μμ€ν
μ μ 체 μ±λ₯μ ν¬κ² μν₯μ μ£ΌκΈ° λλ¬Έμ μ±λ₯ ν₯μ μΈ‘λ©΄μ μμ΄ μ€μνλ€. λν μλ λΆλ₯λ λν λ΄ λ°νμ λν μλλ₯Ό ν λΉνλ λ¬Έμ μ΄λ€.
νΉν λν μμ€ν
μμλ μμ± μΈμ μλ¬κ° μ‘΄μ¬νκΈ° λλ¬Έμ μλ¬μ κ°μΈν λν μλ λΆλ₯ λͺ¨λΈμ΄ νμνλ€. λ°λΌμ λ³Έ λ
Όλ¬Έμμλ λ λͺ
μ μ¬λμ΄ νΉμ λͺ©μ μ κ°μ§κ³ μ§ννλ κ³Όμ μ§ν₯ν λνλΌλ μν©μμ λ°ν, νμ, λν μλλ₯Ό κ³ λ €νμ¬ λν ꡬ쑰λ₯Ό λͺ¨μ¬νλ μμ±λͺ¨λΈμ λ§λ€μ΄ λ
Έμ΄μ¦ λ°μ΄ν°μ λμνμλ€. μ΄ λͺ¨λΈμ κΈ°λ°μ΄ λλ κ°μ μ νμλ μ΄λ ν νμλ₯Ό μννκ³ μ νλ λͺ©μ μ κ°μ§κ³ , κ·Έ λͺ©μ μ λ§λ μ μ ν μ΄ν μ§ν©μ μ¬μ©νμ¬ μλλ°©μκ² λ§μ νλ€λ κ²μ΄λ€. μ¦ μ μν λͺ¨λΈμ μ΄λ¬ν κ°μ μ κ³ λ €νμ¬ λ§λ₯΄μ½ν λͺ¨λΈμ κ°μ νμλ€.
κ³Όμ μ§ν₯ν λ°μ΄ν°μΈ HCRC map task, live chat, SACTI-1 λ§λμΉλ₯Ό μ΄μ©ν μ€νμ ν΅ν΄ μ μν λͺ¨λΈμ΄ κΈ°μ‘΄ λ§λ₯΄μ½ν λͺ¨λΈμ λΉνμ¬ λ λμ μ±λ₯μ 보μ΄κ³ , νμ¬κΉμ§λ λν μλ λΆλ₯ μ±λ₯μ΄ λμ SVM-HMMκ³Ό κ²½μλ ₯ μλ κ²°κ³Όλ₯Ό 보μ΄λ κ²μ νμΈ νμλ€. νΉν λν μμ€ν
μ μμ± μΈμ λͺ¨λμ μλ¬λ₯Ό λͺ¨λ°©ν SACTI-1 λ§λμΉμ λνμ¬ μ μν λͺ¨λΈμ΄ SVM-HMMμ λΉνμ¬ λ
Έμ΄μ¦μ κ°μΈν¨μ 보μλ€.In spoken dialog system, e-mail summary system and thread summary system development, dialogue act classifier plays an important role because the systems depend on the performance of classifying dialogue acts of utterances, e-mails and posts to improve completeness of the system. The dialogue act classification problem is a well-known problem to assign the dialogue acts to utterances in a conversation.
One of the main challenges in the development of robust dialog systems is especially to deal with noisy input due to imperfect results from Automatic Speech Recognition (ASR) module. The challenge in dialogue act recognition is the mapping from noisy user utterances to dialogue acts. In this paper, to cope with noisy utterances, we describe a noise robust generative model of task-oriented conversation that captures both the speaker information and the dialogue act associated with each utterance under the assumption that a speaker says about something by using appropriate vocabulary with the aim of getting someone to do somethings. The proposed model is based on Markov model and is modified to reflect the assumption.
In the experiments, we evaluate the classification results by comparing them to the simple Markov model and state-of-the-art SVM-HMM results. The proposed model is a better conversation model than the simple Markov model and shows the competitive classification results in comparison with SVM-HMM in the task-oriented HCRC map task corpus, live-chat corpus and SACTI-1 corpus. Results based on SACTI-1 corpus which simulates ASR errors particularly show that the proposed model is robust against noisy user utterances.1. μλ‘ 1
1.1 μ°κ΅¬μ λ°°κ²½ 1
1.2 μ°κ΅¬μ λ΄μ© λ° λ²μ 3
1.3 λ
Όλ¬Έμ κ΅¬μ± 6
2. λ¬Έμ μ μ 7
2.1 λνλ¬Έμ ꡬμ±μμ 7
2.2 λν μλ λΆλ₯ λ¬Έμ μ μ 12
2.3 λνλ¬Έμ νΉμ§ λ° λ¬Έμ ν΄κ²°μ μ΄λ €μ΄ μ 13
3. κ΄λ ¨ μ°κ΅¬ 15
3.1 μ§λ νμ΅ κΈ°λ°μ λν μλ λΆλ₯ μ°κ΅¬ 15
3.2 λν μλμ μμ‘΄ κ΄κ³λ₯Ό λͺ¨λΈλ§ ν μ°κ΅¬ 16
3.3 κΈ°μ‘΄ μ°κ΅¬μ νκ³μ 22
4. λ§λ₯΄μ½ν λͺ¨λΈ κΈ°λ° λν μλ λΆλ₯ 24
4.1 λ°°κ²½μ§μ 24
4.1.1 μΈμ΄λͺ¨λΈ 24
4.1.2 λ§λ₯΄μ½ν λͺ¨λΈκ³Ό μλ λ§λ₯΄μ½ν λͺ¨λΈ 25
4.2 μ
μΆλ ₯ λ§λ₯΄μ½ν λͺ¨λΈμ λ³νν λν μλ λΆλ₯ λͺ¨λΈ 26
5. μ±λ₯ νκ° 31
5.1 λν λ§λμΉ 31
5.2 λΉκ΅λͺ¨λΈ λ° κ°λ°νκ²½ 38
5.3 μ±λ₯ νκ° μΈ‘μ μΉ 39
5.4 μ€ν κ²°κ³Ό λ° λΆμ 40
5.4.1 λΆλ₯ μ±λ₯ 41
5.4.2 ASR λ
Έμ΄μ¦μ λν κ°μΈμ± 45
5.4.3 νμ₯μ± 48
6. κ²°λ‘ λ° ν₯ν μ°κ΅¬ 50
6.1 κ²°λ‘ 50
6.2 ν₯ν μ°κ΅¬ 51
μ°Έκ³ λ¬Έν 53
ABSTRACT 57Maste
Impact of ASR N-Best Information on Bayesian Dialogue Act Recognition
A challenge in dialogue act recognition is the mapping from noisy user inputs to dialogue acts. In this paper we describe an approach for re-ranking dialogue act hypotheses based on Bayesian classifiers that incorporate dialogue history and Automatic Speech Recognition (ASR) N-best information. We report results based on the Letβs Go dialogue corpora that show (1) that including ASR N-best information results in improved dialogue act recognition performance (+7 % accuracy), and (2) that competitive results can be obtained from as early as the first system dialogue act, reducing the need to wait for subsequent system dialogue acts.
Impact of ASR N-Best Information on Bayesian Dialogue Act Recognition
A challenge in dialogue act recognition is the mapping from noisy user inputs to dialogue acts. In this paper we describe an approach for re-ranking dialogue act hypotheses based on Bayesian classifiers that incorporate dialogue history and Automatic Speech Recognition (ASR) N-best information. We report results based on the Letβs Go dialogue corpora that show (1) that including ASR N-best information results in improved dialogue act recognition performance (+7 % accuracy), and (2) that competitive results can be obtained from as early as the first system dialogue act, reducing the need to wait for subsequent system dialogue acts.