Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI

التفاصيل البيبلوغرافية
العنوان: Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI
المؤلفون: Asano, Yuya, Hassan, Sabit, Sharma, Paras, Sicilia, Anthony, Atwell, Katherine, Litman, Diane, Alikhani, Malihe
سنة النشر: 2025
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
الوصف: General-purpose automatic speech recognition (ASR) systems do not always perform well in goal-oriented dialogue. Existing ASR correction methods rely on prior user data or named entities. We extend correction to tasks that have no prior user data and exhibit linguistic flexibility such as lexical and syntactic variations. We propose a novel context augmentation with a large language model and a ranking strategy that incorporates contextual information from the dialogue states of a goal-oriented conversational AI and its tasks. Our method ranks (1) n-best ASR hypotheses by their lexical and semantic similarity with context and (2) context by phonetic correspondence with ASR hypotheses. Evaluated in home improvement and cooking domains with real-world users, our method improves recall and F1 of correction by 34% and 16%, respectively, while maintaining precision and false positive rate. Users rated .8-1 point (out of 5) higher when our correction method worked properly, with no decrease due to false positives.
Comment: Accepted to COLING 2025 Industry Track
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2501.06129
رقم الانضمام: edsarx.2501.06129
قاعدة البيانات: arXiv