Recent advances and challenges in task-oriented dialog systems

Z Zhang, R Takanobu, Q Zhu, ML Huang… - Science China …, 2020 - Springer
Due to the significance and value in human-computer interaction and natural language
processing, task-oriented dialog systems are attracting more and more attention in both …

[PDF][PDF] KLUE: Korean Language Understanding Evaluation

S Park - arxiv preprint arxiv:2105.09680, 2021 - academia.edu
We introduce Korean Language Understanding Evaluation (KLUE) benchmark. KLUE is a
collection of 8 Korean natural language understanding (NLU) tasks, including Topic …

" Do you follow me?": A Survey of Recent Approaches in Dialogue State Tracking

L Jacqmin, LM Rojas-Barahona, B Favre - arxiv preprint arxiv:2207.14627, 2022 - arxiv.org
While communicating with a user, a task-oriented dialogue system has to track the user's
needs at each turn according to the conversation history. This process called dialogue state …

Rethinking explainability as a dialogue: A practitioner's perspective

H Lakkaraju, D Slack, Y Chen, C Tan… - arxiv preprint arxiv …, 2022 - arxiv.org
As practitioners increasingly deploy machine learning models in critical domains such as
health care, finance, and policy, it becomes vital to ensure that domain experts function …

Convlab-2: An open-source toolkit for building, evaluating, and diagnosing dialogue systems

Q Zhu, Z Zhang, Y Fang, X Li, R Takanobu, J Li… - arxiv preprint arxiv …, 2020 - arxiv.org
We present ConvLab-2, an open-source toolkit that enables researchers to build task-
oriented dialogue systems with state-of-the-art models, perform an end-to-end evaluation …

Multiwoz 2.4: A multi-domain task-oriented dialogue dataset with essential annotation corrections to improve state tracking evaluation

F Ye, J Manotumruksa, E Yilmaz - arxiv preprint arxiv:2104.00773, 2021 - arxiv.org
The MultiWOZ 2.0 dataset has greatly stimulated the research of task-oriented dialogue
systems. However, its state annotations contain substantial noise, which hinders a proper …

Multi 3 WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems

S Hu, H Zhou, M Hergul, M Gritta, G Zhang… - Transactions of the …, 2023 - direct.mit.edu
Creating high-quality annotated data for task-oriented dialog (ToD) is known to be
notoriously difficult, and the challenges are amplified when the goal is to create equitable …

Mathdial: A dialogue tutoring dataset with rich pedagogical properties grounded in math reasoning problems

J Macina, N Daheim, SP Chowdhury, T Sinha… - arxiv preprint arxiv …, 2023 - arxiv.org
While automatic dialogue tutors hold great potential in making education personalized and
more accessible, research on such systems has been hampered by a lack of sufficiently …

Seqgpt: An out-of-the-box large language model for open domain sequence understanding

T Yu, C Jiang, C Lou, S Huang, X Wang… - Proceedings of the …, 2024 - ojs.aaai.org
Large language models (LLMs) have shown impressive abilities for open-domain NLP
tasks. However, LLMs are sometimes too footloose for natural language understanding …

Overview of the ninth dialog system technology challenge: Dstc9

C Gunasekara, S Kim, LF D'haro… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org
This paper introduces the Ninth Dialog System Technology Challenge (DSTC-9). This
edition of the DSTC focuses on applying end-to-end dialog technologies for four distinct …