Recent advances in deep learning based dialogue systems: A systematic survey
Dialogue systems are a popular natural language processing (NLP) task as it is promising in
real-life applications. It is also a complicated task since many NLP tasks deserving study are …
real-life applications. It is also a complicated task since many NLP tasks deserving study are …
A survey of evaluation metrics used for NLG systems
In the last few years, a large number of automatic evaluation metrics have been proposed for
evaluating Natural Language Generation (NLG) systems. The rapid development and …
evaluating Natural Language Generation (NLG) systems. The rapid development and …
: Evaluating Factual Consistency in Knowledge-Grounded Dialogues via Question Generation and Question Answering
Neural knowledge-grounded generative models for dialogue often produce content that is
factually inconsistent with the source text they rely on. As a consequence, such models are …
factually inconsistent with the source text they rely on. As a consequence, such models are …
InstructDial: Improving zero and few-shot generalization in dialogue through instruction tuning
Instruction tuning is an emergent paradigm in NLP wherein natural language instructions
are leveraged with language models to induce zero-shot performance on unseen tasks …
are leveraged with language models to induce zero-shot performance on unseen tasks …
State-of-the-art generalisation research in NLP: a taxonomy and review
The ability to generalise well is one of the primary desiderata of natural language
processing (NLP). Yet, what'good generalisation'entails and how it should be evaluated is …
processing (NLP). Yet, what'good generalisation'entails and how it should be evaluated is …
Masked graph learning with recurrent alignment for multimodal emotion recognition in conversation
T Meng, F Zhang, Y Shou, H Shao… - IEEE/ACM Transactions …, 2024 - ieeexplore.ieee.org
Since Multimodal Emotion Recognition in Conversation (MERC) can be applied to public
opinion monitoring, intelligent dialogue robots, and other fields, it has received extensive …
opinion monitoring, intelligent dialogue robots, and other fields, it has received extensive …
Simple LLM prompting is state-of-the-art for robust and multilingual dialogue evaluation
Despite significant research effort in the development of automatic dialogue evaluation
metrics, little thought is given to evaluating dialogues other than in English. At the same time …
metrics, little thought is given to evaluating dialogues other than in English. At the same time …
Evaluating open-domain dialogues in latent space with next sentence prediction and mutual information
The long-standing one-to-many issue of the open-domain dialogues poses significant
challenges for automatic evaluation methods, ie, there may be multiple suitable responses …
challenges for automatic evaluation methods, ie, there may be multiple suitable responses …
Automatic evaluation and moderation of open-domain dialogue systems
The development of Open-Domain Dialogue Systems (ODS) is a trending topic due to the
large number of research challenges, large societal and business impact, and advances in …
large number of research challenges, large societal and business impact, and advances in …
Synthesizing adversarial negative responses for robust response ranking and evaluation
Open-domain neural dialogue models have achieved high performance in response ranking
and evaluation tasks. These tasks are formulated as a binary classification of responses …
and evaluation tasks. These tasks are formulated as a binary classification of responses …