Recent advances in deep learning based dialogue systems: A systematic survey

J Ni, T Young, V Pandelea, F Xue… - Artificial intelligence review, 2023 - Springer
Dialogue systems are a popular natural language processing (NLP) task as it is promising in
real-life applications. It is also a complicated task since many NLP tasks deserving study are …

Visually grounded language learning: a review of language games, datasets, tasks, and models

A Suglia, I Konstas, O Lemon - Journal of Artificial Intelligence Research, 2024 - jair.org
In recent years, several machine learning models have been proposed. They are trained
with a language modelling objective on large-scale text-only data. With such pretraining …

Grounding'grounding'in NLP

KR Chandu, Y Bisk, AW Black - arxiv preprint arxiv:2106.02192, 2021 - arxiv.org
The NLP community has seen substantial recent interest in grounding to facilitate interaction
between language technologies and the world. However, as a community, we use the term …

History for visual dialog: Do we really need it?

S Agarwal, T Bui, JY Lee, I Konstas… - arxiv preprint arxiv …, 2020 - arxiv.org
Visual Dialog involves" understanding" the dialog history (what has been discussed
previously) and the current question (what is asked), in addition to grounding information in …

Dealing with semantic underspecification in multimodal NLP

S Pezzelle - arxiv preprint arxiv:2306.05240, 2023 - arxiv.org
Intelligent systems that aim at mastering language as humans do must deal with its semantic
underspecification, namely, the possibility for a linguistic signal to convey only part of the …

Storytelling with dialogue: A critical role dungeons and dragons dataset

R Rameshkumar, P Bailey - … of the 58th Annual Meeting of the …, 2020 - aclanthology.org
This paper describes the Critical Role Dungeons and Dragons Dataset (CRD3) and related
analyses. Critical Role is an unscripted, live-streamed show where a fixed group of people …

Pragmatics in language grounding: Phenomena, tasks, and modeling approaches

D Fried, N Tomlin, J Hu, R Patel… - arxiv preprint arxiv …, 2022 - arxiv.org
People rely heavily on context to enrich meaning beyond what is literally said, enabling
concise but effective communication. To interact successfully and naturally with people, user …

Effect of visual extensions on natural language understanding in vision-and-language models

T Iki, A Aizawa - arxiv preprint arxiv:2104.08066, 2021 - arxiv.org
A method for creating a vision-and-language (V&L) model is to extend a language model
through structural modifications and V&L pre-training. Such an extension aims to make a …

Meetup! a corpus of joint activity dialogues in a visual environment

N Ilinykh, S Zarrieß, D Schlangen - arxiv preprint arxiv:1907.05084, 2019 - arxiv.org
Building computer systems that can converse about their visual environment is one of the
oldest concerns of research in Artificial Intelligence and Computational Linguistics (see, for …

Refer, reuse, reduce: Generating subsequent references in visual and conversational contexts

E Takmaz, M Giulianelli, S Pezzelle, A Sinclair… - arxiv preprint arxiv …, 2020 - arxiv.org
Dialogue participants often refer to entities or situations repeatedly within a conversation,
which contributes to its cohesiveness. Subsequent references exploit the common ground …