Open-domain conversational agents: Current progress, open problems, and future directions

S Roller, YL Boureau, J Weston, A Bordes… - arXiv preprint arXiv …, 2020 - arxiv.org
We present our view of what is necessary to build an engaging open-domain conversational
agent: covering the qualities of such an agent, the pieces of the puzzle that have been built …

Learning to execute actions or ask clarification questions

Z Shi, Y Feng, A Lipani - arXiv preprint arXiv:2204.08373, 2022 - arxiv.org
Collaborative tasks are ubiquitous activities where a form of communication is required in
order to reach a joint goal. Collaborative building is one of such tasks. We wish to develop …

Craft an iron sword: Dynamically generating interactive game characters by prompting large language models tuned on code

R Volum, S Rao, M Xu, G DesGarennes… - Proceedings of the …, 2022 - aclanthology.org
Non-Player Characters (NPCs) significantly enhance the player experience in many
games. Historically, players' interactions with NPCs have tended to be highly scripted, to be …

Visual language navigation: A survey and open challenges

SM Park, YG Kim - Artificial Intelligence Review, 2023 - Springer
With the recent development of deep learning, AI models are widely used in various
domains. AI models show good performance for definite tasks such as image classification …

Learning rewards from linguistic feedback

TR Sumers, MK Ho, RD Hawkins… - Proceedings of the …, 2021 - ojs.aaai.org
We explore unconstrained natural language feedback as a learning signal for artificial
agents. Humans use rich and varied language to teach, yet most prior work on interactive …

Artificial intelligence in intelligent vehicles: recent advances and future directions

T Zhang, T Zhao, Y Qin, S Liu - Journal of the Chinese Institute of …, 2023 - Taylor & Francis
AI has been widely used in intelligent transportation systems, autonomous
driving and automated vehicles in particular. Intelligent vehicles in the future will be a …

LEBP--Language Expectation & Binding Policy: A Two-Stream Framework for Embodied Vision-and-Language Interaction Task Learning Agents

H Liu, Y Liu, H He, H Yang - arXiv preprint arXiv:2203.04637, 2022 - arxiv.org
People always desire an embodied agent that can perform a task by understanding
language instruction. Moreover, they also want to monitor and expect agents to understand …

Transforming human-centered AI collaboration: Redefining embodied agents capabilities through interactive grounded language instructions

S Mohanty, N Arabzadeh, J Kiseleva, A Zholus… - arXiv preprint arXiv …, 2023 - arxiv.org
Human intelligence's adaptability is remarkable, allowing us to adjust to new tasks and
multi-modal environments swiftly. This skill is evident from a young age as we acquire new …

CraftAssist: A framework for dialogue-enabled interactive agents

J Gray, K Srinet, Y Jernite, H Yu, Z Chen, D Guo… - arXiv preprint arXiv …, 2019 - arxiv.org
This paper describes an implementation of a bot assistant in Minecraft, and the tools and
platform allowing players to interact with the bot and to record those interactions. The …

The ToMCAT dataset

A Pyarelal, E Duong, C Shibu… - Advances in …, 2023 - proceedings.neurips.cc
We present a rich, multimodal dataset consisting of data from 40 teams of three humans
conducting simulated urban search-and-rescue (SAR) missions in a Minecraft-based …