Interactive natural language processing
Interactive Natural Language Processing (iNLP) has emerged as a novel paradigm within
the field of NLP, aimed at addressing limitations in existing frameworks while aligning with …
the field of NLP, aimed at addressing limitations in existing frameworks while aligning with …
Vision-and-language navigation today and tomorrow: A survey in the era of foundation models
Vision-and-Language Navigation (VLN) has gained increasing attention over recent years
and many approaches have emerged to advance their development. The remarkable …
and many approaches have emerged to advance their development. The remarkable …
Open-ended instructable embodied agents with memory-augmented large language models
Pre-trained and frozen LLMs can effectively map simple scene re-arrangement instructions
to programs over a robot's visuomotor functions through appropriate few-shot example …
to programs over a robot's visuomotor functions through appropriate few-shot example …
Graph Learning for Numeric Planning
Graph learning is naturally well suited for use in symbolic, object-centric planning due to its
ability to exploit relational structures exhibited in planning domains and to take as input …
ability to exploit relational structures exhibited in planning domains and to take as input …
Egocentric planning for scalable embodied task achievement
Embodied agents face significant challenges when tasked with performing actions in diverse
environments, particularly in generalizing across object types and executing suitable actions …
environments, particularly in generalizing across object types and executing suitable actions …
Human–robot dialogue annotation for multi-modal common ground
In this paper, we describe the development of symbolic representations annotated on
human–robot dialogue data to make dimensions of meaning accessible to autonomous …
human–robot dialogue data to make dimensions of meaning accessible to autonomous …
MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making
Long-term memory is significant for agents, in which insights play a crucial role. However,
the emergence of irrelevant insight and the lack of general insight can greatly undermine the …
the emergence of irrelevant insight and the lack of general insight can greatly undermine the …
Vlm agents generate their own memories: Distilling experience into embodied programs of thought
Large-scale generative language and vision-language models (LLMs and VLMs) excel in
few-shot in-context learning for decision making and instruction following. However, they …
few-shot in-context learning for decision making and instruction following. However, they …
Vlm agents generate their own memories: Distilling experience into embodied programs
Large-scale generative language and vision-language models excel in in-context learning
for decision making. However, they require high-quality exemplar demonstrations to be …
for decision making. However, they require high-quality exemplar demonstrations to be …
HELPER-X: A Unified Instructable Embodied Agent to Tackle Four Interactive Vision-Language Domains with Memory-Augmented Language Models
Recent research on instructable agents has used memory-augmented Large Language
Models (LLMs) as task planners, a technique that retrieves language-program examples …
Models (LLMs) as task planners, a technique that retrieves language-program examples …