Generalized planning in pddl domains with pretrained large language models
Recent work has considered whether large language models (LLMs) can function as
planners: given a task, generate a plan. We investigate whether LLMs can serve as …
planners: given a task, generate a plan. We investigate whether LLMs can serve as …
Goal-conditioned reinforcement learning with imagined subgoals
Goal-conditioned reinforcement learning endows an agent with a large variety of skills, but it
often struggles to solve tasks that require more temporally extended reasoning. In this work …
often struggles to solve tasks that require more temporally extended reasoning. In this work …
Mindstorms in natural language-based societies of mind
Both Minsky's" society of mind" and Schmidhuber's" learning to think" inspire diverse
societies of large multimodal neural networks (NNs) that solve problems by interviewing …
societies of large multimodal neural networks (NNs) that solve problems by interviewing …
Latent plans for task-agnostic offline reinforcement learning
Everyday tasks of long-horizon and comprising a sequence of multiple implicit subtasks still
impose a major challenge in offline robot control. While a number of prior methods aimed to …
impose a major challenge in offline robot control. While a number of prior methods aimed to …
Hierarchical planning for long-horizon manipulation with geometric and symbolic scene graphs
We present a visually grounded hierarchical planning algorithm for long-horizon
manipulation tasks. Our algorithm offers a joint framework of neuro-symbolic task planning …
manipulation tasks. Our algorithm offers a joint framework of neuro-symbolic task planning …
Bits: Bi-level imitation for traffic simulation
Simulation is the key to scaling up validation and verification for robotic systems such as
autonomous vehicles. Despite advances in high-fidelity physics and sensor simulation, a …
autonomous vehicles. Despite advances in high-fidelity physics and sensor simulation, a …
World model as a graph: Learning latent landmarks for planning
Planning, the ability to analyze the structure of a problem in the large and decompose it into
interrelated subproblems, is a hallmark of human intelligence. While deep reinforcement …
interrelated subproblems, is a hallmark of human intelligence. While deep reinforcement …
Clockwork variational autoencoders
Deep learning has enabled algorithms to generate realistic images. However, accurately
predicting long video sequences requires understanding long-term dependencies and …
predicting long video sequences requires understanding long-term dependencies and …
Learning task informed abstractions
Current model-based reinforcement learning methods struggle when operating from
complex visual scenes due to their inability to prioritize task-relevant features. To mitigate …
complex visual scenes due to their inability to prioritize task-relevant features. To mitigate …
Generalization with lossy affordances: Leveraging broad offline data for learning visuomotor tasks
The use of broad datasets has proven to be crucial for generalization for a wide range of
fields. However, how to effectively make use of diverse multi-task data for novel downstream …
fields. However, how to effectively make use of diverse multi-task data for novel downstream …