Learning quadrotor dynamics for precise, safe, and agile flight control

A Saviolo, G Loianno - Annual Reviews in Control, 2023 - Elsevier
This article reviews the state-of-the-art modeling and control techniques for aerial robots
such as quadrotor systems and presents several future research directions in this area. The …

Tapir: Tracking any point with per-frame initialization and temporal refinement

C Doersch, Y Yang, M Vecerik… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a novel model for Tracking Any Point (TAP) that effectively tracks any queried
point on any physical surface throughout a video sequence. Our approach employs two …

Masked world models for visual control

Y Seo, D Hafner, H Liu, F Liu, S James… - … on Robot Learning, 2023 - proceedings.mlr.press
Visual model-based reinforcement learning (RL) has the potential to enable sample-efficient
robot learning from visual observations. Yet the current approaches typically train a single …

Tap-vid: A benchmark for tracking any point in a video

C Doersch, A Gupta, L Markeeva… - Advances in …, 2022 - proceedings.neurips.cc
Generic motion understanding from video involves not only tracking objects, but also
perceiving how their surfaces deform and move. This information is useful to make …

Rekep: Spatio-temporal reasoning of relational keypoint constraints for robotic manipulation

W Huang, C Wang, Y Li, R Zhang, L Fei-Fei - arxiv preprint arxiv …, 2024 - arxiv.org
Representing robotic manipulation tasks as constraints that associate the robot and the
environment is a promising way to encode desired robot behaviors. However, it remains …

Nerf-supervision: Learning dense object descriptors from neural radiance fields

L Yen-Chen, P Florence, JT Barron… - … on robotics and …, 2022 - ieeexplore.ieee.org
Thin, reflective objects such as forks and whisks are common in our daily lives, but they are
particularly chal-lenging for robot perception because it is hard to reconstruct them using …

Algorithms and systems for manipulating multiple objects

Z Pan, A Zeng, Y Li, J Yu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Robot manipulation of multiple objects is an important topic for applications including
warehouse automation, service robots performing cleaning, and large-scale object sorting …

The surprising effectiveness of representation learning for visual imitation

J Pari, NM Shafiullah, SP Arunachalam… - arxiv preprint arxiv …, 2021 - arxiv.org
While visual imitation learning offers one of the most effective ways of learning from visual
demonstrations, generalizing from them requires either hundreds of diverse demonstrations …

3d neural scene representations for visuomotor control

Y Li, S Li, V Sitzmann, P Agrawal… - Conference on Robot …, 2022 - proceedings.mlr.press
Humans have a strong intuitive understanding of the 3D environment around us. The mental
model of the physics in our brain applies to objects of different materials and enables us to …

Learning multi-object dynamics with compositional neural radiance fields

D Driess, Z Huang, Y Li, R Tedrake… - Conference on robot …, 2023 - proceedings.mlr.press
We present a method to learn compositional multi-object dynamics models from image
observations based on implicit object encoders, Neural Radiance Fields (NeRFs), and …