Language-driven physics-based scene synthesis and editing via feature splatting

RZ Qiu, G Yang, W Zeng, X Wang - European Conference on Computer …, 2024 - Springer
Scene representations using 3D Gaussian primitives have produced excellent results in
modeling the appearance of static and dynamic 3D scenes. Many graphics applications …

Graspsplats: Efficient manipulation with 3d feature splatting

M Ji, RZ Qiu, X Zou, X Wang - ar** of object parts is crucial for
practical applications and is becoming prevalent with recent advances in Vision-Language …

Sim-to-real transfer via 3d feature fields for vision-and-language navigation

Z Wang, X Li, J Yang, Y Liu, S Jiang - arxiv preprint arxiv:2406.09798, 2024 - arxiv.org
Vision-and-language navigation (VLN) enables the agent to navigate to a remote location in
3D environments following the natural language instruction. In this field, the agent is usually …

[PDF][PDF] Gendp: 3d semantic fields for category-level generalizable diffusion policy

Y Wang, G Yin, B Huang, T Kelestemur… - … Annual Conference on …, 2024 - robopil.github.io
• Generalizable Representation: Some existing semantic fields or representations are
trained on small-scale datasets and are challenging to transfer to novel scenes or object …

Neural Fields in Robotics: A Survey

MZ Irshad, M Comi, YC Lin, N Heppert… - arxiv preprint arxiv …, 2024 - arxiv.org
Neural Fields have emerged as a transformative approach for 3D scene representation in
computer vision and robotics, enabling accurate inference of geometry, 3D semantics, and …

Nl-slam for oc-vln: Natural language grounded slam for object-centric vln

S Raychaudhuri, D Ta, K Ashton, AX Chang… - arxiv preprint arxiv …, 2024 - arxiv.org
Landmark-based navigation (eg go to the wooden desk) and relative positional navigation
(eg move 5 meters forward) are distinct navigation challenges solved very differently in …

TidyBot++: An Open-Source Holonomic Mobile Manipulator for Robot Learning

J Wu, W Chong, R Holmberg, A Prasad, Y Gao… - arxiv preprint arxiv …, 2024 - arxiv.org
Exploiting the promise of recent advances in imitation learning for mobile manipulation will
require the collection of large numbers of human-guided demonstrations. This paper …

Challenges and Opportunities for Large-Scale Exploration with Air-Ground Teams using Semantics

F Cladera, ID Miller, Z Ravichandran, V Murali… - arxiv preprint arxiv …, 2024 - arxiv.org
One common and desirable application of robots is exploring potentially hazardous and
unstructured environments. Air-ground collaboration offers a synergistic approach to …

Environment Modeling for Service Robots From a Task Execution Perspective

Y Zhang, G Tian, CH Zhang, C Hua, W Ding… - arxiv preprint arxiv …, 2025 - arxiv.org
Service robots are increasingly entering the home to provide domestic tasks for residents.
However, when working in an open, dynamic, and unstructured home environment, service …

From Words to Poses: Enhancing Novel Object Pose Estimation with Vision Language Models

T Pulli, S Thalhammer, S Schwaiger… - arxiv preprint arxiv …, 2024 - arxiv.org
Robots are increasingly envisioned to interact in real-world scenarios, where they must
continuously adapt to new situations. To detect and grasp novel objects, zero-shot pose …