Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning
Multimodal task specification is essential for enhanced robotic performance, where\textit
{Cross-modality Alignment} enables the robot to holistically understand complex task …
{Cross-modality Alignment} enables the robot to holistically understand complex task …