Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning

J Li, Z Wang, J Zheng, X Zhou, G Wang, G Song… - arxiv preprint arxiv …, 2024‏ - arxiv.org
Multimodal task specification is essential for enhanced robotic performance, where\textit
{Cross-modality Alignment} enables the robot to holistically understand complex task …