Recasting Generic Pretrained Vision Transformers As Object-Centric Scene Encoders For Manipulation Policies

J Qian, A Panagopoulos, D Jayaraman - arXiv preprint arXiv:2405.15916, 2024 - arxiv.org
Generic re-usable pre-trained image representation encoders have become a standard
component of methods for many computer vision tasks. As visual representations for robots …

Hybrid quantum-classical 3D object detection using multi-channel quantum convolutional neural network

EJ Roh, JY Shim, J Kim, S Park - The Journal of Supercomputing, 2025 - Springer
3D object detection has recently shown remarkable progress in the computer vision
field, enabling advanced understanding of the surrounding environment by identifying …