What makes multi-modal learning better than single (provably) Y Huang, C Du, Z Xue, X Chen, H Zhao, L Huang Advances in Neural Information Processing Systems 34, 10944-10956, 2021 | 289 | 2021 |
FUTR3D: A Unified Sensor Fusion Framework for 3D Detection X Chen, T Zhang, Y Wang, Y Wang, H Zhao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 264 | 2022 |
ViP3D: End-to-end Visual Trajectory Prediction via 3D Agent Queries J Gu, C Hu, T Zhang, X Chen, Y Wang, Y Wang, H Zhao IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022 | 98 | 2022 |
MUTR3D: A Multi-camera Tracking Framework via 3D-to-2D Queries T Zhang, X Chen, Y Wang, Y Wang, H Zhao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 77 | 2022 |
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer X Chen, Z Liu, H Tang, L Yi, H Zhao, S Han IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023 | 49 | 2023 |