What makes multi-modal learning better than single (provably) Y Huang, C Du, Z Xue, X Chen, H Zhao, L Huang Advances in Neural Information Processing Systems 34, 10944-10956, 2021 | 289 | 2021 |
On feature decorrelation in self-supervised learning T Hua, W Wang, Z Xue, S Ren, Y Wang, H Zhao Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 222 | 2021 |
Ego-exo4d: Understanding skilled human activity from first-and third-person perspectives K Grauman, A Westbury, L Torresani, K Kitani, J Malik, T Afouras, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 126 | 2024 |
Dynamic multimodal fusion Z Xue, R Marculescu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 68 | 2023 |
Co-advise: Cross inductive bias distillation S Ren, Z Gao, T Hua, Z Xue, Y Tian, S He, H Zhao Proceedings of the IEEE/CVF Conference on computer vision and pattern …, 2022 | 65 | 2022 |
The modality focusing hypothesis: Towards understanding crossmodal knowledge distillation Z Xue, Z Gao, S Ren, H Zhao International Conference on Learning Representations (ICLR), 2023 | 39 | 2023 |
Multimodal knowledge expansion Z Xue, S Ren, Z Gao, H Zhao Proceedings of the IEEE/CVF International Conference on Computer Vision, 854-863, 2021 | 29 | 2021 |
Learning fine-grained view-invariant representations from unpaired ego-exo videos via temporal alignment ZS Xue, K Grauman Advances in Neural Information Processing Systems 36, 53688-53710, 2023 | 24 | 2023 |
Learning object state changes in videos: An open-world perspective Z Xue, K Ashutosh, K Grauman Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 15 | 2024 |
Sugar: Efficient subgraph-level training via resource-aware graph partitioning Z Xue, Y Yang, R Marculescu IEEE Transactions on Computers, 2023 | 13 | 2023 |
Egocentric video task translation Z Xue, Y Song, K Grauman, L Torresani Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 13 | 2023 |
Put myself in your shoes: Lifting the egocentric perspective from exocentric videos M Luo, Z Xue, A Dimakis, K Grauman European Conference on Computer Vision, 407-425, 2024 | 11 | 2024 |
Detours for navigating instructional videos K Ashutosh, Z Xue, T Nagarajan, K Grauman Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 6 | 2024 |
HOI-Swap: Swapping Objects in Videos with Hand-Object Interaction Awareness Z Xue, M Luo, C Chen, K Grauman arXiv preprint arXiv:2406.07754, 2024 | 5 | 2024 |
Anytime depth estimation with limited sensing and computation capabilities on mobile devices Y Yang, Z Xue, R Marculescu Conference on Robot Learning, 609-618, 2022 | 4 | 2022 |
On feature decorrelation in selfsupervised learning. 2019 IEEE T Hua, W Wang, Z Xue, Y Wang, S Ren, H Zhao CVF International Conference on Computer Vision (ICCV) 3, 2021 | 4 | 2021 |
Egocentric video task translation@ ego4d challenge 2022 Z Xue, Y Song, K Grauman, L Torresani arXiv preprint arXiv:2302.01891, 2023 | 3 | 2023 |
Sampling graphlets of multiplex networks: a restricted random walk approach S Jiao, Z Xue, X Chen, Y Xu ACM Transactions on the Web (TWEB) 15 (4), 1-31, 2021 | 3 | 2021 |
Action2sound: Ambient-aware generation of action sounds from egocentric videos C Chen, P Peng, A Baid, Z Xue, WN Hsu, D Harwath, K Grauman European Conference on Computer Vision, 277-295, 2024 | 2 | 2024 |
Training-free robust multimodal learning via sample-wise jacobian regularization Z Gao, S Ren, Z Xue, S Li, H Zhao arXiv preprint arXiv:2204.02485, 2022 | 1 | 2022 |