Self-Supervised Video Forensics by Audio-Visual Anomaly Detection C Feng, Z Chen, A Owens CVPR 2023, 10491-10503, 2023 | 63 | 2023 |
Knowledge solver: Teaching llms to search for domain knowledge from knowledge graphs C Feng, X Zhang, Z Fei 2024 AAAI workshop on Responsible Language Models (ReLM), 2023 | 53 | 2023 |
Ava-avd: Audio-visual speaker diarization in the wild EZ Xu, Z Song, C Feng, M Ye, MZ Shou ACM Multimedia 2022, 2021 | 50 | 2021 |
Binding touch to everything: Learning unified multimodal tactile representations F Yang, C Feng, Z Chen, H Park, D Wang, Y Dou, Z Zeng, X Chen, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 38 | 2024 |
Vision-flan: Scaling human-labeled tasks in visual instruction tuning Z Xu, C Feng, R Shao, T Ashby, Y Shen, D Jin, Y Cheng, Q Wang, ... ACL 2024 (Findings), 2024 | 26 | 2024 |
Neurobind: Towards unified multimodal representations for neural signals F Yang, C Feng, D Wang, T Wang, Z Zeng, Z Xu, H Park, P Ji, H Zhao, Y Li, ... arXiv preprint arXiv:2407.14020, 2024 | 7 | 2024 |
This&that: Language-gesture controlled video generation for robot planning B Wang, N Sridhar, C Feng, M Van der Merwe, A Fishman, N Fazeli, ... arXiv preprint arXiv:2407.05530, 2024 | 5 | 2024 |
GPS as a Control Signal for Image Generation C Feng, Z Chen, A Holynski, AA Efros, A Owens arXiv preprint arXiv:2501.12390, 2025 | | 2025 |