Follow
Reuben Tan
Title
Cited by
Cited by
Year
Learning similarity conditions without explicit supervision
R Tan, MI Vasileva, K Saenko, BA Plummer
Proceedings of the IEEE/CVF international conference on computer vision …, 2019
1092019
Logan: Latent graph co-attention network for weakly-supervised video moment retrieval
R Tan, H Xu, K Saenko, BA Plummer
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2021
922021
Detecting cross-modal inconsistency to defend against neural fake news
R Tan, BA Plummer, K Saenko
Empirical Methods in Natural Language Processing (EMNLP) 2020, 2081–2106, 2020
772020
Language features matter: Effective language representations for vision-language tasks
A Burns, R Tan, K Saenko, S Sclaroff, BA Plummer
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
362019
Look at what i’m doing: Self-supervised spatial grounding of narrations in instructional videos
R Tan, B Plummer, K Saenko, H Jin, B Russell
Advances in Neural Information Processing Systems 34, 14476-14487, 2021
232021
wman: Weakly-supervised moment alignment network for text-based video segment retrieval
R Tan, H Xu, K Saenko, BA Plummer
182019
Koala: Key frame-conditioned long video-LLM
R Tan, X Sun, P Hu, J Wang, H Deilamsalehy, BA Plummer, B Russell, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
172024
Language-Guided Audio-Visual Source Separation via Trimodal Consistency
R Tan, A Ray, A Burns, BA Plummer, J Salamon, O Nieto, B Russell, ...
The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023, 2023
162023
Latent action pretraining from videos
S Ye, J Jang, B Jeon, S Joo, J Yang, B Peng, A Mandlekar, R Tan, ...
arXiv preprint arXiv:2410.11758, 2024
82024
NewsStories: Illustrating articles with visual summaries
R Tan, BA Plummer, K Saenko, JP Lewis, A Sud, T Leung
European Conference on Computer Vision 2022 (Springer, Cham), 644-661, 2022
72022
Temporalbench: Benchmarking fine-grained temporal understanding for multimodal video models
M Cai, R Tan, J Zhang, B Zou, K Zhang, F Yao, F Zhu, J Gu, Y Zhong, ...
arXiv preprint arXiv:2410.10818, 2024
62024
Multiscale video pretraining for long-term activity forecasting
R Tan, M De Lange, M Iuzzolino, BA Plummer, K Saenko, K Ridgeway, ...
arXiv preprint arXiv:2307.12854, 2023
62023
Temporalbench: Towards fine-grained temporal understanding for multimodal video models
M Cai, R Tan, J Zhang, B Zou, K Zhang, F Yao, F Zhu, J Gu, Y Zhong, ...
32024
Socratis: Are large multimodal models emotionally aware?
K Deng, A Ray, R Tan, S Gabriel, BA Plummer, K Saenko
ICCV 2023 Wecia, 2023
32023
EgoAdapt: A multi-stream evaluation study of adaptation to real-world egocentric user video
M De Lange, H Eghbalzadeh, R Tan, M Iuzzolino, F Meier, K Ridgeway
arXiv preprint arXiv:2307.05784, 2023
22023
SAT: Spatial Aptitude Training for Multimodal Language Models
A Ray, J Duan, R Tan, D Bashkirova, R Hendrix, K Ehsani, A Kembhavi, ...
arXiv preprint arXiv:2412.07755, 2024
2024
Localization Of Narrations In Image Data
H Jin, B Russell, RXH Tan
US Patent App. 17/499,193, 2023
2023
WMAN: WEAKLY-SUPERVISED MOMENT ALIGNMENT NETWORK FOR TEXT-BASED VIDEO SEGMENT RE
R Tan, H Xu, K Saenko, BA Plummer
arXiv preprint arXiv:1909.13784, 2019
2019
Analytic system for measuring bat speed
R Tan
2018
R2D3: Imparting Spatial Reasoning by Reconstructing 3D Scenes from 2D Images
A Ray, D Bashkirova, R Tan, KH Zeng, BA Plummer, R Krishna, K Saenko
The system can't perform the operation now. Try again later.
Articles 1–20