Reuben Tan

Cited by

	All	Since 2020
Citations	423	420
h-index	8	8
i10-index	8	8

120

20192020202120222023202420253 30 60 90 101 115 23

Public access

View all

6 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Kate SaenkoBoston UniversityVerified email at bu.edu
Bryan A PlummerBoston UniversityVerified email at bu.edu
Bryan RussellResearcher, AdobeVerified email at adobe.com
Huijuan XuAssistant Professor, CSE Dept., Pennsylvania State UniversityVerified email at psu.edu
Andrea BurnsGoogle DeepMindVerified email at bu.edu
Mariya I. VasilevaAWSVerified email at illinois.edu
Stan SclaroffDean of Arts & Sciences, Professor of Computer Science, Boston UniversityVerified email at bu.edu
Jianfeng GaoMicrosoft Research, RedmondVerified email at microsoft.com
Hailin JinSenior Principal Scientist, Adobe ResearchVerified email at adobe.com
Justin SalamonSenior Research Scientist, Adobe ResearchVerified email at adobe.com
Oriol NietoSenior Research Engineer at AdobeVerified email at adobe.com
Jianwei YangPrincipal Researcher, Microsoft Research, RedmondVerified email at microsoft.com
Yong Jae LeeAssociate Professor of Computer Sciences, UW-MadisonVerified email at wisc.edu
Yuzhang ShangIllinois Institute of TechnologyVerified email at hawk.iit.edu
Jing GuPh.D. student, University of California, Santa CruzVerified email at ucsc.edu
Fangrui ZhuNortheastern UniversityVerified email at northeastern.edu
Mu CaiFinal-year PhD Student, University of Wisconsin-MadisonVerified email at cs.wisc.edu
Matthias De LangeSenior AI Researcher @Techwolf | PhDVerified email at techwolf.ai
Karl RidgewayFacebookVerified email at fb.com
Michael Louis IuzzolinoMetaVerified email at meta.com

Reuben Tan

Boston University

Verified email at bu.edu

Computer Vision Video Understanding Multimodal Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Learning similarity conditions without explicit supervision R Tan, MI Vasileva, K Saenko, BA Plummer Proceedings of the IEEE/CVF international conference on computer vision …, 2019	109	2019
Logan: Latent graph co-attention network for weakly-supervised video moment retrieval R Tan, H Xu, K Saenko, BA Plummer Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2021	92	2021
Detecting cross-modal inconsistency to defend against neural fake news R Tan, BA Plummer, K Saenko Empirical Methods in Natural Language Processing (EMNLP) 2020, 2081–2106, 2020	77	2020
Language features matter: Effective language representations for vision-language tasks A Burns, R Tan, K Saenko, S Sclaroff, BA Plummer Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019	36	2019
Look at what i’m doing: Self-supervised spatial grounding of narrations in instructional videos R Tan, B Plummer, K Saenko, H Jin, B Russell Advances in Neural Information Processing Systems 34, 14476-14487, 2021	23	2021
wman: Weakly-supervised moment alignment network for text-based video segment retrieval R Tan, H Xu, K Saenko, BA Plummer	18	2019
Koala: Key frame-conditioned long video-LLM R Tan, X Sun, P Hu, J Wang, H Deilamsalehy, BA Plummer, B Russell, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	17	2024
Language-Guided Audio-Visual Source Separation via Trimodal Consistency R Tan, A Ray, A Burns, BA Plummer, J Salamon, O Nieto, B Russell, ... The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023, 2023	16	2023
Latent action pretraining from videos S Ye, J Jang, B Jeon, S Joo, J Yang, B Peng, A Mandlekar, R Tan, ... arXiv preprint arXiv:2410.11758, 2024	8	2024
NewsStories: Illustrating articles with visual summaries R Tan, BA Plummer, K Saenko, JP Lewis, A Sud, T Leung European Conference on Computer Vision 2022 (Springer, Cham), 644-661, 2022	7	2022
Temporalbench: Benchmarking fine-grained temporal understanding for multimodal video models M Cai, R Tan, J Zhang, B Zou, K Zhang, F Yao, F Zhu, J Gu, Y Zhong, ... arXiv preprint arXiv:2410.10818, 2024	6	2024
Multiscale video pretraining for long-term activity forecasting R Tan, M De Lange, M Iuzzolino, BA Plummer, K Saenko, K Ridgeway, ... arXiv preprint arXiv:2307.12854, 2023	6	2023
Temporalbench: Towards fine-grained temporal understanding for multimodal video models M Cai, R Tan, J Zhang, B Zou, K Zhang, F Yao, F Zhu, J Gu, Y Zhong, ...	3	2024
Socratis: Are large multimodal models emotionally aware? K Deng, A Ray, R Tan, S Gabriel, BA Plummer, K Saenko ICCV 2023 Wecia, 2023	3	2023
EgoAdapt: A multi-stream evaluation study of adaptation to real-world egocentric user video M De Lange, H Eghbalzadeh, R Tan, M Iuzzolino, F Meier, K Ridgeway arXiv preprint arXiv:2307.05784, 2023	2	2023
SAT: Spatial Aptitude Training for Multimodal Language Models A Ray, J Duan, R Tan, D Bashkirova, R Hendrix, K Ehsani, A Kembhavi, ... arXiv preprint arXiv:2412.07755, 2024		2024
Localization Of Narrations In Image Data H Jin, B Russell, RXH Tan US Patent App. 17/499,193, 2023		2023
WMAN: WEAKLY-SUPERVISED MOMENT ALIGNMENT NETWORK FOR TEXT-BASED VIDEO SEGMENT RE R Tan, H Xu, K Saenko, BA Plummer arXiv preprint arXiv:1909.13784, 2019		2019
Analytic system for measuring bat speed R Tan		2018
R2D3: Imparting Spatial Reasoning by Reconstructing 3D Scenes from 2D Images A Ray, D Bashkirova, R Tan, KH Zeng, BA Plummer, R Krishna, K Saenko

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors