Følg
Tengda Han
Tengda Han
Google DeepMind | VGG, University of Oxford
Verificeret mail på robots.ox.ac.uk - Startside
Titel
Citeret af
Citeret af
År
Flamingo: a visual language model for few-shot learning
JB Alayrac, J Donahue, P Luc, A Miech, I Barr, Y Hasson, K Lenc, ...
Advances in neural information processing systems 35, 23716-23736, 2022
38482022
Self-supervised Co-training for Video Representation Learning
T Han, W Xie, A Zisserman
Conference on Neural Information Processing Systems (NeurIPS), 2020
4722020
Video representation learning by dense predictive coding
T Han, W Xie, A Zisserman
Workshop on Large-scale Holistic Video Understanding, ICCV, 2019
4412019
Prompting visual-language models for efficient video understanding
C Ju, T Han, K Zheng, Y Zhang, W Xie
European Conference on Computer Vision, 105-124, 2022
4292022
Memory-augmented Dense Predictive Coding for Video Representation Learning
T Han, W Xie, A Zisserman
European Conference on Computer Vision (ECCV), 2020, 2020
2892020
Whisperx: Time-accurate speech transcription of long-form audio
M Bain, J Huh, T Han, A Zisserman
arXiv preprint arXiv:2303.00747, 2023
2352023
Temporal Alignment Networks for Long-term Video
T Han, W Xie, A Zisserman
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022, 2022
1002022
Autoad: Movie description in context
T Han, M Bain, A Nagrani, G Varol, W Xie, A Zisserman
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
612023
Human pose forecasting via deep markov models
S Toyer, A Cherian, T Han, S Gould
2017 International Conference on Digital Image Computing: Techniques and …, 2017
582017
Autoad ii: The sequel-who, when, and what in movie audio description
T Han, M Bain, A Nagrani, G Varol, W Xie, A Zisserman
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
402023
Open-world text-specified object counting
N Amini-Naieni, K Amini-Naieni, T Han, A Zisserman
arXiv preprint arXiv:2306.01851, 2023
242023
Prompt Generation Networks for Input-Space Adaptation of Frozen Vision Transformers
J Loedeman, MC Stol, T Han, YM Asano
arXiv preprint arXiv:2210.06466, 2022
222022
Human action forecasting by learning task grammars
T Han, J Wang, A Cherian, S Gould
arXiv preprint arXiv:1709.06391, 2017
202017
Autoad iii: The prequel-back to the pixels
T Han, M Bain, A Nagrani, G Varol, W Xie, A Zisserman
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
162024
Turbo training with token dropout
T Han, W Xie, A Zisserman
arXiv preprint arXiv:2210.04889, 2022
102022
Autoad-zero: A training-free framework for zero-shot audio description
J Xie, T Han, M Bain, A Nagrani, G Varol, W Xie, A Zisserman
Proceedings of the Asian Conference on Computer Vision, 2265-2281, 2024
62024
A strong baseline for temporal video-text alignment
Z Li, Q Chen, T Han, Y Zhang, Y Wang, W Xie
arXiv preprint arXiv:2312.14055, 2023
52023
CountGD: Multi-modal open-world counting
N Amini-Naieni, T Han, A Zisserman
Advances in Neural Information Processing Systems 37, 48810-48837, 2025
32025
It's Just Another Day: Unique Video Captioning by Discriminative Prompting
T Perrett, T Han, D Damen, A Zisserman
Proceedings of the Asian Conference on Computer Vision, 232-249, 2024
22024
Multi-sentence Grounding for Long-Term Instructional Video
Z Li, Q Chen, T Han, Y Zhang, Y Wang, W Xie
European Conference on Computer Vision, 2024
22024
Systemet kan ikke foretage handlingen nu. Prøv igen senere.
Artikler 1–20