Theo dõi
Yusuf Aytar
Yusuf Aytar
Research Scientist, DeepMind
Email được xác minh tại google.com - Trang chủ
Tiêu đề
Trích dẫn bởi
Trích dẫn bởi
Năm
SoundNet: Learning Sound Representations from Unlabeled Video
Y Aytar, C Vondrick, A Torralba
Neural Information Processing Systems, 2016
13172016
Learning cross-modal embeddings for cooking recipes and food images
A Salvador, N Hynes, Y Aytar, J Marin, F Ofli, I Weber, A Torralba
Proceedings of the IEEE conference on computer vision and pattern …, 2017
7402017
With a little help from my friends: Nearest-neighbor contrastive learning of visual representations
D Dwibedi, Y Aytar, J Tompson, P Sermanet, A Zisserman
Proceedings of the IEEE/CVF international conference on computer vision …, 2021
5402021
Tabula rasa: Model transfer for object category detection
Y Aytar, A Zisserman
2011 international conference on computer vision, 2252-2259, 2011
4532011
Temporal cycle-consistency learning
D Dwibedi, Y Aytar, J Tompson, P Sermanet, A Zisserman
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
3422019
Playing hard exploration games by watching youtube
Y Aytar, T Pfaff, D Budden, T Paine, Z Wang, N De Freitas
Advances in neural information processing systems 31, 2018
3202018
Learning aligned cross-modal representations from weakly aligned data
L Castrejon, Y Aytar, C Vondrick, H Pirsiavash, A Torralba
Proceedings of the IEEE conference on computer vision and pattern …, 2016
2012016
See, hear, and read: Deep aligned representations
Y Aytar, C Vondrick, A Torralba
arXiv preprint arXiv:1706.00932, 2017
1682017
Cross-modal scene networks
Y Aytar, L Castrejon, C Vondrick, H Pirsiavash, A Torralba
IEEE transactions on pattern analysis and machine intelligence 40 (10), 2303 …, 2017
1582017
Scaling data-driven robotics with reward sketching and batch reinforcement learning
S Cabi, SG Colmenarejo, A Novikov, K Konyushkova, S Reed, R Jeong, ...
arXiv preprint arXiv:1909.12200, 2019
1562019
Counting out time: Class agnostic video repetition counting in the wild
D Dwibedi, Y Aytar, J Tompson, P Sermanet, A Zisserman
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020
1502020
How transferable are CNN-based features for age and gender classification?
G Ozbulak, Y Aytar, HK Ekenel
2016 International Conference of the Biometrics Special Interest Group …, 2016
1432016
Sickle cell detection using a smartphone
SM Knowlton, I Sencan, Y Aytar, J Khoory, MM Heeney, IC Ghiran, ...
Scientific reports 5 (1), 15022, 2015
1392015
Tap-vid: A benchmark for tracking any point in a video
C Doersch, A Gupta, L Markeeva, A Recasens, L Smaira, Y Aytar, ...
Advances in Neural Information Processing Systems 35, 13610-13626, 2022
1362022
Genie: Generative interactive environments
J Bruce, MD Dennis, A Edwards, J Parker-Holder, Y Shi, E Hughes, M Lai, ...
Forty-first International Conference on Machine Learning, 2024
1302024
Tapir: Tracking any point with per-frame initialization and temporal refinement
C Doersch, Y Yang, M Vecerik, D Gokay, A Gupta, Y Aytar, J Carreira, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
1282023
Face-to-BMI: Using computer vision to infer body mass index on social media
E Kocabey, M Camurcu, F Ofli, Y Aytar, J Marin, A Torralba, I Weber
Proceedings of the International AAAI Conference on Web and Social Media 11 …, 2017
1062017
Perception test: A diagnostic benchmark for multimodal video models
V Patraucean, L Smaira, A Gupta, A Recasens, L Markeeva, D Banarse, ...
Advances in Neural Information Processing Systems 36, 42748-42761, 2023
1012023
Utilizing semantic word similarity measures for video retrieval
Y Aytar, M Shah, J Luo
2008 IEEE conference on computer vision and pattern recognition, 1-8, 2008
922008
Robocat: A self-improving foundation agent for robotic manipulation
K Bousmalis, G Vezzani, D Rao, C Devin, AX Lee, M Bauza, T Davchev, ...
arXiv preprint arXiv:2306.11706 1 (8), 2023
892023
Hệ thống không thể thực hiện thao tác ngay bây giờ. Hãy thử lại sau.
Bài viết 1–20