DINOv2: Learning Robust Visual Features without Supervision M Oquab, T Darcet, T Moutakanni, H Vo, M Szafraniec, V Khalidov, ... arXiv preprint arXiv:2304.07193, 2023 | 2904* | 2023 |
LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference B Graham, A El-Nouby, H Touvron, P Stock, A Joulin, H Jégou, M Douze International Conference on Computer Vision, 2021 | 965* | 2021 |
ImageBind: One embedding space to bind them all R Girdhar, A El-Nouby, Z Liu, M Singh, KV Alwala, A Joulin, I Misra Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 901 | 2023 |
ResMLP: Feedforward networks for image classification with data-efficient training H Touvron, P Bojanowski, M Caron, M Cord, A El-Nouby, E Grave, ... IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (4), 5314-5321, 2022 | 838* | 2022 |
XCiT: Cross-Covariance Image Transformers A El-Nouby, H Touvron, M Caron, P Bojanowski, M Douze, A Joulin, ... 35th Conference on Neural Information Processing Systems (NeurIPS 2021), 2021 | 610* | 2021 |
Training vision transformers for image retrieval A El-Nouby, N Neverova, I Laptev, H Jégou arXiv preprint arXiv:2102.05644, 2021 | 199 | 2021 |
Tell, draw, and repeat: Generating and modifying images based on continual linguistic instruction A El-Nouby, S Sharma, H Schulz, D Hjelm, LE Asri, SE Kahou, Y Bengio, ... Proceedings of the IEEE International Conference on Computer Vision, 10304-10312, 2019 | 166* | 2019 |
Are large-scale datasets necessary for self-supervised pre-training? A El-Nouby, G Izacard, H Touvron, I Laptev, H Jégou, E Grave arXiv preprint arXiv:2112.10740, 2021 | 163 | 2021 |
Three things everyone should know about vision transformers H Touvron, M Cord, A El-Nouby, J Verbeek, H Jégou European Conference on Computer Vision, 497-515, 2022 | 142 | 2022 |
OmniMAE: Single model masked pretraining on images and videos R Girdhar, A El-Nouby, M Singh, KV Alwala, A Joulin, I Misra Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 117* | 2023 |
DataComp-LM: In search of the next generation of training sets for language models J Li, A Fang, G Smyrnis, M Ivgi, M Jordan, S Gadre, H Bansal, E Guha, ... arXiv preprint arXiv:2406.11794, 2024 | 73* | 2024 |
Augmenting convolutional networks with attention-based aggregation H Touvron, M Cord, A El-Nouby, P Bojanowski, A Joulin, G Synnaeve, ... arXiv preprint arXiv:2112.13692, 2021 | 68* | 2021 |
Scalable Pre-training of Large Autoregressive Image Models A El-Nouby, M Klein, S Zhai, MA Bautista, A Toshev, V Shankar, ... International Conference on Machine Learning, 2024 | 53 | 2024 |
Image compression with product quantized masked image modeling A El-Nouby, MJ Muckley, K Ullrich, I Laptev, J Verbeek, H Jégou arXiv preprint arXiv:2212.07372, 2022 | 36 | 2022 |
Improving statistical fidelity for neural image compression with implicit local likelihood models MJ Muckley, A El-Nouby, K Ullrich, H Jégou, J Verbeek International Conference on Machine Learning, 25426-25443, 2023 | 32* | 2023 |
Real-Time End-to-End Action Detection with Two-Stream Networks A Ali, GW Taylor 2018 15th Conference on Computer and Robot Vision (CRV), 31-38, 2018 | 31* | 2018 |
Skip-Clip: Self-Supervised Spatiotemporal Representation Learning by Future Clip Order Ranking A El-Nouby, S Zhai, GW Taylor, JM Susskind Holistic Video Understanding Workshop, ICCV 2019 | 19 | 2019 |
Multimodal autoregressive pre-training of large vision encoders E Fini, M Shukor, X Li, P Dufter, M Klein, D Haldimann, S Aitharaju, ... arXiv preprint arXiv:2411.14402, 2024 | 5 | 2024 |
Variable rate allocation for vector-quantized autoencoders F Baldassarre, A El-Nouby, H Jégou ICASSP 2023: IEEE International Conference on Acoustics, Speech and …, 2023 | 5 | 2023 |
Are Visual Recognition Models Robust to Image Compression? JM Janeiro, S Frolov, A El-Nouby, J Verbeek arXiv preprint arXiv:2304.04518, 2023 | 3 | 2023 |