Mad: A scalable dataset for language grounding in videos from movie audio descriptions M Soldan, A Pardo, JL Alcázar, F Caba, C Zhao, S Giancola, B Ghanem Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 106 | 2022 |
Refineloc: Iterative refinement for weakly-supervised action localization A Pardo, H Alwassel, F Caba, A Thabet, B Ghanem Proceedings of the IEEE/CVF winter conference on applications of computer …, 2021 | 61 | 2021 |
Moviecuts: A new dataset and benchmark for cut type recognition A Pardo, FC Heilbron, JL Alcázar, A Thabet, B Ghanem European Conference on Computer Vision, 668-685, 2022 | 30 | 2022 |
Learning to cut by watching movies A Pardo, F Caba, JL Alcázar, AK Thabet, B Ghanem Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 24 | 2021 |
BAOD: Budget-Aware Object Detection A Pardo, M Xu, A Thabet, P Arbelaez, B Ghanem arXiv preprint arXiv:1904.05443, 2019 | 24 | 2019 |
Evaluation of Test-Time Adaptation Under Computational Time Constraints M Alfarra, H Itani, A Pardo, M Ramazanova, JC Perez, M Müller, ... Forty-first International Conference on Machine Learning, 2024 | 11* | 2024 |
Exploring missing modality in multimodal egocentric datasets M Ramazanova, A Pardo, H Alwassel, B Ghanem arXiv preprint arXiv:2401.11470, 2024 | 4 | 2024 |
Compressed-Language Models for Understanding Compressed File Formats: a JPEG Exploration JC Pérez, A Pardo, M Soldan, H Itani, J Leon-Alcazar, B Ghanem arXiv preprint arXiv:2405.17146, 2024 | 2 | 2024 |
Towards Automated Movie Trailer Generation DM Argaw, M Soldan, A Pardo, C Zhao, FC Heilbron, JS Chung, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 2 | 2024 |
Combating Missing Modalities in Egocentric Videos at Test Time M Ramazanova, A Pardo, B Ghanem, M Alfarra arXiv preprint arXiv:2404.15161, 2024 | 1 | 2024 |
MatchDiffusion: Training-free Generation of Match-cuts A Pardo, F Pizzati, T Zhang, A Pondaven, P Torr, JC Perez, B Ghanem arXiv preprint arXiv:2411.18677, 2024 | | 2024 |
Generative Timelines for Instructed Visual Assembly A Pardo, JH Wang, B Ghanem, J Sivic, B Russell, FC Heilbron arXiv preprint arXiv:2411.12293, 2024 | | 2024 |
MotasemAlfarra/Online_Test_Time_Adaptation: Revisiting Test Time Adaptation Under Online Evaluation M Alfarra, H Itani, A Pardo, SY Alhuwaider, M Ramazanova, JC Pérez, ... Github, 2023 | | 2023 |
PardoAlejo/MovieCuts: Learning to cut end-to-end pretrained modules A Pardo, JL Alcázar, AK Thabet, B Ghanem, FD Caba Heilbron Github, 2021 | | 2021 |
Supplementary material for MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions M Soldan, A Pardo, JL Alcázar, FC Heilbron, C Zhao, S Giancola, ... | | |