Obelics: An open web-scale filtered dataset of interleaved image-text documents H Laurençon, L Saulnier, L Tronchon, S Bekman, A Singh, A Lozhkov, ... Advances in Neural Information Processing Systems 36, 2024 | 251 | 2024 |
What matters when building vision-language models? H Laurençon, L Tronchon, M Cord, V Sanh arXiv preprint arXiv:2405.02246, 2024 | 141 | 2024 |
Building and better understanding vision-language models: insights and future directions H Laurençon, A Marafioti, V Sanh, L Tronchon Workshop on Responsibly Building the Next Generation of Multimodal …, 2024 | 29 | 2024 |
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset H Laurençon, L Tronchon, V Sanh arXiv preprint arXiv:2403.09029, 2024 | 14 | 2024 |
Introducing idefics: An open reproduction of state-of-the-art visual language model, 2023 H Laurençon, D van Strien, S Bekman, L Tronchon, L Saulnier, T Wang, ... URL https://huggingface. co/blog/idefics. Accessed, 09-18, 2023 | 13 | 2023 |
What matters when building vision-language models?(2024) H Laurençon, L Tronchon, M Cord, V Sanh URL https://arxiv. org/abs/2405.02246, 0 | 7 | |
Introducing idefics: An open reproduction of state-of-the-art visual language model H Laurencon, D van Strien, S Bekman, L Tronchon, L Saulnier, T Wang, ... August, 2023 | 5 | 2023 |
Obelics: An open web-scale filtered dataset of interleaved image-text documents (2023) H Laurençon, L Saulnier, L Tronchon, S Bekman, A Singh, A Lozhkov, ... URL https://arxiv. org/abs/2306.16527, 0 | 5 | |