Obelics: An open web-scale filtered dataset of interleaved image-text documents H Laurençon, L Saulnier, L Tronchon, S Bekman, A Singh, A Lozhkov, ... Advances in Neural Information Processing Systems 36, 71683-71702, 2023 | 258 | 2023 |
What matters when building vision-language models? H Laurençon, L Tronchon, M Cord, V Sanh Advances in Neural Information Processing Systems 37, 87874-87907, 2025 | 161 | 2025 |
Building and better understanding vision-language models: insights and future directions H Laurençon, A Marafioti, V Sanh, L Tronchon Workshop on Responsibly Building the Next Generation of Multimodal …, 2024 | 35 | 2024 |
Unlocking the conversion of web screenshots into html code with the websight dataset H Laurençon, L Tronchon, V Sanh arXiv preprint arXiv:2403.09029, 2024 | 16 | 2024 |
Introducing idefics: An open reproduction of state-of-the-art visual language model, 2023 H Laurençon, D van Strien, S Bekman, L Tronchon, L Saulnier, T Wang, ... URL https://huggingface.co/blog/idefics. Accessed, 09-18, 2023 | 13 | 2023 |
What matters when building vision-language models? (2024) H Laurençon, L Tronchon, M Cord, V Sanh URL https://api.semanticscholar.org/CorpusID:269587869 (8), 9, 0 | 13 | |
Introducing IDEFICS: An Open Reproduction of State-of-the-Art Visual Language Model H Laurençon, D van Strien, S Bekman, L Tronchon, L Saulnier, T Wang, ... Hugging Face, 2023 | 5 | 2023 |
Obelics: An open web-scale filtered dataset of interleaved image-text documents (2023) H Laurençon, L Saulnier, L Tronchon, S Bekman, A Singh, A Lozhkov, ... URL https://arxiv.org/abs/2306.16527, 0 | 5 | |