Analogies minus analogy test: measuring regularities in word embeddings L Fournier, E Dupoux, E Dunbar Proceedings of the 24th Conference on Computational Natural Language …, 2020 | 25 | 2020 |
Can Forward Gradient Match Backpropagation? L Fournier, S Rivaud, E Belilovsky, M Eickenberg, E Oyallon Proceedings of the 40th International Conference on Machine Learning 202 …, 2023 | 14 | 2023 |
Preventing dimensional collapse in contrastive local learning with subsampling L Fournier, A Patel, M Eickenberg, E Oyallon, E Belilovsky ICML 2023 Workshop on Localized Learning (LLW), 2023 | 3 | 2023 |
ACCO: Accumulate while you Communicate, Hiding Communications in Distributed LLM Training A Nabli, L Fournier, P Erbacher, L Serrano, E Belilovsky, E Oyallon arXiv preprint arXiv:2406.02613, 2024 | 2 | 2024 |
Paraphrases do not explain word analogies L Fournier, E Dunbar Proceedings of the 16th Conference of the European Chapter of the …, 2021 | 2 | 2021 |
WASH: Train your Ensemble with Communication-Efficient Weight Shuffling, then Average L Fournier, A Nabli, M Aminbeidokhti, M Pedersoli, E Belilovsky, ... arXiv preprint arXiv:2405.17517, 2024 | 1 | 2024 |
Parallelizable training in deep learning through local and distributed approaches L Fournier Sorbonne Université, 2024 | | 2024 |
PETRA: Parallel End-to-end Training with Reversible Architectures S Rivaud, L Fournier, T Pumir, E Belilovsky, M Eickenberg, E Oyallon arXiv preprint arXiv:2406.02052, 2024 | | 2024 |
Cyclic Data Parallelism for Efficient Parallelism of Deep Neural Networks L Fournier, E Oyallon arXiv preprint arXiv:2403.08837, 2024 | | 2024 |