Sainbayar Sukhbaatar
FAIR team, Meta AI
Verified email at fb.com
Title
Cited by
Year
End-To-End Memory Networks
S Sukhbaatar, A Szlam, J Weston, R Fergus
Cited by: 3358; Year: 2015
Learning multiagent communication with backpropagation
S Sukhbaatar, A Szlam, R Fergus
Advances in Neural Information Processing Systems, 2244-2252, 2016
Cited by: 1453; Year: 2016
Training Convolutional Networks with Noisy Labels
S Sukhbaatar, J Bruna, M Paluri, L Bourdev, R Fergus
Accepted as a workshop contribution at ICLR 2015, 2014
Cited by: 1009*; Year: 2014
Intrinsic motivation and automatic curricula via asymmetric self-play
S Sukhbaatar, Z Lin, I Kostrikov, G Synnaeve, A Szlam, R Fergus
arXiv preprint arXiv:1703.05407, 2017
Cited by: 446; Year: 2017
Simple baseline for visual question answering
B Zhou, Y Tian, S Sukhbaatar, A Szlam, R Fergus
arXiv preprint arXiv:1512.02167, 2015
Cited by: 430; Year: 2015
Learning when to communicate at scale in multiagent cooperative and competitive tasks
A Singh, T Jain, S Sukhbaatar
arXiv preprint arXiv:1812.09755, 2018
Cited by: 370; Year: 2018
Adaptive attention span in transformers
S Sukhbaatar, E Grave, P Bojanowski, A Joulin
arXiv preprint arXiv:1905.07799, 2019
Cited by: 331; Year: 2019
Hash layers for large sparse models
S Roller, S Sukhbaatar, J Weston
Advances in Neural Information Processing Systems 34, 17555-17566, 2021
Cited by: 197; Year: 2021
Augmenting self-attention with persistent memory
S Sukhbaatar, E Grave, G Lample, H Jegou, A Joulin
arXiv preprint arXiv:1907.01470, 2019
Cited by: 128; Year: 2019
Iterative reasoning preference optimization
RY Pang, W Yuan, H He, K Cho, S Sukhbaatar, J Weston
Advances in Neural Information Processing Systems 37, 116617-116637, 2025
Cited by: 92; Year: 2025
Composable planning with attributes
A Zhang, S Sukhbaatar, A Lerer, A Szlam, R Fergus
International Conference on Machine Learning, 5842-5851, 2018
Cited by: 86; Year: 2018
Mazebase: A sandbox for learning from games
S Sukhbaatar, A Szlam, G Synnaeve, S Chintala, R Fergus
arXiv preprint arXiv:1511.07401, 2015
Cited by: 83; Year: 2015
Memory-augmented reinforcement learning for image-goal navigation
L Mezghan, S Sukhbaatar, T Lavril, O Maksymets, D Batra, P Bojanowski, ...
2022 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2022
Cited by: 78; Year: 2022
Addressing Some Limitations of Transformers with Feedback Memory
A Fan, T Lavril, E Grave, A Joulin, S Sukhbaatar
arXiv preprint arXiv:2002.09402, 2020
Cited by: 77*; Year: 2020
Learning goal embeddings via self-play for hierarchical reinforcement learning
S Sukhbaatar, E Denton, A Szlam, R Fergus
arXiv preprint arXiv:1811.09083, 2018
Cited by: 66; Year: 2018
Some things are more cringe than others: Preference optimization with the pairwise cringe loss
J Xu, A Lee, S Sukhbaatar, J Weston
arXiv preprint arXiv:2312.16682, 2023
Cited by: 63; Year: 2023
System 2 attention (is something you might need too)
J Weston, S Sukhbaatar
arXiv preprint arXiv:2311.11829, 2023
Cited by: 59; Year: 2023
Teaching large language models to reason with reinforcement learning
A Havrilla, Y Du, SC Raparthy, C Nalmpantis, J Dwivedi-Yu, ...
arXiv preprint arXiv:2403.04642, 2024
Cited by: 50; Year: 2024
End-to-end memory networks
JE Weston, AD Szlam, RD Fergus, S Sukhbaatar
US Patent 10,664,744, 2020
Cited by: 46; Year: 2020
Not all memories are created equal: Learning to forget by expiring
S Sukhbaatar, D Ju, S Poff, S Roller, A Szlam, J Weston, A Fan
International Conference on Machine Learning, 9902-9912, 2021
Cited by: 42; Year: 2021