Brandon B Cui
Facebook AI Research
Verified email at fb.com
Title
Cited by
Year
Introducing mpt-7b: A new standard for open-source, commercially usable llms
MosaicML NLP Team
Accessed, 2023
Cited by 276*, 2023
Trajectory diversity for zero-shot coordination
A Lupu, B Cui, H Hu, J Foerster
International conference on machine learning, 7204-7213, 2021
Cited by 111, 2021
Compilergym: Robust, performant compiler optimization environments for ai research
C Cummins, B Wasti, J Guo, B Cui, J Ansel, S Gomez, S Jain, J Liu, ...
2022 IEEE/ACM International Symposium on Code Generation and Optimization …, 2022
Cited by 80, 2022
Off-belief learning
H Hu, A Lerer, B Cui, L Pineda, N Brown, J Foerster
International Conference on Machine Learning, 4369-4379, 2021
Cited by 70, 2021
K-level reasoning for zero-shot coordination in hanabi
B Cui, H Hu, L Pineda, J Foerster
Advances in Neural Information Processing Systems 34, 8215-8228, 2021
Cited by 37, 2021
Adversarial diversity in hanabi
B Cui, A Lupu, S Sokota, H Hu, DJ Wu, JN Foerster
The Eleventh International Conference on Learning Representations, 2023
Cited by 17, 2023
Control-aware representations for model-based reinforcement learning
B Cui, Y Chow, M Ghavamzadeh
arXiv preprint arXiv:2006.13408, 2020
Cited by 17, 2020
Variational model-based policy optimization
Y Chow, B Cui, MK Ryu, M Ghavamzadeh
arXiv preprint arXiv:2006.05443, 2020
Cited by 13, 2020
Critique-out-loud reward models
Z Ankner, M Paul, B Cui, JD Chang, P Ammanabrolu
arXiv preprint arXiv:2408.11791, 2024
Cited by 12, 2024
Learning space partitions for path planning
K Yang, T Zhang, C Cummins, B Cui, B Steiner, L Wang, JE Gonzalez, ...
Advances in Neural Information Processing Systems 34, 378-391, 2021
Cited by 11, 2021
Critique-out-loud reward models, 2024
Z Ankner, M Paul, B Cui, JD Chang, P Ammanabrolu
URL https://arxiv.org/abs/2408.11791, 0
Cited by 6
Off-team learning
B Cui, H Hu, A Lupu, S Sokota, J Foerster
Advances in Neural Information Processing Systems 35, 15407-15419, 2022
Cited by 1, 2022
Terahertz waveguide with a negative effective index of refraction measured using time domain techniques
S Pandey, B Gupta, B Cui, D Schurig, A Nahata
2016 41st International Conference on Infrared, Millimeter, and Terahertz …, 2016
Cited by 1, 2016
Self-explaining deviations for coordination
H Hu, S Sokota, D Wu, A Bakhtin, A Lupu, B Cui, J Foerster
Advances in Neural Information Processing Systems 35, 38400-38410, 2022
2022
Community Infrastructure for Applying Reinforcement Learning to Compiler Optimizations
C Cummins, B Wasti, J Guo, B Cui, J Ansel, S Gomez, S Jain, J Liu, ...
Articles 1–15