Segueix
Alexander Havrilla
Alexander Havrilla
Correu electrònic verificat a gatech.edu - Pàgina d'inici
Títol
Citada per
Citada per
Any
Illustrating reinforcement learning from human feedback (rlhf)
N Lambert, L Castricato, L von Werra, A Havrilla
Hugging Face Blog 9, 2022
1192022
Arb: Advanced reasoning benchmark for large language models
T Sawada, D Paleka, A Havrilla, P Tadepalli, P Vidas, A Kranias, JJ Nay, ...
arXiv preprint arXiv:2307.13692, 2023
662023
Teaching large language models to reason with reinforcement learning
A Havrilla, Y Du, SC Raparthy, C Nalmpantis, J Dwivedi-Yu, ...
arXiv preprint arXiv:2403.04642, 2024
482024
trlX: A framework for large scale reinforcement learning from human feedback
A Havrilla, M Zhuravinskyi, D Phung, A Tiwari, J Tow, S Biderman, ...
Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023
372023
Glore: When, where, and how to improve llm reasoning via global and local refinements
A Havrilla, S Raparthy, C Nalmpantis, J Dwivedi-Yu, M Zhuravinskyi, ...
arXiv preprint arXiv:2402.10963, 2024
302024
Sharp Khinchin-type inequalities for symmetric discrete uniform random variables
A Havrilla, T Tkocz
Israel Journal of Mathematics 246 (1), 281-297, 2021
142021
Understanding the effect of noise in llm training data with algorithmic chains of thought
A Havrilla, M Iyer
arXiv preprint arXiv:2402.04004, 2024
102024
Robust preference learning for storytelling via contrastive reinforcement learning
L Castricato, A Havrilla, S Matiana, M Pieler, A Ye, I Yang, S Frazier, ...
arXiv preprint arXiv:2210.07792, 2022
102022
On deep generative models for approximation and estimation of distributions on manifolds
B Dahal, A Havrilla, M Chen, T Zhao, W Liao
Advances in Neural Information Processing Systems 35, 10615-10628, 2022
82022
Khinchin-type inequalities via Hadamard’s factorisation
A Havrilla, P Nayar, T Tkocz
International Mathematics Research Notices 2023 (3), 2429-2445, 2023
72023
trlX: A scalable framework for RLHF, June 2023
L Castricato, A Havrilla, S Matiana, DV Phung, A Tiwari, J Tow, ...
URL https://github. com/CarperAI/trlx, 0
7
Deep nonparametric estimation of intrinsic data structures by chart autoencoders: Generalization error and robustness
H Liu, A Havrilla, R Lai, W Liao
Applied and Computational Harmonic Analysis 68, 101602, 2024
62024
Understanding scaling laws with statistical and approximation theory for transformer neural networks on intrinsically low-dimensional data
A Havrilla, W Liao
arXiv preprint arXiv:2411.06646, 2024
52024
trlX: A scalable framework for RLHF
L Castricato, A Havrilla, S Matiana, DV Phung, A Tiwari, J Tow, ...
Zenodo. DOI 10, 2023
52023
Illustrating Reinforcement Learning from Human Feedback (RLHF)[WWW Document]
N Lambert, L Castricato, L von Werra, A Havrilla
Hugging Face. URL https://huggingface. co/blog/rlhf (accessed 12.10. 23), 2022
52022
ARB: Advanced Reasoning Benchmark for Large Language Models (2023)
T Sawada, D Paleka, A Havrilla, P Tadepalli, P Vidas, A Kranias, JJ Nay, ...
Publisher: arXiv Version, 0
5
Deep nonparametric estimation of intrinsic data structures by chart autoencoders: Generalization error and robustness
H Liu, A Havrilla, R Lai, W Liao
arXiv preprint arXiv:2303.09863, 2023
42023
A study on improving reasoning in language models
Y Du, A Havrilla, S Sukhbaatar, P Abbeel, R Raileanu
I Can't Believe It's Not Better Workshop: Failure Modes in the Age of …, 2024
22024
Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models
A Havrilla, A Dai, L O'Mahony, K Oostermeijer, V Zisler, A Albalak, F Milo, ...
arXiv preprint arXiv:2412.02980, 2024
12024
DFU: scale-robust diffusion model for zero-shot super-resolution image generation
A Havrilla, K Rojas, W Liao, M Tao
arXiv preprint arXiv:2401.06144, 2023
12023
En aquests moments el sistema no pot dur a terme l'operació. Torneu-ho a provar més tard.
Articles 1–20