Alexander Havrilla

Cited by

	All	Since 2020
Citations	390	389
h-index	8	8
i10-index	8	8

Citations per year: 2020: 2; 2021: 1; 2022: 3; 2023: 79; 2024: 255; 2025: 48

Public access

3 articles available

0 articles not available

Based on funding mandates


Alexander Havrilla

Georgia Institute of Technology

Verified email at gatech.edu

Machine learning, Large language modeling


Title	Cited by	Year
Illustrating reinforcement learning from human feedback (RLHF) N Lambert, L Castricato, L von Werra, A Havrilla Hugging Face Blog 9, 2022	119	2022
ARB: Advanced reasoning benchmark for large language models T Sawada, D Paleka, A Havrilla, P Tadepalli, P Vidas, A Kranias, JJ Nay, ... arXiv preprint arXiv:2307.13692, 2023	66	2023
Teaching large language models to reason with reinforcement learning A Havrilla, Y Du, SC Raparthy, C Nalmpantis, J Dwivedi-Yu, ... arXiv preprint arXiv:2403.04642, 2024	48	2024
trlX: A framework for large scale reinforcement learning from human feedback A Havrilla, M Zhuravinskyi, D Phung, A Tiwari, J Tow, S Biderman, ... Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023	37	2023
GLoRe: When, where, and how to improve LLM reasoning via global and local refinements A Havrilla, S Raparthy, C Nalmpantis, J Dwivedi-Yu, M Zhuravinskyi, ... arXiv preprint arXiv:2402.10963, 2024	30	2024
Sharp Khinchin-type inequalities for symmetric discrete uniform random variables A Havrilla, T Tkocz Israel Journal of Mathematics 246 (1), 281-297, 2021	14	2021
Understanding the effect of noise in LLM training data with algorithmic chains of thought A Havrilla, M Iyer arXiv preprint arXiv:2402.04004, 2024	10	2024
Robust preference learning for storytelling via contrastive reinforcement learning L Castricato, A Havrilla, S Matiana, M Pieler, A Ye, I Yang, S Frazier, ... arXiv preprint arXiv:2210.07792, 2022	10	2022
On deep generative models for approximation and estimation of distributions on manifolds B Dahal, A Havrilla, M Chen, T Zhao, W Liao Advances in Neural Information Processing Systems 35, 10615-10628, 2022	8	2022
Khinchin-type inequalities via Hadamard’s factorisation A Havrilla, P Nayar, T Tkocz International Mathematics Research Notices 2023 (3), 2429-2445, 2023	7	2023
trlX: A scalable framework for RLHF, June 2023 L Castricato, A Havrilla, S Matiana, DV Phung, A Tiwari, J Tow, ... URL https://github.com/CarperAI/trlx, 0	7
Deep nonparametric estimation of intrinsic data structures by chart autoencoders: Generalization error and robustness H Liu, A Havrilla, R Lai, W Liao Applied and Computational Harmonic Analysis 68, 101602, 2024	6	2024
Understanding scaling laws with statistical and approximation theory for transformer neural networks on intrinsically low-dimensional data A Havrilla, W Liao arXiv preprint arXiv:2411.06646, 2024	5	2024
trlX: A scalable framework for RLHF L Castricato, A Havrilla, S Matiana, DV Phung, A Tiwari, J Tow, ... Zenodo. DOI 10, 2023	5	2023
Illustrating Reinforcement Learning from Human Feedback (RLHF) [WWW Document] N Lambert, L Castricato, L von Werra, A Havrilla Hugging Face. URL https://huggingface.co/blog/rlhf (accessed 12.10.23), 2022	5	2022
ARB: Advanced Reasoning Benchmark for Large Language Models (2023) T Sawada, D Paleka, A Havrilla, P Tadepalli, P Vidas, A Kranias, JJ Nay, ... Publisher: arXiv Version, 0	5
Deep nonparametric estimation of intrinsic data structures by chart autoencoders: Generalization error and robustness H Liu, A Havrilla, R Lai, W Liao arXiv preprint arXiv:2303.09863, 2023	4	2023
A study on improving reasoning in language models Y Du, A Havrilla, S Sukhbaatar, P Abbeel, R Raileanu I Can't Believe It's Not Better Workshop: Failure Modes in the Age of …, 2024	2	2024
Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models A Havrilla, A Dai, L O'Mahony, K Oostermeijer, V Zisler, A Albalak, F Milo, ... arXiv preprint arXiv:2412.02980, 2024	1	2024
DFU: scale-robust diffusion model for zero-shot super-resolution image generation A Havrilla, K Rojas, W Liao, M Tao arXiv preprint arXiv:2401.06144, 2023	1	2023
