Audrey Huang

צוטט על ידי

	הכל	מאז 2020
ציטוטים ביבליוגרפיים	546	447
H-index	10	10
i10-index	10	10

140

105

201520162017201820192020202120222023202420255 12 28 32 18 41 39 73 137 130 27

גישה ציבורית

הצג הכל

8 מאמרים

0 מאמרים

זמין

לא זמין

על סמך ייפוי כח מהמממנים

עקוב אחר

Audrey Huang

University of Illinois Urbana-Champaign

כתובת אימייל מאומתת בדומיין illinois.edu

Reinforcement Learning Machine Learning Optimization


כותרת מיון לפי ציטוט ביבליוגרפי מיון לפי שנה מיון לפי כותרת	צוטט על ידי צוטט על ידי	שנה
Offline reinforcement learning with realizability and single-policy concentrability‏ W Zhan, B Huang, A Huang, N Jiang, J Lee‏ Conference on Learning Theory, 2730-2775, 2022‏	131	2022
Structural conservation of chemotaxis machinery across A rchaea and B acteria‏ A Briegel, DR Ortega, AN Huang, CM Oikonomou, RP Gunsalus, ...‏ Environmental microbiology reports 7 (3), 414-419, 2015‏	93	2015
Asymmetric enzymatic synthesis of allylic amines: a sigmatropic rearrangement strategy‏ CK Prier, TK Hyster, CC Farwell, A Huang, FH Arnold‏ Angewandte Chemie International Edition 55 (15), 4711-4715, 2016‏	82	2016
Graph-structured visual imitation‏ M Sieb, Z Xian, A Huang, O Kroemer, K Fragkiadaki‏ Conference on Robot Learning, 979-989, 2020‏	72	2020
Off-policy risk assessment in contextual bandits‏ A Huang, L Leqi, Z Lipton, K Azizzadenesheli‏ Advances in Neural Information Processing Systems 34, 23714-23726, 2021‏	38	2021
Morphology of the archaellar motor and associated cytoplasmic cone in Thermococcus kodakaraensis‏ A Briegel, CM Oikonomou, YW Chang, A Kjær, AN Huang, KW Kim, ...‏ EMBO reports 18 (9), 1660-1670, 2017‏	35	2017
Reinforcement learning in low-rank mdps with density features‏ A Huang, J Chen, N Jiang‏ International Conference on Machine Learning, 13710-13752, 2023‏	20	2023
On the convergence and optimality of policy gradient for markov coherent risk‏ A Huang, L Leqi, ZC Lipton, K Azizzadenesheli‏ arXiv preprint arXiv:2103.02827, 2021‏	20	2021
Beyond the return: Off-policy function estimation under user-specified error-measuring distributions‏ A Huang, N Jiang‏ Advances in Neural Information Processing Systems 35, 6292-6303, 2022‏	11	2022
Supervised learning with general risk functionals‏ L Leqi, A Huang, Z Lipton, K Azizzadenesheli‏ International Conference on Machine Learning, 12570-12592, 2022‏	11	2022
Off-policy risk assessment for markov decision processes‏ A Huang, L Leqi, Z Lipton, K Azizzadenesheli‏ International Conference on Artificial Intelligence and Statistics, 5022-5050, 2022‏	8	2022
Correcting the mythos of kl-regularization: Direct alignment without overparameterization via chi-squared preference optimization‏ A Huang, W Zhan, T Xie, JD Lee, W Sun, A Krishnamurthy, DJ Foster‏ arXiv e-prints, arXiv: 2407.13399, 2024‏	7	2024
Correcting the mythos of kl-regularization: Direct alignment without overoptimization via chi-squared preference optimization‏ A Huang, W Zhan, T Xie, JD Lee, W Sun, A Krishnamurthy, DJ Foster‏ arXiv preprint arXiv:2407.13399, 2024‏	6	2024
Self-Improvement in Language Models: The Sharpening Mechanism‏ A Huang, A Block, DJ Foster, D Rohatgi, C Zhang, M Simchowitz, JT Ash, ...‏ arXiv preprint arXiv:2412.01951, 2024‏	3	2024
Non-adaptive online finetuning for offline reinforcement learning‏ A Huang, M Ghavamzadeh, N Jiang, M Petrik‏ Reinforcement Learning Conference, 2024‏	3	2024
Physically optimizing inference‏ A Huang, B Sheldan, DA Sivak, M Thomson‏ arXiv preprint arXiv:1805.07512, 2018‏	3	2018
Timing as an Action: Learning When to Observe and Act‏ H Zhou, A Huang, K Azizzadenesheli, D Childers, Z Lipton‏ International Conference on Artificial Intelligence and Statistics, 3979-3987, 2024‏	1	2024
RiskyZoo: A Library for Risk-Sensitive Supervised Learning‏ W Wong, A Huang, L Leqi, K Azizzadenesheli, ZC Lipton‏ ICML 2022 Workshop on Responsible Decision Making in Dynamic Environments, 2022‏	1	2022
Structure of the archaellar motor and associated cytoplasmic cone in Thermococcus kodakaraensis‏ A Briegel, CM Oikonomou, YW Chang, A Kjaer, AN Huang, KW Kim, ...‏ bioRxiv, 108209, 2017‏	1	2017
Computational-Statistical Tradeoffs at the Next-Token Prediction Barrier: Autoregressive and Imitation Learning under Misspecification‏ D Rohatgi, A Block, A Huang, A Krishnamurthy, DJ Foster‏ arXiv preprint arXiv:2502.12465, 2025‏		2025

המערכת אינה יכולה לבצע את הפעולה כעת. נסה שוב מאוחר יותר.

מאמרים 1–20

ציטוטים ביבליוגרפיים בשנה

ציטוטים ביביליוגרפיים כפולים

ציטוטים ביביליוגרפיים שמוזגו

הוסף מחברים שותפיםמחברים משותפים

עקוב אחר

צוטט על ידי