Offline reinforcement learning with realizability and single-policy concentrability W Zhan, B Huang, A Huang, N Jiang, J Lee Conference on Learning Theory, 2730-2775, 2022 | 131 | 2022 |
Structural conservation of chemotaxis machinery across A rchaea and B acteria A Briegel, DR Ortega, AN Huang, CM Oikonomou, RP Gunsalus, ... Environmental microbiology reports 7 (3), 414-419, 2015 | 93 | 2015 |
Asymmetric enzymatic synthesis of allylic amines: a sigmatropic rearrangement strategy CK Prier, TK Hyster, CC Farwell, A Huang, FH Arnold Angewandte Chemie International Edition 55 (15), 4711-4715, 2016 | 82 | 2016 |
Graph-structured visual imitation M Sieb, Z Xian, A Huang, O Kroemer, K Fragkiadaki Conference on Robot Learning, 979-989, 2020 | 72 | 2020 |
Off-policy risk assessment in contextual bandits A Huang, L Leqi, Z Lipton, K Azizzadenesheli Advances in Neural Information Processing Systems 34, 23714-23726, 2021 | 38 | 2021 |
Morphology of the archaellar motor and associated cytoplasmic cone in Thermococcus kodakaraensis A Briegel, CM Oikonomou, YW Chang, A Kjær, AN Huang, KW Kim, ... EMBO reports 18 (9), 1660-1670, 2017 | 35 | 2017 |
Reinforcement learning in low-rank mdps with density features A Huang, J Chen, N Jiang International Conference on Machine Learning, 13710-13752, 2023 | 20 | 2023 |
On the convergence and optimality of policy gradient for markov coherent risk A Huang, L Leqi, ZC Lipton, K Azizzadenesheli arXiv preprint arXiv:2103.02827, 2021 | 20 | 2021 |
Beyond the return: Off-policy function estimation under user-specified error-measuring distributions A Huang, N Jiang Advances in Neural Information Processing Systems 35, 6292-6303, 2022 | 11 | 2022 |
Supervised learning with general risk functionals L Leqi, A Huang, Z Lipton, K Azizzadenesheli International Conference on Machine Learning, 12570-12592, 2022 | 11 | 2022 |
Off-policy risk assessment for markov decision processes A Huang, L Leqi, Z Lipton, K Azizzadenesheli International Conference on Artificial Intelligence and Statistics, 5022-5050, 2022 | 8 | 2022 |
Correcting the mythos of kl-regularization: Direct alignment without overparameterization via chi-squared preference optimization A Huang, W Zhan, T Xie, JD Lee, W Sun, A Krishnamurthy, DJ Foster arXiv e-prints, arXiv: 2407.13399, 2024 | 7 | 2024 |
Correcting the mythos of kl-regularization: Direct alignment without overoptimization via chi-squared preference optimization A Huang, W Zhan, T Xie, JD Lee, W Sun, A Krishnamurthy, DJ Foster arXiv preprint arXiv:2407.13399, 2024 | 6 | 2024 |
Self-Improvement in Language Models: The Sharpening Mechanism A Huang, A Block, DJ Foster, D Rohatgi, C Zhang, M Simchowitz, JT Ash, ... arXiv preprint arXiv:2412.01951, 2024 | 3 | 2024 |
Non-adaptive online finetuning for offline reinforcement learning A Huang, M Ghavamzadeh, N Jiang, M Petrik Reinforcement Learning Conference, 2024 | 3 | 2024 |
Physically optimizing inference A Huang, B Sheldan, DA Sivak, M Thomson arXiv preprint arXiv:1805.07512, 2018 | 3 | 2018 |
Timing as an Action: Learning When to Observe and Act H Zhou, A Huang, K Azizzadenesheli, D Childers, Z Lipton International Conference on Artificial Intelligence and Statistics, 3979-3987, 2024 | 1 | 2024 |
RiskyZoo: A Library for Risk-Sensitive Supervised Learning W Wong, A Huang, L Leqi, K Azizzadenesheli, ZC Lipton ICML 2022 Workshop on Responsible Decision Making in Dynamic Environments, 2022 | 1 | 2022 |
Structure of the archaellar motor and associated cytoplasmic cone in Thermococcus kodakaraensis A Briegel, CM Oikonomou, YW Chang, A Kjaer, AN Huang, KW Kim, ... bioRxiv, 108209, 2017 | 1 | 2017 |
Computational-Statistical Tradeoffs at the Next-Token Prediction Barrier: Autoregressive and Imitation Learning under Misspecification D Rohatgi, A Block, A Huang, A Krishnamurthy, DJ Foster arXiv preprint arXiv:2502.12465, 2025 | | 2025 |