Følg
Yibo Jiang
Yibo Jiang
Verifisert e-postadresse på uchicago.edu - Startside
Tittel
Sitert av
Sitert av
År
Beyond reverse KL: Generalizing direct preference optimization with diverse divergence constraints
C Wang, Y Jiang, C Yang, H Liu, Y Chen
International conference on learning representations (ICLR), Spotlight, 2023
652023
Invariant and transportable representations for anti-causal domain shifts
Y Jiang, V Veitch
Advances in Neural Information Processing Systems (NeurIPS) 35, 20782-20794, 2022
352022
Learning nonparametric latent causal graphs with unknown interventions
Y Jiang, B Aragam
Advances in Neural Information Processing Systems (NeurIPS) 36, 2023
302023
The geometry of categorical and hierarchical concepts in large language models
K Park, YJ Choe, Y Jiang, V Veitch
International conference on learning representations (ICLR), Oral, 2024
192024
On the origins of linear representations in large language models
Y Jiang, G Rajendran, P Ravikumar, B Aragam, V Veitch
International conference on machine learning (ICML), 2024
162024
Associative memory in iterated overparameterized sigmoid autoencoders
Y Jiang, C Pehlevan
International conference on machine learning (ICML), 4828-4838, 2020
152020
Meta-learning to cluster
Y Jiang, N Verma
arXiv preprint arXiv:1910.14134, 2019
132019
Uncovering meanings of embeddings via partial orthogonality
Y Jiang, B Aragam, V Veitch
Advances in Neural Information Processing Systems (NeurIPS) 36, 2023
112023
Humanity's Last Exam
L Phan, A Gatti, Z Han, N Li, J Hu, H Zhang, S Shi, M Choi, A Agrawal, ...
arXiv preprint arXiv:2501.14249, 2025
62025
Do LLMs dream of elephants (when told not to)? Latent concept association and associative memory in transformers
Y Jiang, G Rajendran, P Ravikumar, B Aragam
Advances in Neural Information Processing Systems (NeurIPS) 37, 2024
52024
Model-agnostic meta-learning using runge-kutta methods
DJ Im, Y Jiang, N Verma
arXiv preprint arXiv:1910.07368, 2019
42019
Quantifying generalization complexity for large language models
Z Qi, H Luo, X Huang, Z Zhao, Y Jiang, X Fan, H Lakkaraju, J Glass
International conference on learning representations (ICLR), 2024
32024
Direct acquisition optimization for low-budget active learning
Z Zhao, Y Jiang, Y Chen
arXiv preprint arXiv:2402.06045, 2024
32024
Beyond reward hacking: causal rewards for large language model alignment
C Wang, Z Zhao, Y Jiang, Z Chen, C Zhu, Y Chen, J Liu, L Zhang, X Fan, ...
arXiv preprint arXiv:2501.09620, 2025
2025
Systemet kan ikke utføre handlingen. Prøv på nytt senere.
Artikler 1–14