Folgen
Yibo Jiang
Yibo Jiang
Bestätigte E-Mail-Adresse bei uchicago.edu - Startseite
Titel
Zitiert von
Zitiert von
Jahr
Beyond reverse KL: Generalizing direct preference optimization with diverse divergence constraints
C Wang, Y Jiang, C Yang, H Liu, Y Chen
International conference on learning representations (ICLR), Spotlight, 2023
602023
Invariant and transportable representations for anti-causal domain shifts
Y Jiang, V Veitch
Advances in Neural Information Processing Systems (NeurIPS) 35, 20782-20794, 2022
342022
Learning nonparametric latent causal graphs with unknown interventions
Y Jiang, B Aragam
Advances in Neural Information Processing Systems (NeurIPS) 36, 2023
292023
The geometry of categorical and hierarchical concepts in large language models
K Park, YJ Choe, Y Jiang, V Veitch
International conference on learning representations (ICLR), Oral, 2024
162024
On the origins of linear representations in large language models
Y Jiang, G Rajendran, P Ravikumar, B Aragam, V Veitch
International conference on machine learning (ICML), 2024
152024
Associative memory in iterated overparameterized sigmoid autoencoders
Y Jiang, C Pehlevan
International conference on machine learning (ICML), 4828-4838, 2020
152020
Meta-learning to cluster
Y Jiang, N Verma
arXiv preprint arXiv:1910.14134, 2019
132019
Uncovering meanings of embeddings via partial orthogonality
Y Jiang, B Aragam, V Veitch
Advances in Neural Information Processing Systems (NeurIPS) 36, 2023
102023
Do LLMs dream of elephants (when told not to)? Latent concept association and associative memory in transformers
Y Jiang, G Rajendran, P Ravikumar, B Aragam
Advances in Neural Information Processing Systems (NeurIPS) 37, 2024
52024
Direct acquisition optimization for low-budget active learning
Z Zhao, Y Jiang, Y Chen
arXiv preprint arXiv:2402.06045, 2024
42024
Model-agnostic meta-learning using runge-kutta methods
DJ Im, Y Jiang, N Verma
arXiv preprint arXiv:1910.07368, 2019
42019
Quantifying generalization complexity for large language models
Z Qi, H Luo, X Huang, Z Zhao, Y Jiang, X Fan, H Lakkaraju, J Glass
International conference on learning representations (ICLR), 2024
32024
Beyond reward hacking: causal rewards for large language model alignment
C Wang, Z Zhao, Y Jiang, Z Chen, C Zhu, Y Chen, J Liu, L Zhang, X Fan, ...
arXiv preprint arXiv:2501.09620, 2025
2025
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–13