Følg
Siyu Chen
Siyu Chen
Ph.D. of S&DS, Yale University
Verificeret mail på yale.edu - Startside
Titel
Citeret af
Citeret af
År
Training Dynamics of Multi-Head Softmax Attention for In-Context Learning: Emergence, Convergence, and Optimality
S Chen, H Sheen, T Wang, Z Yang
arXiv preprint arXiv:2402.19442, 2024
392024
Adaptive model design for Markov decision process
S Chen, D Yang, J Li, S Wang, Z Yang, Z Wang
International Conference on Machine Learning, 3679-3700, 2022
132022
Implicit Regularization of Gradient Flow on One-Layer Softmax Attention
H Sheen, S Chen, T Wang, HH Zhou
arXiv preprint arXiv:2403.08699, 2024
82024
Unveiling Induction Heads: Provable Training Dynamics and Feature Learning in Transformers
S Chen, H Sheen, T Wang, Z Yang
arXiv preprint arXiv:2409.10559, 2024
72024
Learning to incentivize information acquisition: Proper scoring rules meet principal-agent model
S Chen, J Wu, Y Wu, Z Yang
International Conference on Machine Learning, 5194-5218, 2023
72023
Unveiling the Statistical Foundations of Chain-of-Thought Prompting Methods
X Hu, F Zhang, S Chen, Z Yang
arXiv preprint arXiv:2408.14511, 2024
62024
From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems
J He, S Chen, F Zhang, Z Yang
arXiv preprint arXiv:2405.19883, 2024
62024
Wasserstein flow meets replicator dynamics: A mean-field analysis of representation learning in actor-critic
Y Zhang, S Chen, Z Yang, M Jordan, Z Wang
Advances in Neural Information Processing Systems 34, 15993-16006, 2021
62021
Contractual Reinforcement Learning: Pulling Arms with Invisible Hands
J Wu, S Chen, M Wang, H Wang, H Xu
arXiv preprint arXiv:2407.01458, 2024
52024
Actions Speak What You Want: Provably Sample-Efficient Reinforcement Learning of the Quantal Stackelberg Equilibrium from Strategic Feedbacks
S Chen, M Wang, Z Yang
arXiv preprint arXiv:2307.14085, 2023
42023
A unified framework of policy learning for contextual bandit with confounding bias and missing observations
S Chen, Y Wang, Z Wang, Z Yang
arXiv preprint arXiv:2303.11187, 2023
42023
Can Neural Networks Achieve Optimal Computational-statistical Tradeoff? An Analysis on Single-Index Model
S Chen, B Wu, M Lu, Z Yang, T Wang
NeurIPS 2024 Workshop on Mathematics of Modern Machine Learning, 0
Systemet kan ikke foretage handlingen nu. Prøv igen senere.
Artikler 1–12