Követés
Hongkang Li
Cím
Hivatkozott rá
Hivatkozott rá
Év
A Theoretical Understanding of shallow Vision Transformers: Learning, Generalization, and Sample Complexity
H Li, M Wang, S Liu, PY Chen
International Conference on Learning Representations 2023, 2023
762023
Generalization guarantee of training graph convolutional networks with graph topology sampling
H Li, M Wang, S Liu, PY Chen, J Xiong
International Conference on Machine Learning, 13014-13051, 2022
282022
How Do Nonlinear Transformers Learn and Generalize in In-Context Learning?
H Li, M Wang, S Lu, X Cui, PY Chen
ICML 2024, 2024
26*2024
On the Convergence and Sample Complexity Analysis of Deep Q-Networks with -Greedy Exploration
S Zhang, H Li, M Wang, M Liu, PY Chen, S Lu, S Liu, K Murugesan, ...
neurips 2023, 2023
242023
What Improves the Generalization of Graph Transformer? A Theoretical Dive into Self-attention and Positional Encoding
H Li, M Wang, T Ma, S Liu, Z Zhang, PY Chen
ICML 2024, 2023
132023
Transformers as Multi-Task Feature Selectors: Generalization Analysis of In-Context Learning
H Li, M Wang, S Lu, H Wan, X Cui, PY Chen
NeurIPS 2023 Workshop on Mathematics of Modern Machine Learning, 2023
112023
Learning and generalization of one-hidden-layer neural networks, going beyond standard gaussian data
H Li, S Zhang, M Wang
2022 56th Annual Conference on Information Sciences and Systems (CISS), 37-42, 2022
102022
How does promoting the minority fraction affect generalization? A theoretical study of one-hidden-layer neural network on group imbalance
H Li, S Zhang, Y Zhang, M Wang, S Liu, PY Chen
IEEE Journal of Selected Topics in Signal Processing, 2024
82024
Enhancing Graph Transformers with Hierarchical Distance Structural Encoding
Y Luo, H Li, L Shi, XM Wu
arXiv preprint arXiv:2308.11129, 2024
7*2024
How Do Nonlinear Transformers Acquire Generalization-Guaranteed CoT Ability?
H Li, M Wang, S Lu, X Cui, PY Chen
High-dimensional Learning Dynamics 2024: The Emergence of Structure and …, 2024
42024
Learning on transformers is provable low-rank and sparse: A one-layer analysis
H Li, M Wang, S Zhang, S Liu, PY Chen
2024 IEEE 13rd Sensor Array and Multichannel Signal Processing Workshop (SAM …, 2024
32024
Training Nonlinear Transformers for Chain-of-Thought Inference: A Theoretical Generalization Analysis
H Li, M Wang, S Lu, X Cui, PY Chen
arXiv preprint arXiv:2410.02167, 2024
12024
How Can Personalized Context Help? Exploring Joint Retrieval of Passage and Personalized Context
H Wan, H Li, S Lu, X Cui, M Danilevsky
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
2024
A rendszer jelenleg nem tudja elvégezni a műveletet. Próbálkozzon újra később.
Cikkek 1–13