Transformers as Algorithms: Generalization and Stability in In-context Learning Y Li, ME Ildiz, D Papailiopoulos, S Oymak International Conference on Machine Learning, 2023 | 168* | 2023 |
Transformers as support vector machines DA Tarzanagh, Y Li, C Thrampoulidis, S Oymak arXiv preprint arXiv:2308.16898, 2023 | 80 | 2023 |
Provable benefits of overparameterization in model compression: From double descent to pruning neural networks X Chang, Y Li, S Oymak, C Thrampoulidis Proceedings of the AAAI Conference on Artificial Intelligence 35 (8), 6974-6983, 2021 | 63 | 2021 |
Visualize your IP-over-optical network in realtime: A P4-based flexible multilayer in-band network telemetry (ML-INT) system B Niu, J Kong, S Tang, Y Li, Z Zhu IEEE Access 7, 82413-82423, 2019 | 60 | 2019 |
Max-margin token selection in attention mechanism DA Tarzanagh, Y Li, X Zhang, S Oymak Thirty-seventh Conference on Neural Information Processing Systems, 2023 | 54* | 2023 |
Dissecting chain-of-thought: Compositionality through in-context filtering and learning Y Li, K Sreenivasan, A Giannou, D Papailiopoulos, S Oymak Thirty-seventh Conference on Neural Information Processing Systems, 2023 | 36* | 2023 |
Mechanics of next token prediction with self-attention Y Li, Y Huang, ME Ildiz, AS Rawat, S Oymak International Conference on Artificial Intelligence and Statistics, 685-693, 2024 | 25 | 2024 |
From Self-Attention to Markov Models: Unveiling the Dynamics of Generative Transformers ME Ildiz, Y Huang, Y Li, AS Rawat, S Oymak arXiv preprint arXiv:2402.13512, 2024 | 13 | 2024 |
Provable and efficient continual representation learning Y Li, M Li, MS Asif, S Oymak arXiv preprint arXiv:2203.02026, 2022 | 8 | 2022 |
Stochastic contextual bandits with long horizon rewards Y Qin, Y Li, F Pasqualetti, M Fazel, S Oymak The Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023 | 5 | 2023 |
Provable pathways: Learning multiple tasks over multiple paths Y Li, S Oymak The Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023 | 4 | 2023 |
Network nervous system: When multilayer telemetry meets AI-assisted service provisioning J Kong, B Niu, S Tang, Y Li, H Fang, W Lu, Z Zhu 2019 18th International Conference on Optical Communications and Networks …, 2019 | 4 | 2019 |
Fine-grained analysis of in-context linear estimation: Data, architecture, and beyond Y Li, AS Rawat, S Oymak arXiv preprint arXiv:2407.10005, 2024 | 2 | 2024 |
Leveraging multilayer telemetry to realize AI-assisted service provisioning in IP over elastic optical networks Z Zhu, B Niu, J Kong, S Tang, Y Li, H Fang, W Lu 2019 24th OptoElectronics and Communications Conference (OECC) and 2019 …, 2019 | 2 | 2019 |
Can Mamba In-Context Learn Task Mixtures? Y Li, X Wei, H Zhao, T Ma ICML 2024 Workshop on In-Context Learning, 2024 | 1 | 2024 |
On the fairness of multitask representation learning Y Li, S Oymak ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 1 | 2023 |