Statistical Perspective of Top-K Sparse Softmax Gating Mixture of Experts H Nguyen, P Akbarian, F Yan, N Ho International Conference on Learning Representation (ICLR), 2024 | 14 | 2024 |
Is Temperature Sample Efficient for Softmax Gaussian Mixture of Experts? H Nguyen, P Akbarian, N Ho International Conference on Machine Learning (ICML), 2024 | 8 | 2024 |
A General Theory for Softmax Gating Multinomial Logistic Mixture of Experts H Nguyen, P Akbarian, TT Nguyen, N Ho International Conference on Machine Learning (ICML), 2024 | 6 | 2024 |
Understanding expert structures on minimax parameter estimation in contaminated mixture of experts F Yan, H Nguyen, D Le, P Akbarian, N Ho International Conference on Artificial Intelligence and Statistics (AISTATS …, 2025 | 1 | 2025 |
Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts H Nguyen, P Akbarian, T Pham, T Nguyen, S Zhang, N Ho International Conference on Learning Representations (ICLR), 2025 | 1 | 2025 |
Quadratic Gating Functions in Mixture of Experts: A Statistical Insight P Akbarian*, H Nguyen*, X Han*, N Ho arXiv preprint arXiv:2410.11222, 2024 | 1 | 2024 |
Sigmoid Self-Attention is Better than Softmax Self-Attention: A Mixture-of-Experts Perspective F Yan, H Nguyen, P Akbarian, N Ho, A Rinaldo arXiv preprint arXiv:2502.00281, 2025 | | 2025 |
Improving Computational Complexity in Statistical Models with Local Curvature Information P Akbarian*, T Ren*, J Zhuo, S Sanghavi, N Ho International Conference on Machine Learning (ICML), 2024 | | 2024 |
Improving Counterfactual Explanations for Time Series Classification Models in Healthcare Settings T Han, J Henderson, P Akbarian, J Ghosh NeurIPS 2022 Workshop on Learning from Time Series for Health, 2022 | | 2022 |