Tensorir: An abstraction for automatic tensorized program optimization S Feng, B Hou, H Jin, W Lin, J Shao, R Lai, Z Ye, L Zheng, CH Yu, Y Yu, ... Proceedings of the 28th ACM International Conference on Architectural …, 2023 | 71 | 2023 |
Recurrent residual module for fast inference in videos B Pan, W Lin, X Fang, C Huang, B Zhou, C Lu Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018 | 44 | 2018 |
Tensor program optimization with probabilistic programs J Shao, X Zhou, S Feng, B Hou, R Lai, H Jin, W Lin, M Masuda, CH Yu, ... Advances in Neural Information Processing Systems 35, 35783-35796, 2022 | 30 | 2022 |
Cross-stream selective networks for action recognition B Pan, J Sun, W Lin, L Wang, W Lin Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 14 | 2019 |
Relax: composable abstractions for end-to-end dynamic machine learning R Lai, J Shao, S Feng, SS Lyubomirsky, B Hou, W Lin, Z Ye, H Jin, Y Jin, ... arXiv preprint arXiv:2311.02103, 2023 | 10 | 2023 |
Flashinfer: Efficient and customizable attention engine for llm inference serving Z Ye, L Chen, R Lai, W Lin, Y Zhang, S Wang, T Chen, B Kasikci, V Grover, ... arXiv preprint arXiv:2501.01005, 2025 | 5 | 2025 |