Openelm: An efficient language model family with open training and inference framework S Mehta, MH Sekhavat, Q Cao, M Horton, Y Jin, C Sun, SI Mirzadeh, ... Workshop on Efficient Systems for Foundation Models II@ ICML2024, 2024 | 31 | 2024 |
Openelm: An efficient language model family with open-source training and inference framework S Mehta, MH Sekhavat, Q Cao, M Horton, Y Jin, C Sun, I Mirzadeh, ... arXiv e-prints, arXiv: 2404.14619, 2024 | 27 | 2024 |
OpenELM: An Efficient Language Model Family with Open Training and Inference Framework. arXiv. org, April 2024 S Mehta, MH Sekhavat, Q Cao, M Horton, Y Jin, C Sun, I Mirzadeh, ... URL https://arxiv. org/abs/2404.14619 v1, 2024 | 6 | 2024 |
Catlip: Clip-level visual recognition accuracy with 2.7 x faster pre-training on web-scale image-text data S Mehta, M Horton, F Faghri, MH Sekhavat, M Najibi, M Farajtabar, ... arXiv preprint arXiv:2404.15653, 2024 | 3 | 2024 |
Computational Bottlenecks of Training Small-scale Large Language Models S Ashkboos, I Mirzadeh, K Alizadeh, MH Sekhavat, M Nabi, M Farajtabar, ... arXiv preprint arXiv:2410.19456, 2024 | 2 | 2024 |
Duo-LLM: A Framework for Studying Adaptive Computation in Large Language Models K Alizadeh, I Mirzadeh, H Shahrokhi, D Belenko, F Sun, M Cho, ... arXiv preprint arXiv:2410.10846, 2024 | 1 | 2024 |
Duo-LLM: A Framework for Studying Adaptive Computation in Large Language Models K Alizadeh-Vahid, SI Mirzadeh, H Shahrkokhi, D Belenko, F Sun, M Cho, ... NeurIPS Efficient Natural Language and Speech Processing Workshop, 443-455, 2024 | | 2024 |
An Efficient and Streaming Audio Visual Active Speaker Detection System A Kundu, Y Jin, M Sekhavat, M Horton, D Tormoen, D Naik arXiv preprint arXiv:2409.09018, 2024 | | 2024 |