The llama 3 herd of models A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ... arXiv preprint arXiv:2407.21783, 2024 | 2281 | 2024 |
Linqits: Big data on little clients ES Chung, JD Davis, J Lee ACM SIGARCH Computer Architecture News 41 (3), 261-272, 2013 | 147 | 2013 |
The llama 3 herd of models, 2024 A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ... URL https://arxiv. org/abs/2407.21783 2407, 21783, 0 | 101 | |
Mnnfast: A fast and scalable system architecture for memory-augmented neural networks H Jang, J Kim, JE Jo, J Lee, J Kim Proceedings of the 46th International Symposium on Computer Architecture …, 2019 | 84 | 2019 |
The llama 3 herd of models A Grattafiori, A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, ... arXiv e-prints, arXiv: 2407.21783, 2024 | 69 | 2024 |
Dcs: a fast and scalable device-centric server architecture J Ahn, D Kwon, Y Kim, M Ajdari, J Lee, J Kim Proceedings of the 48th International Symposium on Microarchitecture, 559-571, 2015 | 36 | 2015 |
GPUdmm: A high-performance and memory-oblivious GPU architecture using dynamic memory management Y Kim, J Lee, JE Jo, J Kim 2014 IEEE 20th International Symposium on High Performance Computer …, 2014 | 35 | 2014 |
First-generation inference accelerator deployment at facebook M Anderson, B Chen, S Chen, S Deng, J Fix, M Gschwind, A Kalaiah, ... arXiv preprint arXiv:2107.04140, 2021 | 34 | 2021 |
Rpstacks: Fast and accurate processor design space exploration using representative stall-event stacks J Lee, H Jang, J Kim 2014 47th Annual IEEE/ACM International Symposium on Microarchitecture, 255-267, 2014 | 27 | 2014 |
WSMeter: A performance evaluation methodology for Google's production warehouse-scale computers J Lee, C Kim, K Lin, L Cheng, R Govindaraju, J Kim Proceedings of the Twenty-Third International Conference on Architectural …, 2018 | 26 | 2018 |
ScaleGPU: GPU architecture for memory-unaware GPU programming Y Kim, J Lee, D Kim, J Kim IEEE Computer Architecture Letters 13 (2), 101-104, 2013 | 20 | 2013 |
Dcs-ctrl: a fast and flexible device-control mechanism for device-centric server architecture D Kwon, J Ahn, D Chae, M Ajdari, J Lee, S Bae, Y Kim, J Kim 2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture …, 2018 | 16 | 2018 |
DiagSim: Systematically diagnosing simulators for healthy simulations JE Jo, GH Lee, H Jang, J Lee, M Ajdari, J Kim ACM Transactions on Architecture and Code Optimization (TACO) 15 (1), 1-27, 2018 | 9 | 2018 |
RpStacks-MT: A high-throughput design evaluation methodology for multi-core processors H Jang, JE Jo, J Lee, J Kim 2018 51st Annual IEEE/ACM International Symposium on Microarchitecture …, 2018 | 4 | 2018 |
StressRight: Finding the right stress for accurate in-development system evaluation J Lee, H Jang, J Jo, G Lee, J Kim 2017 IEEE International Symposium on Performance Analysis of Systems and …, 2017 | 3 | 2017 |
Dtstorage: Dynamic tape-based storage for cost-effective and highly-available streaming service J Lee, J Ahn, C Park, J Kim 2016 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2016 | 3 | 2016 |
DLS: a fast and flexible neural network training system with fine-grained heterogeneous device orchestration P Park, J Lee, H Jeong, J Kim IEEE Transactions on Parallel and Distributed Systems 33 (11), 3194-3206, 2022 | 1 | 2022 |
Fast, Light-weight, and Accurate Performance Evaluation using Representative Datacenter Behaviors J Lee, D Min, I Byun, H Jang, J Kim Proceedings of the 24th International Middleware Conference, 220-233, 2023 | | 2023 |
WSMeter: A Fast, Accurate, and Low-Cost Performance Evaluation for Warehouse-Scale Computers J Lee, C Kim, K Lin, L Cheng, R Govindaraju, J Kim | | 2018 |