Neupims: Npu-pim heterogeneous acceleration for batched llm inferencing G Heo, S Lee, J Cho, H Choi, S Lee, H Ham, G Kim, D Mahajan, J Park Proceedings of the 29th ACM International Conference on Architectural …, 2024 | 26 | 2024 |
Hardware-hardened sandbox enclaves for trusted serverless computing J Park, S Kang, S Lee, T Kim, J Park, Y Kwon, J Huh ACM Transactions on Architecture and Code Optimization 21 (1), 1-25, 2024 | 4 | 2024 |
Improving Data Reuse in NPU On-chip Memory with Interleaved Gradient Order for DNN Training J Kim, S Na, S Lee, S Lee, J Huh Proceedings of the 56th Annual IEEE/ACM International Symposium on …, 2023 | 4 | 2023 |
Efficient LLM Inference with Activation Checkpointing and Hybrid Caching S Lee, H Kim, S Hwang, G Heo, M Noh, J Huh arXiv preprint arXiv:2501.01792, 2025 | | 2025 |
Supporting Trusted Virtual Machines with Hardware-Based Secure Remote Memory T Heo, S Kang, S Lee, S Hwang, J Park, J Huh Proceedings of the 2024 ACM SIGPLAN International Symposium on Memory …, 2024 | | 2024 |