Tensordimm: A practical near-memory processing architecture for embeddings and tensor operations in deep learning Y Kwon, Y Lee, M Rhu Proceedings of the 52nd Annual IEEE/ACM International Symposium on …, 2019 | 248 | 2019 |
Tensor casting: Co-designing algorithm-architecture for personalized recommendation training Y Kwon, Y Lee, M Rhu 2021 IEEE International Symposium on High-Performance Computer Architecture …, 2021 | 48 | 2021 |
Smartsage: training large-scale graph neural networks using in-storage processing architectures Y Lee, J Chung, M Rhu Proceedings of the 49th Annual International Symposium on Computer …, 2022 | 47 | 2022 |
Understanding the implication of non-volatile memory for large-scale graph neural network training Y Lee, Y Kwon, M Rhu IEEE Computer Architecture Letters 20 (2), 118-121, 2021 | 11 | 2021 |
PreSto: An In-Storage Data Preprocessing System for Training Recommendation Models Y Lee, H Kim, M Rhu 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture …, 2024 | 2 | 2024 |
FPGA-Accelerated Data Preprocessing for Personalized Recommendation Systems H Kim, Y Lee, M Rhu IEEE Computer Architecture Letters, 2023 | 2 | 2023 |
Neural network acceleration system and operating method thereof M Rhu, Y Kwon, Y Lee US Patent App. 16/922,333, 2021 | 1 | 2021 |
TensorDIMM Y Kwon, Y Lee, M Rhu Proceedings of the 52nd Annual IEEE/ACM International Symposium on …, 2019 | 1 | 2019 |
Debunking the CUDA Myth Towards GPU-based AI Systems Y Lee, J Lim, J Bang, E Cho, H Jeong, T Kim, H Kim, J Lee, J Im, R Hwang, ... arXiv preprint arXiv:2501.00210, 2024 | | 2024 |
Characterization and Analysis of the 3D Gaussian Splatting Rendering Pipeline J Lee, Y Lee, Y Kwon, M Rhu IEEE Computer Architecture Letters, 2024 | | 2024 |