GraphPIM: Enabling instruction-level PIM offloading in graph computing frameworks L Nai, R Hadidi, J Sim, H Kim, P Kumar, H Kim 2017 IEEE International Symposium on High Performance Computer Architecture …, 2017 | 358 | 2017 |
Batch-aware unified memory management in GPUs for irregular workloads H Kim, J Sim, P Gera, R Hadidi, H Kim Proceedings of the Twenty-Fifth International Conference on Architectural …, 2020 | 83 | 2020 |
Cairo: A compiler-assisted technique for enabling instruction-level offloading of processing-in-memory R Hadidi, L Nai, H Kim, H Kim ACM Transactions on Architecture and Code Optimization (TACO) 14 (4), 1-25, 2017 | 77 | 2017 |
Traversing large graphs on GPUs with unified memory P Gera, H Kim, P Sao, H Kim, D Bader Proceedings of the VLDB Endowment 13 (7), 1119-1133, 2020 | 59 | 2020 |
Performance characterisation and simulation of Intel's integrated GPU architecture P Gera, H Kim, H Kim, S Hong, V George, CK Luk 2018 IEEE International Symposium on Performance Analysis of Systems and …, 2018 | 40 | 2018 |
OpenCL performance evaluation on modern multicore CPUs JH Lee, N Nigania, H Kim, K Patel, H Kim Scientific Programming 2015 (1), 859491, 2015 | 39 | 2015 |
CODA: Enabling co-location of computation and data for multiple GPU systems H Kim, R Hadidi, L Nai, H Kim, N Jayasena, Y Eckert, O Kayiran, G Loh ACM Transactions on Architecture and Code Optimization (TACO) 15 (3), 1-23, 2018 | 35 | 2018 |
Per-page control of physical address space distribution among memory modules NS Jayasena, H Kim, H Kim US Patent 10,282,309, 2019 | 32 | 2019 |
CoolPIM: Thermal-aware source throttling for efficient PIM instruction offloading L Nai, R Hadidi, H Xiao, H Kim, J Sim, H Kim 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2018 | 26 | 2018 |
FlashGPU: Placing new flash next to GPU cores J Zhang, M Kwon, H Kim, H Kim, M Jung Proceedings of the 56th Annual Design Automation Conference 2019, 1-6, 2019 | 22 | 2019 |
Accelerating application start-up with nonvolatile memory in Android systems H Kim, H Lim, D Manatunga, H Kim, GH Park IEEE Micro 35 (1), 15-25, 2015 | 21 | 2015 |
Understanding energy aspects of processing-near-memory for HPC workloads H Kim, H Kim, S Yalamanchili, AF Rodrigues Proceedings of the 2015 International Symposium on Memory Systems, 276-282, 2015 | 19 | 2015 |
LCP: A low-communication parallelization method for fast neural network inference in image recognition R Hadidi, B Asgari, J Cao, Y Bae, DE Shim, H Kim, SK Lim, MS Ryoo, ... arXiv preprint arXiv:2003.06464, 2020 | 9* | 2020 |
Thermal-aware processing-in-memory instruction offloading L Nai, R Hadidi, H Xiao, H Kim, J Sim, H Kim Journal of Parallel and Distributed Computing 130, 193-207, 2019 | 8 | 2019 |
Harmonica: An FPGA-based data parallel soft core C Kersey, S Yalamanchili, H Kim, N Nigania, H Kim 2014 IEEE 22nd Annual International Symposium on Field-Programmable Custom …, 2014 | 4 | 2014 |
LCP: A Low-Communication Parallelization Method for Fast Neural Network Inference for IoT R Hadidi, B Asgari, J Cao, Y Bae, H Kim, SK Lim, MS Ryoo, H Kim 2023 Congress in Computer Science, Computer Engineering, & Applied Computing …, 2023 | | 2023 |
Louvre: Lightweight Ordering Using Versioning for Release Consistency P Kumar, P Gera, H Kim, H Kim arXiv preprint arXiv:1710.10746, 2017 | | 2017 |
CODA: Enabling Co-location of Computation and Data for Near-Data Processing H Kim, R Hadidi, L Nai, H Kim, N Jayasena, Y Eckert, O Kayiran, GH Loh arXiv preprint arXiv:1710.09517, 2017 | | 2017 |
SimProf: A Sampling Framework for Data Analytic Workloads JC Huang, L Nai, P Kumar, H Kim, H Kim 2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2017 | | 2017 |
Chad D. Kersey, Hyesoon Kim, Sudhakar Yalamanchili N Braswell, J Gazhenko, P Gera, M Gupta, H Kim, JH Lee, T O’Neal, ... | | |