The memory-bounded speedup model and its impacts in computing

XH Sun, X Lu - Journal of Computer Science and Technology, 2023 - Springer
With the surge of big data applications and the worsening of the memory-wall problem, the
memory system, instead of the computing unit, becomes the commonly recognized major …

Enabling PIM-based AES encryption for online video streaming

Y Liu, L Wang, A Qouneh, X Fu - Journal of Systems Architecture, 2022 - Elsevier
Encryption of streaming video is becoming critical to the success of commercial enterprise
and to consumers alike. To meet copyright and privacy requirements, encrypting video data …

CHROME: Concurrency-aware holistic cache management framework with online reinforcement learning

X Lu, H Najafi, J Liu, XH Sun - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
Cache management is a critical aspect of computer architecture, encompassing techniques
such as cache replacement, bypassing, and prefetching. Existing research has often …

CARE: A concurrency-aware enhanced lightweight cache management framework

X Lu, R Wang, XH Sun - 2023 IEEE International Symposium …, 2023 - ieeexplore.ieee.org
Improving cache performance is a lasting research topic. While utilizing data locality to
enhance cache performance becomes more and more difficult, data access concurrency …

Identifying Optimal Workload Offloading Partitions for CPU-PIM Graph Processing Accelerators

S Xu, C Li, L Luo, W Zhou, L Yan… - IEEE Transactions on …, 2025 - ieeexplore.ieee.org
The integrated architecture that features both in-memory logic and host processors, or so-
called “processing-in-memory”(PIM) architecture, is an emerging and promising solution to …

CoaT: Compiler-assisted Two-Stage Offloading Approach for Data-Intensive Applications Under NMP Framework

S Maity, M Goel, M Ghose - IEEE Transactions on Emerging …, 2024 - ieeexplore.ieee.org
As we head toward a data-centric era, conventional computing systems become inadequate
to meet the evolving demands of the applications. As a result, the near-memory processing …

ACES: Accelerating Sparse Matrix Multiplication with Adaptive Execution Flow and Concurrency-Aware Cache Optimizations

X Lu, B Long, X Chen, Y Han, XH Sun - Proceedings of the 29th ACM …, 2024 - dl.acm.org
Sparse matrix-matrix multiplication (SpMM) is a critical computational kernel in numerous
scientific and machine learning applications. SpMM involves massive irregular memory …

AceMiner: Accelerating Graph Pattern Matching using PIM with Optimized Cache System

L Yan, X Lu, X Chen, S Xu, X Zou… - 2024 IEEE 42nd …, 2024 - ieeexplore.ieee.org
Graph pattern matching (GPM), a critical algorithm for discovering specific patterns within
complex structures, is becoming increasingly important in the data-driven world. GPM …

Research on Performance Optimization of Spark Distributed Computing Platform.

Q He, F Zhang, G Bian, W Zhang… - Computers, Materials & …, 2024 - search.ebscohost.com
Spark, a distributed computing platform, has rapidly developed in the field of big data. Its in-
memory computing feature reduces disk read overhead and shortens data processing time …

Data Locality Aware Computation Offloading in Near Memory Processing Architecture for Big Data Applications

S Maity, M Goel, M Ghose - 2023 IEEE 30th International …, 2023 - ieeexplore.ieee.org
The data-intensive applications of today's big data era often produce a large memory
footprint. As a result, a significant volume of data needs to travel from memory to the CPU …