The memory-bounded speedup model and its impacts in computing
With the surge of big data applications and the worsening of the memory-wall problem, the
memory system, instead of the computing unit, becomes the commonly recognized major …
memory system, instead of the computing unit, becomes the commonly recognized major …
Enabling PIM-based AES encryption for online video streaming
Encryption of streaming video is becoming critical to the success of commercial enterprise
and to consumers alike. To meet copyright and privacy requirements, encrypting video data …
and to consumers alike. To meet copyright and privacy requirements, encrypting video data …
CHROME: Concurrency-aware holistic cache management framework with online reinforcement learning
Cache management is a critical aspect of computer architecture, encompassing techniques
such as cache replacement, bypassing, and prefetching. Existing research has often …
such as cache replacement, bypassing, and prefetching. Existing research has often …
CARE: A concurrency-aware enhanced lightweight cache management framework
Improving cache performance is a lasting research topic. While utilizing data locality to
enhance cache performance becomes more and more difficult, data access concurrency …
enhance cache performance becomes more and more difficult, data access concurrency …
Identifying Optimal Workload Offloading Partitions for CPU-PIM Graph Processing Accelerators
S Xu, C Li, L Luo, W Zhou, L Yan… - IEEE Transactions on …, 2025 - ieeexplore.ieee.org
The integrated architecture that features both in-memory logic and host processors, or so-
called “processing-in-memory”(PIM) architecture, is an emerging and promising solution to …
called “processing-in-memory”(PIM) architecture, is an emerging and promising solution to …
CoaT: Compiler-assisted Two-Stage Offloading Approach for Data-Intensive Applications Under NMP Framework
As we head toward a data-centric era, conventional computing systems become inadequate
to meet the evolving demands of the applications. As a result, the near-memory processing …
to meet the evolving demands of the applications. As a result, the near-memory processing …
ACES: Accelerating Sparse Matrix Multiplication with Adaptive Execution Flow and Concurrency-Aware Cache Optimizations
Sparse matrix-matrix multiplication (SpMM) is a critical computational kernel in numerous
scientific and machine learning applications. SpMM involves massive irregular memory …
scientific and machine learning applications. SpMM involves massive irregular memory …
AceMiner: Accelerating Graph Pattern Matching using PIM with Optimized Cache System
Graph pattern matching (GPM), a critical algorithm for discovering specific patterns within
complex structures, is becoming increasingly important in the data-driven world. GPM …
complex structures, is becoming increasingly important in the data-driven world. GPM …
Research on Performance Optimization of Spark Distributed Computing Platform.
Q He, F Zhang, G Bian, W Zhang… - Computers, Materials & …, 2024 - search.ebscohost.com
Spark, a distributed computing platform, has rapidly developed in the field of big data. Its in-
memory computing feature reduces disk read overhead and shortens data processing time …
memory computing feature reduces disk read overhead and shortens data processing time …
Data Locality Aware Computation Offloading in Near Memory Processing Architecture for Big Data Applications
The data-intensive applications of today's big data era often produce a large memory
footprint. As a result, a significant volume of data needs to travel from memory to the CPU …
footprint. As a result, a significant volume of data needs to travel from memory to the CPU …