Constable: Improving Performance and Power Efficiency by Safely Eliminating Load Instruction Execution

R Bera, A Ranganathan, J Rakshit… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
Load instructions often limit instruction-level parallelism (ILP) in modern processors due to
data and resource dependences they cause. Prior techniques like Load Value Prediction …

Optimization of High-Performance Computing Job Scheduling Based on Offline Reinforcement Learning

S Li, W Dai, Y Chen, B Liang - Applied Sciences, 2024 - mdpi.com
In large-scale, distributed high-performance computing systems, the increasing complexity
of job scheduling has expanded along with the growth of computational resources and job …

LSTM-CRP: Algorithm-Hardware Co-Design and Implementation of Cache Replacement Policy Using Long Short-Term Memory

Y Wang, Y Meng, J Wang, C Yang - Big Data and Cognitive Computing, 2024 - mdpi.com
As deep learning has produced dramatic breakthroughs in many areas, it has motivated
emerging studies on the combination between neural networks and cache replacement …

ACES: Accelerating Sparse Matrix Multiplication with Adaptive Execution Flow and Concurrency-Aware Cache Optimizations

X Lu, B Long, X Chen, Y Han, XH Sun - Proceedings of the 29th ACM …, 2024 - dl.acm.org
Sparse matrix-matrix multiplication (SpMM) is a critical computational kernel in numerous
scientific and machine learning applications. SpMM involves massive irregular memory …

AceMiner: Accelerating Graph Pattern Matching using PIM with Optimized Cache System

L Yan, X Lu, X Chen, S Xu, X Zou… - 2024 IEEE 42nd …, 2024 - ieeexplore.ieee.org
Graph pattern matching (GPM), a critical algorithm for discovering specific patterns within
complex structures, is becoming increasingly important in the data-driven world. GPM …

Intelligent Data Management via Machine Learning: From Storage Hierarchy to Information Hierarchy

T Zhang - 2025 - diva-portal.org
Abstract Zhang, T. 2025. Intelligent Data Management via Machine Learning. From Storage
Hierarchy to Information Hierarchy. Digital Comprehensive Summaries of Uppsala …

Utilizing Concurrent Data Accesses for Data-Driven and AI Applications

X Lu - 2024 - search.proquest.com
In the evolving landscape of data-driven and AI applications, the imperative for reducing
data access delay has never been more critical, especially as these applications …