Constable: Improving Performance and Power Efficiency by Safely Eliminating Load Instruction Execution
Load instructions often limit instruction-level parallelism (ILP) in modern processors due to
data and resource dependences they cause. Prior techniques like Load Value Prediction …
data and resource dependences they cause. Prior techniques like Load Value Prediction …
[HTML][HTML] Optimization of High-Performance Computing Job Scheduling Based on Offline Reinforcement Learning
S Li, W Dai, Y Chen, B Liang - Applied Sciences, 2024 - mdpi.com
In large-scale, distributed high-performance computing systems, the increasing complexity
of job scheduling has expanded along with the growth of computational resources and job …
of job scheduling has expanded along with the growth of computational resources and job …
[HTML][HTML] LSTM-CRP: Algorithm-Hardware Co-Design and Implementation of Cache Replacement Policy Using Long Short-Term Memory
Y Wang, Y Meng, J Wang, C Yang - Big Data and Cognitive Computing, 2024 - mdpi.com
As deep learning has produced dramatic breakthroughs in many areas, it has motivated
emerging studies on the combination between neural networks and cache replacement …
emerging studies on the combination between neural networks and cache replacement …
ACES: Accelerating Sparse Matrix Multiplication with Adaptive Execution Flow and Concurrency-Aware Cache Optimizations
Sparse matrix-matrix multiplication (SpMM) is a critical computational kernel in numerous
scientific and machine learning applications. SpMM involves massive irregular memory …
scientific and machine learning applications. SpMM involves massive irregular memory …
AceMiner: Accelerating Graph Pattern Matching using PIM with Optimized Cache System
Graph pattern matching (GPM), a critical algorithm for discovering specific patterns within
complex structures, is becoming increasingly important in the data-driven world. GPM …
complex structures, is becoming increasingly important in the data-driven world. GPM …
Intelligent Data Management via Machine Learning: From Storage Hierarchy to Information Hierarchy
T Zhang - 2025 - diva-portal.org
Abstract Zhang, T. 2025. Intelligent Data Management via Machine Learning. From Storage
Hierarchy to Information Hierarchy. Digital Comprehensive Summaries of Uppsala …
Hierarchy to Information Hierarchy. Digital Comprehensive Summaries of Uppsala …
Utilizing Concurrent Data Accesses for Data-Driven and AI Applications
X Lu - 2024 - search.proquest.com
In the evolving landscape of data-driven and AI applications, the imperative for reducing
data access delay has never been more critical, especially as these applications …
data access delay has never been more critical, especially as these applications …