Deep configuration performance learning: A systematic survey and taxonomy
Performance is arguably the most crucial attribute that reflects the quality of a configurable
software system. However, given the increasing scale and complexity of modern software …
software system. However, given the increasing scale and complexity of modern software …
Predictive performance modeling for distributed batch processing using black box monitoring and machine learning
In many domains, the previous decade was characterized by increasing data volumes and
growing complexity of data analyses, creating new demands for batch processing on …
growing complexity of data analyses, creating new demands for batch processing on …
Accelerometer: Understanding acceleration opportunities for data center overheads at hyperscale
At global user population scale, important microservices in warehouse-scale data centers
can grow to account for an enormous installed base of servers. With the end of Dennard …
can grow to account for an enormous installed base of servers. With the end of Dennard …
Lean techniques impact evaluation methodology based on a co-simulation framework for manufacturing systems
Lean implementation plays a major role in optimizing productivity and reducing waste.
Applying the adequate integration of Lean Techniques (LT) can ensure a higher profitable …
Applying the adequate integration of Lean Techniques (LT) can ensure a higher profitable …
Exploring GPU performance, power and energy-efficiency bounds with Cache-aware Roofline Modeling
Optimization, portability and development of GPGPU applications are not trivial tasks, since
the capabilities and organization of GPU processing elements and memory subsystem …
the capabilities and organization of GPU processing elements and memory subsystem …
[HTML][HTML] Evaluating arm and risc-v architectures for high-performance computing with docker and kubernetes
This paper thoroughly assesses the ARM and RISC-V architectures in the context of high-
performance computing (HPC). It includes an analysis of Docker and Kubernetes …
performance computing (HPC). It includes an analysis of Docker and Kubernetes …
[PDF][PDF] A comprehensive review of efficient ray-tracing techniques for wireless communication
A comprehensive review of ray tracing (RT) techniques for wireless communication systems
is presented in this paper. The conventional techniques are described with respect to the …
is presented in this paper. The conventional techniques are described with respect to the …
Logca: A high-level performance model for hardware accelerators
With the end of Dennard scaling, architects have increasingly turned to special-purpose
hardware accelerators to improve the performance and energy efficiency for some …
hardware accelerators to improve the performance and energy efficiency for some …
Parallelization and optimization of NSGA-II on sunway TaihuLight system
X Liu, J Sun, L Zheng, S Wang, Y Liu… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Sunway TaihuLight system is the first supercomputer offering a peak performance over 100
PFlops, which can be utilized to parallelize Non-dominated Sorting Genetic Algorithm II …
PFlops, which can be utilized to parallelize Non-dominated Sorting Genetic Algorithm II …
Hpc ontology: Towards a unified ontology for managing training datasets and ai models for high-performance computing
Machine learning (ML) techniques have been widely studied to address various challenges
of productively and efficiently running large-scale scientific applications on heterogeneous …
of productively and efficiently running large-scale scientific applications on heterogeneous …