Deep configuration performance learning: A systematic survey and taxonomy

J Gong, T Chen - ACM Transactions on Software Engineering and …, 2024 - dl.acm.org
Performance is arguably the most crucial attribute that reflects the quality of a configurable
software system. However, given the increasing scale and complexity of modern software …

Predictive performance modeling for distributed batch processing using black box monitoring and machine learning

C Witt, M Bux, W Gusew, U Leser - Information Systems, 2019 - Elsevier
In many domains, the previous decade was characterized by increasing data volumes and
growing complexity of data analyses, creating new demands for batch processing on …

Accelerometer: Understanding acceleration opportunities for data center overheads at hyperscale

A Sriraman, A Dhanotia - Proceedings of the Twenty-Fifth International …, 2020 - dl.acm.org
At global user population scale, important microservices in warehouse-scale data centers
can grow to account for an enormous installed base of servers. With the end of Dennard …

Lean techniques impact evaluation methodology based on a co-simulation framework for manufacturing systems

J Possik, A Zouggar-Amrani, B Vallespir… - … Journal of Computer …, 2022 - Taylor & Francis
Lean implementation plays a major role in optimizing productivity and reducing waste.
Applying the adequate integration of Lean Techniques (LT) can ensure a higher profitable …

Exploring GPU performance, power and energy-efficiency bounds with Cache-aware Roofline Modeling

A Lopes, F Pratas, L Sousa, A Ilic - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
Optimization, portability and development of GPGPU applications are not trivial tasks, since
the capabilities and organization of GPU processing elements and memory subsystem …

Evaluating ARM and RISC-V architectures for high-performance computing with Docker and Kubernetes

V Dakić, L Mršić, Z Kunić, G Đambić - Electronics, 2024 - mdpi.com
This paper thoroughly assesses the ARM and RISC-V architectures in the context of high-
performance computing (HPC). It includes an analysis of Docker and Kubernetes …

A comprehensive review of efficient ray-tracing techniques for wireless communication

TK Geok, F Hossain, MN Kamaruddin… - International Journal …, 2018 - researchgate.net
A comprehensive review of ray tracing (RT) techniques for wireless communication systems
is presented in this paper. The conventional techniques are described with respect to the …

LogCA: A high-level performance model for hardware accelerators

MSB Altaf, DA Wood - ACM SIGARCH Computer Architecture News, 2017 - dl.acm.org
With the end of Dennard scaling, architects have increasingly turned to special-purpose
hardware accelerators to improve the performance and energy efficiency for some …

Parallelization and optimization of NSGA-II on Sunway TaihuLight system

X Liu, J Sun, L Zheng, S Wang, Y Liu… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Sunway TaihuLight system is the first supercomputer offering a peak performance over 100
PFlops, which can be utilized to parallelize Non-dominated Sorting Genetic Algorithm II …

HPC ontology: Towards a unified ontology for managing training datasets and AI models for high-performance computing

C Liao, PH Lin, G Verma… - 2021 IEEE/ACM …, 2021 - ieeexplore.ieee.org
Machine learning (ML) techniques have been widely studied to address various challenges
of productively and efficiently running large-scale scientific applications on heterogeneous …