Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
A survey on compiler autotuning using machine learning
Since the mid-1990s, researchers have been trying to use machine-learning-based
approaches to solve a number of different compiler optimization problems. These …
approaches to solve a number of different compiler optimization problems. These …
Deep configuration performance learning: A systematic survey and taxonomy
Performance is arguably the most crucial attribute that reflects the quality of a configurable
software system. However, given the increasing scale and complexity of modern software …
software system. However, given the increasing scale and complexity of modern software …
End-to-end deep learning of optimization heuristics
Accurate automatic optimization heuristics are necessary for dealing with thecomplexity and
diversity of modern hardware and software. Machine learning is aproven technique for …
diversity of modern hardware and software. Machine learning is aproven technique for …
Bliss: auto-tuning complex applications using a pool of diverse lightweight learning models
As parallel applications become more complex, auto-tuning becomes more desirable,
challenging, and time-consuming. We propose, Bliss, a novel solution for auto-tuning …
challenging, and time-consuming. We propose, Bliss, a novel solution for auto-tuning …
Asymo: scalable and efficient deep-learning inference on asymmetric mobile cpus
On-device deep learning (DL) inference has attracted vast interest. Mobile CPUs are the
most common hardware for on-device inference and many inference frameworks have been …
most common hardware for on-device inference and many inference frameworks have been …
Synthesizing benchmarks for predictive modeling
Predictive modeling using machine learning is an effective method for building compiler
heuristics, but there is a shortage of benchmarks. Typical machine learning experiments …
heuristics, but there is a shortage of benchmarks. Typical machine learning experiments …
CLBlast: A tuned OpenCL BLAS library
C Nugteren - Proceedings of the International Workshop on OpenCL, 2018 - dl.acm.org
This work introduces CLBlast, an open-source BLAS library providing optimized OpenCL
routines to accelerate dense linear algebra for a wide variety of devices. It is targeted at …
routines to accelerate dense linear algebra for a wide variety of devices. It is targeted at …
[HTML][HTML] Kernel Tuner: A search-optimizing GPU code auto-tuner
B van Werkhoven - Future Generation Computer Systems, 2019 - Elsevier
A very common problem in GPU programming is that some combination of thread block
dimensions and other code optimization parameters, like tiling or unrolling factors, results in …
dimensions and other code optimization parameters, like tiling or unrolling factors, results in …
A benchmark set of highly-efficient CUDA and OpenCL kernels and its dynamic autotuning with Kernel Tuning Toolkit
In recent years, the heterogeneity of both commodity and supercomputers hardware has
increased sharply. Accelerators, such as GPUs or Intel Xeon Phi co-processors, are often …
increased sharply. Accelerators, such as GPUs or Intel Xeon Phi co-processors, are often …
Romou: Rapidly generate high-performance tensor kernels for mobile gpus
Mobile GPU, as a ubiquitous and powerful accelerator, plays an important role in
accelerating on-device DNN (Deep Neural Network) inference. The frequent-upgrade and …
accelerating on-device DNN (Deep Neural Network) inference. The frequent-upgrade and …