Model-based optimization of EULAG kernel on Intel Xeon Phi through load imbalancing A Lastovetsky, L Szustak, R Wyrzykowski IEEE Transactions on Parallel and Distributed Systems 28 (3), 787-797, 2016 | 53 | 2016 |
Adaptation of MPDATA heterogeneous stencil computation to Intel Xeon Phi coprocessor L Szustak, K Rojek, T Olas, L Kuczynski, K Halbiniak, P Gepner Scientific Programming, 14, 2015 | 41 | 2015 |
Using Intel Xeon Phi coprocessor to accelerate computations in MPDATA algorithm L Szustak, K Rojek, P Gepner Parallel Processing and Applied Mathematics: 10th International Conference …, 2014 | 31 | 2014 |
Parallelization of 2D MPDATA EULAG algorithm on hybrid architectures with GPU accelerators R Wyrzykowski, L Szustak, K Rojek Parallel Computing 40 (8), 425-447, 2014 | 30 | 2014 |
Performance enhancement of a dynamic K-means algorithm through a parallel adaptive strategy on multicore CPUs G Laccetti, M Lapegna, V Mele, D Romano, L Szustak Journal of Parallel and Distributed Computing 145, 34-41, 2020 | 28 | 2020 |
Adaptation of fluid model EULAG to graphics processing unit architecture KA Rojek, M Ciznicki, B Rosa, P Kopta, M Kulczewski, K Kurowski, ... Concurrency and Computation: Practice and Experience 27 (4), 937-957, 2015 | 26 | 2015 |
Towards efficient decomposition and parallelization of MPDATA on hybrid CPU-GPU cluster R Wyrzykowski, L Szustak, K Rojek, A Tomas Large-Scale Scientific Computing: 9th International Conference, LSSC 2013 …, 2014 | 21 | 2014 |
Correlation of performance optimizations and energy consumption for stencil-based application on Intel Xeon scalable processors L Szustak, R Wyrzykowski, T Olas, V Mele IEEE Transactions on Parallel and Distributed Systems 31 (11), 2582-2593, 2020 | 20 | 2020 |
Toward efficient distribution of MPDATA stencil computation on Intel MIC architecture L Szustak, K Rojek, R Wyrzykowski, P Gepner Proce. HiStencils 14 (51-56), 2.6, 2014 | 20 | 2014 |
Strategy for data-flow synchronizations in stencil parallel computations on multi-/manycore systems L Szustak The Journal of Supercomputing 74 (4), 1534-1546, 2018 | 16 | 2018 |
Model-driven adaptation of double-precision matrix multiplication to the cell processor architecture R Wyrzykowski, K Rojek, L Szustak Parallel Computing 38 (4-5), 260-276, 2012 | 14 | 2012 |
Porting and optimization of solidification application for CPU–MIC hybrid platforms L Szustak, K Halbiniak, L Kuczynski, J Wrobel, A Kulawik The International Journal of High Performance Computing Applications 32 (4 …, 2018 | 13 | 2018 |
Adaptation of multidimensional positive definite advection transport algorithm to modern high-performance computing platforms, Int B Rosa, L Szustak, AA Wyszogrodzki, K Rojek, D Wojcik, R Wyrzykowski Journal of Modeling and Optimization 5 (3), 2015 | 13 | 2015 |
Assessment of offload-based programming environments for hybrid CPU–MIC platforms in numerical modeling of solidification K Halbiniak, R Wyrzykowski, L Szustak, T Olas Simulation Modelling Practice and Theory 87, 48-72, 2018 | 12 | 2018 |
Performance exploration of various C/C++ compilers for AMD EPYC processors in numerical modeling of solidification K Halbiniak, R Wyrzykowski, L Szustak, A Kulawik, N Meyer, P Gepner Advances in Engineering Software 166, 103078, 2022 | 11 | 2022 |
Performance analysis for stencil-based 3D MPDATA algorithm on GPU architecture K Rojek, L Szustak, R Wyrzykowski Parallel Processing and Applied Mathematics: 10th International Conference …, 2014 | 11 | 2014 |
Using blue gene/P and GPUs to accelerate computations in the EULAG model R Wyrzykowski, K Rojek, Ł Szustak Large-Scale Scientific Computing: 8th International Conference, LSSC 2011 …, 2012 | 11 | 2012 |
Parallelization of EULAG model on multicore architectures with GPU accelerators K Rojek, L Szustak Parallel Processing and Applied Mathematics: 9th International Conference …, 2012 | 11 | 2012 |
Architectural adaptation and performance-energy optimization for CFD application on AMD EPYC Rome L Szustak, R Wyrzykowski, L Kuczynski, T Olas IEEE Transactions on Parallel and Distributed Systems 32 (12), 2852-2866, 2021 | 10 | 2021 |
Performance portable parallel programming of heterogeneous stencils across shared-memory platforms with modern Intel processors L Szustak, P Bratek The International Journal of High Performance Computing Applications 33 (3 …, 2019 | 9 | 2019 |