A taxonomy of scheduling in general-purpose distributed computing systems

TL Casavant, JG Kuhl - IEEE Transactions on software …, 1988 - ieeexplore.ieee.org
One measure of the usefulness of a general-purpose distributed computing system is the
system's ability to provide a level of performance commensurate to the degree of multiplicity …

A guide to dynamic load balancing in distributed computer systems

AM Alakeel - International journal of computer science and …, 2010 - researchgate.net
Load balancing is the process of redistributing the work load among nodes of the distributed
system to improve both resource utilization and job response time while also avoiding a …

Models of machines and computation for mapping in multicomputers

MG Norman, P Thanisch - ACM Computing Surveys (CSUR), 1993 - dl.acm.org
Nor is it always easy to assess the relevance of a new result to a particular problem.
Furthermore, changes in parallel computing technology have made some of the earlier work …

Topology-aware GPU scheduling for learning workloads in cloud environments

M Amaral, J Polo, D Carrera, S Seelam… - Proceedings of the …, 2017 - dl.acm.org
Recent advances in hardware, such as systems with multiple GPUs and their availability in
the cloud, are enabling deep learning in various domains including health care …

Topology-aware task mapping for reducing communication contention on large parallel machines

T Agarwal, A Sharma, A Laxmikant… - Proceedings 20th IEEE …, 2006 - ieeexplore.ieee.org
Communication latencies constitute a significant factor in the performance of parallel
applications. With techniques such as wormhole routing, the variation in no-load latencies …

A parallel genetic algorithm for the graph partitioning problem

EG Talbi, P Bessiere - Proceedings of the 5th International Conference …, 1991 - dl.acm.org
Genetic algorithms are stochastic search and optimization techniques which can be used for
a wide range of applications. This paper addresses the application of genetic algorithms to …

Hypernet: A communication-efficient architecture for constructing massively parallel computers

K Hwang, J Ghosh - IEEE Transactions on Computers, 1987 - ieeexplore.ieee.org
A new class of modular networks is proposed for hierarchically constructing massively
parallel computer systems for distributed supercomputing and AI applications. These …

Rectilinear partitioning of irregular data parallel computations

DM Nicol - Journal of Parallel and Distributed Computing, 1994 - Elsevier
This paper describes new mapping algorithms for domain-oriented data-parallel
computations, where the workload is distributed irregularly throughout the domain, but …

Type architectures, shared memory, and the corollary of modest potential

L Snyder - Annual review of computer science, 1986 - courses.cs.washington.edu
Likewise, when a long series of identical computations is to be performed, such as those
required for the formation of numerical tables, the machine can be brought into play so as to …

Avoiding hot-spots on two-level direct networks

A Bhatele, N Jain, WD Gropp, LV Kale - Proceedings of 2011 …, 2011 - dl.acm.org
A low-diameter, fast interconnection network is going to be a prerequisite for building
exascale machines. A two-level direct network has been proposed by several groups as a …