Cerberus: The power of choices in datacenter topology design-a throughput perspective
The bandwidth and latency requirements of modern datacenter applications have led
researchers to propose various topology designs using static, dynamic demand-oblivious …
researchers to propose various topology designs using static, dynamic demand-oblivious …
A throughput-centric view of the performance of datacenter topologies
While prior work has explored many proposed datacenter designs, only two designs, Clos-
based and expander-based, are generally considered practical because they can scale …
based and expander-based, are generally considered practical because they can scale …
Watch out for the bully! job interference study on dragonfly network
X Yang, J Jenkins, M Mubarak… - SC'16: Proceedings of …, 2016 - ieeexplore.ieee.org
High-radix, low-diameter dragonfly networks will be a common choice in next-generation
supercomputers. Preliminary studies show that random job placement with adaptive routing …
supercomputers. Preliminary studies show that random job placement with adaptive routing …
Analyzing network health and congestion in dragonfly-based supercomputers
The dragonfly topology is a popular choice for building high-radix, low-diameter, hierarchical
networks with high-bandwidth links. On Cray installations of the dragonfly network, job …
networks with high-bandwidth links. On Cray installations of the dragonfly network, job …
Evaluating HPC networks via simulation of parallel workloads
This paper presents an evaluation and comparison of three topologies that are popular for
building interconnection networks in large-scale supercomputers: torus, fat-tree, and …
building interconnection networks in large-scale supercomputers: torus, fat-tree, and …
Flexfly: Enabling a reconfigurable dragonfly through silicon photonics
The Dragonfly topology provides low-diameter connectivity for high-performance computing
with all-to-all global links at the inter-group level. Our traffic matrix characterization of various …
with all-to-all global links at the inter-group level. Our traffic matrix characterization of various …
Study of workload interference with intelligent routing on dragonfly
Dragonfly interconnect is a crucial network technol-ogy for supercomputers. To support
exascale systems, network resources are shared such that links and routers are not …
exascale systems, network resources are shared such that links and routers are not …
Predicting the performance impact of different fat-tree configurations
The fat-tree topology is one of the most commonly used network topologies in HPC systems.
Vendors support several options that can be configured when deploying fat-tree networks on …
Vendors support several options that can be configured when deploying fat-tree networks on …
Q-adaptive: A multi-agent reinforcement learning based routing on dragonfly network
High-radix interconnects such as Dragonfly and its variants rely on adaptive routing to
balance network traffic for optimum performance. Ideally, adaptive routing attempts to …
balance network traffic for optimum performance. Ideally, adaptive routing attempts to …
Megafly: A topology for exascale systems
In this paper we explore network topologies suitable for future exascale systems that need to
support over fifty thousand endpoints. With the increased necessity to use optics at higher …
support over fifty thousand endpoints. With the increased necessity to use optics at higher …