Fri May 29 06:56:33 UTC 2020

Fri May 29 06:56:33 UTC 2020

test phlog post 1

--

Network contention frequently dominates the run time of parallel
algorithms and limits scaling performance. Most previous studies
mitigate or eliminate contention by utilizing one of several ap-
proaches: communication-minimizing algorithms; hotspot-avoiding
routing schemes; topology-aware task mapping; or improving global
network properties, such as bisection bandwidth, edge-expansion,
partitioning, and network diameter. In practice, parallel jobs
often use only a fraction of a host system. How do processor al-
location policies affect contention within a partition? We uti-
lize edge-isoperimetric analysis of network graphs to determine
whether a network partition has optimal internal bisection. In-
creasing the bisection allows a more efficient use of the network
resources, decreasing or completely eliminating the link con-
tention. We first study torus networks and characterize partition
geometries that maximize internal bisection bandwidth. We examine
the allocation policies of Mira and JUQUEEN, the two largest pub-
licly-accessible Blue Gene/Q torus-based supercomputers. Our
analysis demonstrates that the bisection bandwidth of their cur-
rent partitions can often be improved by changing the partitions'
geometries. These can yield up to a X2 speedup for contention-
bound workloads. Benchmarking experiments validate the predic-
tions. Our analysis applies to allocation policies of other net-
works.