Skip to content

vishwakaria/topology-inference

Repository files navigation

topology-inference

Description

Infer EC2 cluster topology using RDMA reads

  • Measure time taken for RDMA read on each peer node
  • Cluster the read times to identify which nodes belong to the same spine

Build

  • smddputils viskaria -hbuild -C dlc-pt1131 -src /fsx/viskaria/topology -script build.sh

Run

  • ssh compute-dy-p4d24xlarge-10 "mpirun --hostfile /fsx/viskaria/topology/hostlist -np 32 --oversubscribe --allow-run-as-root /fsx/viskaria/topology/build/main"

About

Infer EC2 cluster topology using RDMA reads

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published