Unified Designs of Multi-rail-aware MPI Allreduce and Alltoall Operations Across Diverse GPU and Interconnect Systems C. Chen, J. Yao, L. Xu, H. Subramoni, D. Panda 39th IEEE International Parallel & Distributed Processing Symposium, Jun 2025.