Quick Overview: OFI 2.0 Update” Jianxin Xiong, Intel 9:00-9:30 am PT. "Accelerating MPI AllReduce Communication with Efficient GPU-Based Compression Schemes on Modern GPU Clusters" Hari ... "High Performance & Scalable MPI library over Broadcom RoCE" Mustafa Abduljabbar, The Ohio State University; Hemal Shah, ...
Ofa Virtual Workshop 2024 Day - Detailed Overview & Context
OFI 2.0 Update” Jianxin Xiong, Intel 9:00-9:30 am PT. "Accelerating MPI AllReduce Communication with Efficient GPU-Based Compression Schemes on Modern GPU Clusters" Hari ... "High Performance & Scalable MPI library over Broadcom RoCE" Mustafa Abduljabbar, The Ohio State University; Hemal Shah, ... "OFI Integrated Shared Memory Offload" Speakers: Alexia Ingerson, Intel; Shi Jin, Amazon; and Amir Shehata, Oak Ridge National ... "Scaling Large Language Model Training using Hybrid GPU-based Compression in MVAPICH" Speakers: Aamir Shafi and Lang ... "Managing Composable Disaggregated Infrastructure With
"RecoNIC: RDMA-enabled Compute Offloading on FPGA-based SmartNIC" Speaker: Guanwen Zhong, AMD 10:45-11:15 am PT. "Designing In-Network Computing Aware Reduction Collectives in MPI" Speakers: Dhabaleswar Panda and Bharath Ramesh, ... Status of OpenFabrics Interfaces (OFI) Support in MPICH” Yanfei Guo, Argonne National Laboratory 9:45-10:15 am PT. "Optimized All-to-all Connection Establishment for High-Performance MPI Libraries over InfiniBand" Mustafa Abduljabbar and ... "How to setup RDMA CI using the FSDP cluster" and "How to do manual RDMA testing using the FSDP cluster" Doug Ledford, ... "An Integrated Deep Reinforcement Learning Agent for Sunfish and HPC Workload Manager Composable Disaggregated ...
Opening Remarks Phil Cayton, Intel 8:00-8:05 am PT. "System Composability Using CXL" Kurtis Bowman, CXL Consortium MWG Co-Chair 9:45-10:15 am PT.