DMA what?
RDMA is a kernel feature to share buffers between hosts. The general idea is setting up a peer-to-peer DMA between NIC and transferring / sharing buffers for GPU workloads.
Starting in RHCOS 9 :
In the old way, each VM we create has a PCIE pass-through to GPU. which means that only one application / service can run on one GPU.
The Nvidia GPU operator is now extended and supports vGPU - taking one physical GPU and sharing to multiple instances.
The next evolution of GPU is using MIG ( multi instance GPU ) which partition a GPU based on Ampere architecture ( A100 + ) to different hardware slices which is more secure /isolated and better performance.
Links: