RDMA
Remote Direct Memory Access - a technology that allows direct memory access between computers without involving the CPU.
RDMA enables high-throughput, low-latency networking by bypassing the traditional network stack. It's commonly used in HPC clusters and AI training systems via InfiniBand or RoCE (RDMA over Converged Ethernet). RDMA dramatically reduces CPU overhead for network operations.
Explore More Terms
Kernel
The core component of an operating system that manages system resources and provides services to applications.
Tensor Core
Specialized processing units in NVIDIA GPUs designed for matrix operations common in deep learning.
CUDA
NVIDIA's parallel computing platform and programming model for general computing on GPUs.
NVMe
Non-Volatile Memory Express - a specification for accessing solid-state storage over PCIe.
cgroups
Control groups - a Linux kernel feature for limiting, accounting, and isolating resource usage of process collections.
Kubernetes
An open-source container orchestration platform for automating deployment, scaling, and management of containerized applications.