2026.06.11

Training a Model on Multiple GPUs with Data Parallelism