2026.07.02

Training a Model on Multiple GPUs with Data Parallelism