Last updated: November 3, 2025
This guide covers training nanochat on two DGX Sparks linked via QSFP/CX7. I estimate training will take about 5 days, roughly half the time it takes on a single DGX Spark.
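As a rough sanity check on that estimate, the arithmetic can be sketched as below. This is only a back-of-the-envelope model, not a measurement: the 10-day single-Spark figure is simply what the "about half" estimate implies, and `estimate_training_days` and its `scaling_efficiency` parameter are hypothetical names introduced here for illustration.

```python
def estimate_training_days(single_node_days: float,
                           num_nodes: int,
                           scaling_efficiency: float = 1.0) -> float:
    """Estimate wall-clock training days on a small cluster.

    scaling_efficiency is an assumed fraction (0 to 1] of a full node's
    throughput that each additional node contributes after communication
    overhead. 1.0 means perfect linear scaling.
    """
    if num_nodes < 1:
        raise ValueError("num_nodes must be at least 1")
    if not 0.0 < scaling_efficiency <= 1.0:
        raise ValueError("scaling_efficiency must be in (0, 1]")
    # The first node counts fully; each extra node adds a discounted share.
    effective_speedup = 1.0 + (num_nodes - 1) * scaling_efficiency
    return single_node_days / effective_speedup


# Implied single-Spark time: about 10 days (assumption derived from the estimate above).
SINGLE_SPARK_DAYS = 10.0

if __name__ == "__main__":
    # Perfect scaling on two Sparks reproduces the ~5 day estimate.
    print(estimate_training_days(SINGLE_SPARK_DAYS, num_nodes=2))
    # A lower, purely illustrative efficiency shows how the estimate stretches.
    print(estimate_training_days(SINGLE_SPARK_DAYS, num_nodes=2, scaling_efficiency=0.9))
```

The 5-day figure assumes near-linear scaling across the link; any efficiency below 1.0 pushes the two-Spark estimate somewhere between 5 and 10 days.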
Follow these NVIDIA tutorials to link and test your Spark cluster: