Skip to content

Instantly share code, notes, and snippets.

@trevin-creator
trevin-creator / two-node-exo-bootstrap.md
Created February 27, 2026 15:08
Qwen3.5-122B-A10B on 2× Mac Studio M4 Max via Exo + Thunderbolt 5 RDMA – Resilient Day-0 Bootstrap Guide

What This Is

A phase-gated bootstrap guide for running large models across two Apple Silicon nodes using EXO. Battle-tested on Mac Studio M4 Max (128 GB unified) pairs over Thunderbolt 5.

Reference performance (Qwen3.5-122B-A10B-4bit):

  • ~52 tok/s sustained at 512–1024 token range
  • Stable at concurrency=2 (p95 ~10.37s)