This document provides the definitive setup for a high-performance, local AI development environment on an M4 Max (36GB RAM). It leverages Speculative Decoding, AST-based Indexing, and Kernel-level Memory Tuning.
On an M4 Max with 36GB, the "Sweet Spot" is running a 35B-parameter model (The Architect) and a 1.5B-parameter model (The Draft Model).