To install MLX
pip install mlx[cuda13]Then install mlx-lm:
pip install mlx-lm
Run generation:
mlx_lm.generate --model Qwen/Qwen3-4B-Instruct-2507 --prompt "Tell me a story about Einstein"
LoRA fine-tune:
mlx_lm.lora --model Qwen/Qwen3-4B-Instruct-2507 --data mlx-community/WikiSQL --train
Awesome, thanks! Struggled with this until I found your solution.