Select any text in Windows and have it read aloud by a modern AI voice. This setup uses AutoHotkey for shortcuts and uv/Python for high-quality, low-latency streaming TTS.
- Modern AI Voices: Uses Microsoft Edge's Neural TTS (
en-US-AndrewNeural). - Zero-Latency Streaming: Audio starts playing as it streams, no waiting for full downloads.
- Local Caching: Repeatedly read text plays instantly from the local
audio_cache. - Atomic Dependencies: Uses
uvscript metadata to handle all Python requirements automatically. - Intelligent Logging: Automatic log rotation at 1MB to prevent bloat.
AppsKey(Menu Key): Read selected text aloud.Ctrl + AppsKey: Force Read (Useful for Neovim/Terminals by sending 'y' + 'Ctrl+C').Shift + AppsKey: Stop reading immediately (kills the player process).
- AutoHotkey v2:
scoop install autohotkey - Python 3.12+:
scoop install python - uv:
scoop install uv - ffmpeg:
scoop install ffmpeg(Required forffplay)
- Save
tts_reader_edge.pyandGlobalTTS.ahkto your preferred script folder. - Update the
uvPathandscriptPathinGlobalTTS.ahkto match your local file system. - Add
GlobalTTS.ahkto your Windows Startup folder (shell:startup) for global availability.
To change the voice, edit tts_reader_edge.py and change the VOICE variable. Popular options:
en-US-AndrewNeural(Male, Very Natural)en-US-AvaNeural(Female, Very Natural)en-GB-SoniaNeural(Female, British)