Generate cinematic, human-sounding AI dialogue for game cut scenes using Microsoft Neural Voices, completely scriptable from Python.
The script takes a dialogue script with two characters, generates separate voice lines, then merges them into a single cutscene_edge.wav ready for Unity, Unreal, Godot, etc.
⚠️ This uses Microsoft’s neural TTS over the internet via Edge-TTS. Text is sent to Microsoft’s service; audio comes back. No API key required.
Works great with: