AI voice generator — at a glance
{
"engines": ["Google Gemini 3.1 Flash TTS", "Microsoft Edge TTS"],
"gemini_voices": 13,
"edge_voices": "400+ across 100+ languages",
"price_gemini": "$0.12 per 1,000 characters",
"price_edge_tts": "$0.00 (free, unlimited)",
"max_chars_per_generation": 50000,
"subscription": false,
"watermark": false,
"commercial_use": "permitted"
}
Engines + pricing from ViralMint's open-source pipeline (tts_service.py), updated 2026-06.
Frequently asked
What voices do creators actually use?
Kore and Puck are the two recommended voices for natural narration. Kore is firm and clear, Puck is upbeat and lively — most creators pick one as a default and use the others for character work or alternate styles. Click the play button on each voice card to hear a 5-second sample before you commit.
Is there a free tier?
Registered accounts get a small daily free allowance that covers light TTS usage. Beyond that, you pay per generation — typically a fraction of a cent for a short clip. The desktop app also bundles Edge TTS (Microsoft's free engine) for fully unlimited voiceover at the cost of slightly less natural delivery on some voices.
Can I use these voices commercially?
Yes. The TTS license permits commercial use of the generated audio. ViralMint adds no extra restriction — the mp3 you download is yours to use in YouTube videos, TikTok, podcasts, ads, courses, anything.
What happens to my script and audio?
The script goes through ViralMint's cloud handler to the Gemini 3.1 Flash TTS API. The generated mp3 is stored in your Library for 30 days so you can re-download it; after that the bytes are deleted. We don't keep training data and we don't share scripts with anyone.
How long can the script be?
Up to 50,000 characters per generation — about a 50-minute narration. For longer scripts, split them into chunks; the desktop app's Smart Video pipeline does this automatically when generating videos longer than the per-call limit.
Is there a free AI voice generator to download?
Yes — the ViralMint desktop app is a free download for macOS, Windows, and Linux, and it bundles Microsoft's Edge TTS engine: 400+ voices across 100+ languages at zero cost, with no character cap and no watermark. The premium Gemini 3.1 Flash TTS voices (Kore, Puck, and 11 more) are available pay-per-character at $0.12 per 1,000 characters with no subscription. So you get a genuinely free, downloadable AI voice generator, plus optional higher-fidelity voices when you want them.