Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Gemini 3.1 Flash TTS – with directed prompts (simonwillison.net)
17 points by aanet 1 day ago | hide | past | favorite | 5 comments
 help



Hope they release an offline model for Ollama, a small one easy to work with for TTS in other languages.

So a free model? Tons of other people doing similar with amazing results https://huggingface.co/spaces/Inferless/Open-Source-TTS-Gall...

No matter what I wrote in the audio profile, AI Studio never followed it, regardless of scene or context.

For example, I tried to get a male voice and kept getting female ones. Not sure if it's an AI Studio bug or I was doing something wrong.


voice is determined by the voice parameter, you can't control it via the prompt, the prompt only directs how the chosen voices delivers the lines.

The 3 examples, with three distinct styles, are fascinating.

I'd like to see one with cockney accent, just for lulz




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: