BananaMind TTS V2

English fixed-voice text-to-speech with a Tacotron-lite acoustic model and a self-trained HiFi-GAN vocoder.

Text

Max steps (range: 200 to 1600)

Stop threshold (range: 0.2 to 0.9)

Attention window (range: 0 to 32)

Normalize waveform

Use FP32 vocoder
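
The Max steps and Stop threshold controls suggest an autoregressive decoder that emits one mel frame per step until a predicted stop probability crosses the threshold or the step cap is reached, which is how Tacotron-style models commonly terminate. The sketch below illustrates that rule, plus a simple peak normalization as one plausible reading of Normalize waveform. All names (decode_frames, step_fn, peak_normalize) and default values are hypothetical and are not taken from the BananaMind code.

```python
from typing import Callable, List, Sequence, Tuple

Frame = List[float]


def decode_frames(
    step_fn: Callable[[int], Tuple[Frame, float]],
    max_steps: int = 800,         # assumed default, inside the 200 to 1600 range
    stop_threshold: float = 0.5,  # assumed default, inside the 0.2 to 0.9 range
) -> List[Frame]:
    """Collect frames until the stop probability exceeds the threshold
    or max_steps frames have been produced.

    step_fn is a hypothetical stand-in for one decoder step: it takes the
    step index and returns (mel_frame, stop_probability).
    """
    frames: List[Frame] = []
    for step in range(max_steps):
        frame, stop_prob = step_fn(step)
        frames.append(frame)
        if stop_prob > stop_threshold:
            break  # the model signals that the utterance is finished
    return frames


def peak_normalize(samples: Sequence[float], peak: float = 0.95) -> List[float]:
    """Scale a waveform so its largest absolute sample equals `peak`.

    Silent or empty input is returned unchanged to avoid dividing by zero.
    """
    largest = max((abs(s) for s in samples), default=0.0)
    if largest == 0.0:
        return list(samples)
    scale = peak / largest
    return [s * scale for s in samples]


def fake_step(step: int) -> Tuple[Frame, float]:
    """Toy decoder step: the stop probability rises by 0.1 per step."""
    return [0.0], step / 10


if __name__ == "__main__":
    print(len(decode_frames(fake_step, max_steps=200, stop_threshold=0.5)))
    print(peak_normalize([0.1, -0.5, 0.25], peak=1.0))
```

Under this reading, a lower stop threshold ends synthesis earlier and risks clipping the last word, while a higher one relies more on Max steps to bound runaway generation.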

Generated audio

