Chatterbox TTS Demo
Generate high-quality speech from text with reference audio styling.
Text to synthesize (max chars 300)
Reference Audio File (Optional)
Exaggeration (Neutral = 0.5, extreme values can be unstable), range 0.25 to 2
CFG/Pace, range 0.2 to 1
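The input limits listed on this page (300 characters of text, Exaggeration from 0.25 to 2, CFG/Pace from 0.2 to 1) can be sketched as a small validation step that runs before synthesis. This is only an illustration, not the demo's actual code: the names TTSRequest and build_request are hypothetical, the CFG/Pace default of 0.5 is an assumption (the page states a neutral value only for Exaggeration), and truncating over-long text rather than rejecting it is also an assumption.

```python
from dataclasses import dataclass
from typing import Optional

# Limits taken from the controls on this page.
MAX_CHARS = 300
EXAGGERATION_RANGE = (0.25, 2.0)
CFG_PACE_RANGE = (0.2, 1.0)


@dataclass(frozen=True)
class TTSRequest:
    """Hypothetical container for validated demo inputs."""
    text: str
    exaggeration: float
    cfg_pace: float
    reference_audio_path: Optional[str] = None


def clamp(value: float, low: float, high: float) -> float:
    """Restrict value to the closed interval [low, high]."""
    return max(low, min(high, value))


def build_request(
    text: str,
    exaggeration: float = 0.5,   # neutral value stated on the page
    cfg_pace: float = 0.5,       # assumed default, not stated on the page
    reference_audio_path: Optional[str] = None,
) -> TTSRequest:
    """Truncate the text and clamp both sliders to the ranges shown in the UI."""
    return TTSRequest(
        text=text[:MAX_CHARS],  # assumption: truncate instead of raising an error
        exaggeration=clamp(exaggeration, *EXAGGERATION_RANGE),
        cfg_pace=clamp(cfg_pace, *CFG_PACE_RANGE),
        reference_audio_path=reference_audio_path,
    )


if __name__ == "__main__":
    request = build_request("Hello from the demo.", exaggeration=3.0, cfg_pace=0.1)
    print(request)
```

Out-of-range slider values are clamped to the nearest bound here, so a request always stays within what the controls allow; a stricter variant could raise an error instead.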
More options
Generate
Output Audio