Chatterbox TTS
Use servers/tts/chatterbox/server.py to run a local voice-clone endpoint. It defaults to Chatterbox-Turbo, which preserves bracket delivery tags and matches TomoriBot’s fastest current behavior.

    python -m venv servers\tts\chatterbox\.venv
    servers\tts\chatterbox\.venv\Scripts\Activate.ps1
    python -m pip install --upgrade pip
    pip install numpy
    pip install -r servers\tts\chatterbox\requirements.txt
    python servers\tts\chatterbox\server.py

Register with /provider custom-endpoint add:
- capability: speech
- api_style: tts-clone
- endpoint_url: your wrapper URL, usually http://127.0.0.1:8011
- script_markup: bracket-tags
- supports_instruct: false
Then run /model speech, /speech voice-add, and /speech voice-assign.
Use /speech chatterbox parameters to tune the Chatterbox request payload:
- turbo defaults to true. When enabled, TomoriBot keeps supported Chatterbox-Turbo event tags and strips unsupported bracket descriptors before the wrapper uses ChatterboxTurboTTS.model.generate(...).
- cfg_weight defaults to 0.5. Minimum is 0; TomoriBot does not set a hard maximum. It only applies when turbo is false; lower values can help slow fast reference voices, while higher values follow the reference more strongly.
- exaggeration defaults to 0.5. Minimum is 0; TomoriBot does not set a hard maximum. It only applies when turbo is false; higher values make delivery more expressive or dramatic and may speed speech up.
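
The defaults and limits above can be sketched as a small validation helper. This is an illustration only: the function name build_chatterbox_params and the returned dictionary shape are hypothetical, and leaving cfg_weight and exaggeration out of the result when turbo is true is an assumption made here to show that they do not apply. The actual TomoriBot payload may still carry them.

```python
from typing import Any


def build_chatterbox_params(
    turbo: bool = True,
    cfg_weight: float = 0.5,
    exaggeration: float = 0.5,
) -> dict[str, Any]:
    """Hypothetical helper mirroring the documented defaults and minimums."""
    # Both tuning values have a minimum of 0 and no hard maximum.
    for name, value in (("cfg_weight", cfg_weight), ("exaggeration", exaggeration)):
        if value < 0:
            raise ValueError(f"{name} must be >= 0, got {value}")

    params: dict[str, Any] = {"turbo": turbo}
    # Assumption: the tuning values are only included when turbo is off,
    # because that is the only mode in which they have an effect.
    if not turbo:
        params["cfg_weight"] = cfg_weight
        params["exaggeration"] = exaggeration
    return params
```

For example, build_chatterbox_params(turbo=False, cfg_weight=0.3) yields a lower cfg_weight for a fast reference voice while leaving exaggeration at its default.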
Supported Turbo event tags are [clear throat], [sigh], [shush], [cough], [groan], [sniff], [gasp], [chuckle], and [laugh]. Unsupported descriptors such as [stammers], [blushes], or [smiles] are stripped instead of being sent to TTS.
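
A minimal sketch of this Turbo-mode filtering, assuming a simple regex over non-nested square brackets. The function name filter_turbo_tags is hypothetical, and the case-insensitive matching and whitespace cleanup are assumptions of this sketch, not confirmed TomoriBot behavior.

```python
import re

# Event tags listed as supported by Chatterbox-Turbo in this document.
SUPPORTED_TURBO_TAGS = {
    "clear throat", "sigh", "shush", "cough", "groan",
    "sniff", "gasp", "chuckle", "laugh",
}

# Matches one non-nested [ ... ] group and captures its contents.
_BRACKET = re.compile(r"\[([^\[\]]*)\]")


def filter_turbo_tags(text: str) -> str:
    """Keep supported event tags, drop every other bracket descriptor."""

    def keep_or_drop(match: re.Match) -> str:
        # Assumption: tags are compared case-insensitively and emitted lowercase.
        tag = match.group(1).strip().lower()
        return f"[{tag}]" if tag in SUPPORTED_TURBO_TAGS else ""

    filtered = _BRACKET.sub(keep_or_drop, text)
    # Collapse the double spaces left behind by removed descriptors.
    return re.sub(r"[ \t]{2,}", " ", filtered).strip()
```

With this sketch, "Oh [gasp] I [blushes] did not expect that [laugh]" becomes "Oh [gasp] I did not expect that [laugh]".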
When turbo is disabled, TomoriBot strips all bracket descriptors before sending text to TTS, then the wrapper lazily loads the standard ChatterboxTTS model and calls model.generate(..., cfg_weight, exaggeration).
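
The non-Turbo path can be sketched in two parts: removing every bracket descriptor, and loading the model only on first use. The names strip_all_bracket_tags and LazyModel are hypothetical, and the loader is passed in as a plain callable so the sketch runs without the Chatterbox package installed. In the real wrapper the loader would presumably construct the standard ChatterboxTTS model.

```python
import re
from typing import Any, Callable, Optional

# Matches one non-nested [ ... ] group.
_BRACKET = re.compile(r"\[[^\[\]]*\]")


def strip_all_bracket_tags(text: str) -> str:
    """Remove every bracket descriptor, supported by Turbo or not."""
    stripped = _BRACKET.sub("", text)
    # Collapse the double spaces left behind by removed descriptors.
    return re.sub(r"[ \t]{2,}", " ", stripped).strip()


class LazyModel:
    """Defer an expensive model load until the first request needs it."""

    def __init__(self, loader: Callable[[], Any]) -> None:
        self._loader = loader  # assumption: a zero-argument factory
        self._model: Optional[Any] = None

    def get(self) -> Any:
        # The loader runs once; later calls reuse the cached instance.
        if self._model is None:
            self._model = self._loader()
        return self._model
```

Loading lazily means a server that only ever handles Turbo requests never pays the startup and memory cost of the standard model.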