Voice formatting plan
Format LLM output for natural-sounding speech
Format LLM output for natural-sounding speech
Voice formatting automatically transforms raw text from your language model (LLM) into a format that sounds natural when spoken by a text-to-speech (TTS) provider. This process—called Voice Input Formatted—is enabled by default for all assistants.
Formatting helps with things like:
$42.50 → “forty two dollars and fifty cents”)ST → “STREET”)123-456-7890 → “1 2 3 4 5 6 7 8 9 0”)You can turn off formatting if you want the TTS to read the raw LLM output.
When enabled, the formatter runs a series of transformations on your text, each handled by a specific function. Here’s the order and what each function does:
You can control some aspects of formatting:
Formatting is on by default. To disable, set:
Controls when numbers are read as digits instead of words.
2025 (current year)2025, numbers above this are read as digits.300000).Add exact or regex-based substitutions to customize output.
hello with hi:
Currently, only replacements and the number-to-digits cutoff are customizable. Other options are not exposed.
To disable all formatting and use raw LLM output, set either of these to false: