Text to speech
Convert text into a spoken audio file. Only voices with supportsDirectToolExecution set to true can be used. Optionally choose a voice, language, speed, and pronunciation overrides.
Authentication
Request
Voice id from GET /v1/resources/tts-voices. A default voice is used when null. Only voices with supportsDirectToolExecution set to true are accepted.
ISO-639-1 language hint for pronunciation (e.g. en, es, zh).
When true, automatically expands numbers, symbols, acronyms, and other non-word tokens into their spoken forms before synthesis so the voice pronounces them correctly (e.g. $100 → one hundred dollars, NASA → nasa, 3rd → third). Defaults to false when omitted.
When true, generated files are temporary. Temporary files are guaranteed to be available for 24 hours, after which they may be archived at any time. Temporary files are not analyzed (no description, transcript, or embedding will be generated), so they will not appear in search results. Defaults to false.
Response
Execution accepted; poll until complete.
Execution id (e.g. vg_exec_...).