
TTS-1 HD

Text-to-Speech | OpenAI | Proxied

OpenAI's high-definition text-to-speech model, tuned for higher audio quality than the standard TTS-1 model.

Model Info
Terms and License
More information

Usage

TypeScript
const response = await env.AI.run(
  'openai/tts-1-hd',
  {
    text: 'Hello! Welcome to Cloudflare AI Gateway. Let me show you what we can do.',
  },
  {
    gateway: { id: 'default' },
  }
)
console.log(response)
Response 200

Examples

Storytelling — HD narration with the Fable voice
TypeScript
const response = await env.AI.run(
  'openai/tts-1-hd',
  {
    text: 'Once upon a time, in a kingdom beyond the clouds, there lived a young inventor who dreamed of building machines that could think.',
    voice: 'fable',
    speed: 0.9,
  },
  {
    gateway: { id: 'default' },
  }
)
console.log(response)
Response 200
Podcast Style — Conversational podcast narration
TypeScript
const response = await env.AI.run(
  'openai/tts-1-hd',
  {
    text: "So here's the thing about large language models — they're not actually thinking. They're predicting the next token based on patterns in their training data. But the results can be surprisingly coherent.",
    voice: 'echo',
  },
  {
    gateway: { id: 'default' },
  }
)
console.log(response)
Response 200
Shimmer Voice — Bright and expressive voice
TypeScript
const response = await env.AI.run(
  'openai/tts-1-hd',
  {
    text: 'Breaking news: scientists have discovered a new species of deep-sea fish that produces its own light using bioluminescence.',
    voice: 'shimmer',
  },
  {
    gateway: { id: 'default' },
  }
)
console.log(response)
Response 200
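The examples above differ only in `text`, `voice`, and `speed`; the model name and gateway options are the same each time. A minimal sketch of a helper that assembles those arguments is shown below. `buildTtsCall`, `TtsInput`, and `RunOptions` are hypothetical names introduced for illustration and are not part of any Cloudflare or OpenAI SDK. The sketch assumes the same `env.AI` binding the examples use.

```typescript
// Hypothetical helper: not part of any SDK, shown only to illustrate the call shape.
type Voice = 'alloy' | 'echo' | 'fable' | 'onyx' | 'nova' | 'shimmer';

interface TtsInput {
  text: string;
  voice?: Voice;
  speed?: number;
}

interface RunOptions {
  gateway: { id: string };
}

const MODEL = 'openai/tts-1-hd';

// Returns the three arguments that the examples above pass to env.AI.run.
function buildTtsCall(
  text: string,
  overrides: Omit<TtsInput, 'text'> = {},
  gatewayId: string = 'default',
): [typeof MODEL, TtsInput, RunOptions] {
  return [MODEL, { text, ...overrides }, { gateway: { id: gatewayId } }];
}

// Usage inside a Worker (assumes an `env.AI` binding, as in the examples above):
// const [model, input, options] = buildTtsCall('Once upon a time...', { voice: 'fable', speed: 0.9 });
// const response = await env.AI.run(model, input, options);
```

Keeping the argument construction separate from the network call makes it possible to unit-test the request shape without a Worker runtime.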

Parameters

text
string, required, maxLength: 4096
The text to generate audio for. Maximum length is 4096 characters.
voice
string, required, default: alloy, enum: alloy, echo, fable, onyx, nova, shimmer
The voice to use when generating the audio. Defaults to alloy.
response_format
string, required, default: mp3, enum: mp3, opus, wav, aac, flac
The output format for the audio. Supported formats are mp3, opus, wav, aac and flac.
speed
number, required, default: 1, minimum: 0.25, maximum: 4
The speed of the generated audio. Select a value from 0.25 to 4.0. 1.0 is the default.
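The constraints listed above can be checked on the client before a request is sent. The following is a rough sketch rather than an official validator: `validateTtsParams` and `TtsParams` are hypothetical names. Because the Usage example passes only `text`, the sketch assumes that `voice`, `response_format`, and `speed` may be omitted and then fall back to their documented defaults.

```typescript
const VOICES = ['alloy', 'echo', 'fable', 'onyx', 'nova', 'shimmer'] as const;
const FORMATS = ['mp3', 'opus', 'wav', 'aac', 'flac'] as const;

interface TtsParams {
  text: string;
  voice?: string;
  response_format?: string;
  speed?: number;
}

// Hypothetical validator: returns a list of problems, empty when the input looks acceptable.
function validateTtsParams(p: TtsParams): string[] {
  const errors: string[] = [];
  // `.length` counts UTF-16 code units, so this check is slightly conservative
  // for text containing characters outside the Basic Multilingual Plane.
  if (p.text.length > 4096) {
    errors.push('text exceeds 4096 characters');
  }
  if (p.voice !== undefined && (VOICES as readonly string[]).indexOf(p.voice) === -1) {
    errors.push(`unknown voice: ${p.voice}`);
  }
  if (p.response_format !== undefined && (FORMATS as readonly string[]).indexOf(p.response_format) === -1) {
    errors.push(`unknown response_format: ${p.response_format}`);
  }
  // The negated range test also rejects NaN.
  if (p.speed !== undefined && !(p.speed >= 0.25 && p.speed <= 4)) {
    errors.push('speed must be between 0.25 and 4');
  }
  return errors;
}
```

Returning a list of messages instead of throwing on the first failure lets a caller report every problem in one pass.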

API Schemas

{
  "$schema": "https://json-schema.org/draft/2020-12/schema",
  "type": "object",
  "properties": {
    "text": {
      "description": "The text to generate audio for. Maximum length is 4096 characters.",
      "type": "string",
      "maxLength": 4096
    },
    "voice": {
      "description": "The voice to use when generating the audio. Defaults to alloy.",
      "default": "alloy",
      "type": "string",
      "enum": [
        "alloy",
        "echo",
        "fable",
        "onyx",
        "nova",
        "shimmer"
      ]
    },
    "response_format": {
      "description": "The output format for the audio. Supported formats are mp3, opus, wav, aac and flac.",
      "default": "mp3",
      "type": "string",
      "enum": [
        "mp3",
        "opus",
        "wav",
        "aac",
        "flac"
      ]
    },
    "speed": {
      "description": "The speed of the generated audio. Select a value from 0.25 to 4.0. 1.0 is the default.",
      "default": 1,
      "type": "number",
      "minimum": 0.25,
      "maximum": 4
    }
  },
  "required": [
    "text",
    "voice",
    "response_format",
    "speed"
  ],
  "additionalProperties": false
}
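Because the schema caps `text` at 4096 characters, longer input has to be split across several requests. One possible approach, sketched below, breaks the text at sentence boundaries where it can and falls back to whitespace or a hard cut otherwise. `chunkText` is a hypothetical helper, and the splitting strategy is an assumption for illustration, not something the model documentation prescribes.

```typescript
const MAX_CHARS = 4096;

// Hypothetical helper: split text into chunks of at most `limit` characters,
// preferring to break after sentence-ending punctuation, then at a space.
function chunkText(text: string, limit: number = MAX_CHARS): string[] {
  const chunks: string[] = [];
  let rest = text.trim();
  while (rest.length > limit) {
    const head = rest.slice(0, limit);
    // Index of the last sentence-ending punctuation mark followed by a space.
    let cut = Math.max(
      head.lastIndexOf('. '),
      head.lastIndexOf('! '),
      head.lastIndexOf('? '),
    );
    if (cut > 0) {
      cut += 1; // keep the punctuation mark in the current chunk
    } else {
      cut = head.lastIndexOf(' '); // no sentence end found: break at a space
    }
    if (cut <= 0) {
      cut = limit; // no break point at all: hard split
    }
    chunks.push(rest.slice(0, cut).trim());
    rest = rest.slice(cut).trim();
  }
  if (rest.length > 0) {
    chunks.push(rest);
  }
  return chunks;
}
```

Each chunk would then be sent as its own request. How the resulting audio segments are joined depends on the chosen `response_format` and is left to the caller.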