Skip to content
OpenAI logo

TTS-1 HD

Text-to-SpeechOpenAIProxied

OpenAI's high-definition text-to-speech model producing higher quality audio output.

Model Info
Terms and Licenselink
More informationlink
PricingView pricing in the Cloudflare dashboard

Usage

TypeScript
const response = await env.AI.run(
'openai/tts-1-hd',
{
response_format: 'mp3',
speed: 1,
text: 'Hello! Welcome to Cloudflare AI Gateway. Let me show you what we can do.',
voice: 'alloy',
},
)
console.log(response)

Examples

Storytelling — HD narration with the Fable voice
TypeScript
const response = await env.AI.run(
'openai/tts-1-hd',
{
response_format: 'mp3',
speed: 0.9,
text: 'Once upon a time, in a kingdom beyond the clouds, there lived a young inventor who dreamed of building machines that could think.',
voice: 'fable',
},
)
console.log(response)
Podcast Style — Conversational podcast narration
TypeScript
const response = await env.AI.run(
'openai/tts-1-hd',
{
response_format: 'mp3',
speed: 1,
text: "So here's the thing about large language models — they're not actually thinking. They're predicting the next token based on patterns in their training data. But the results can be surprisingly coherent.",
voice: 'echo',
},
)
console.log(response)
Shimmer Voice — Bright and expressive voice
TypeScript
const response = await env.AI.run(
'openai/tts-1-hd',
{
response_format: 'mp3',
speed: 1,
text: 'Breaking news: scientists have discovered a new species of deep-sea fish that produces its own light using bioluminescence.',
voice: 'shimmer',
},
)
console.log(response)

Parameters

response_format
stringrequireddefault: mp3enum: mp3, opus, wav, aac, flacThe output format for the audio. Supported formats are mp3, opus, wav, aac and flac.
speed
numberrequireddefault: 1maximum: 4minimum: 0.25The speed of the generated audio. Select a value from 0.25 to 4.0. 1.0 is the default.
text
stringrequiredmaxLength: 4096The text to generate audio for. Maximum length is 4096 characters.
voice
stringrequireddefault: alloyenum: alloy, echo, fable, onyx, nova, shimmerThe voice to use when generating the audio. Defaults to alloy.

API Schemas (Raw)

Input
Output