Supported models
The following models are supported for unified billing when using the AI Gateway REST API, including the OpenAI-compatible endpoint and provider-native endpoints.
For models available through the AI binding (env.AI.run()), refer to the model catalog.
| Provider | Model |
|---|---|
| Anthropic | claude-v1 |
| Anthropic | claude-1 |
| Anthropic | claude-1.2 |
| Anthropic | claude-2.0 |
| Anthropic | claude-2.0:beta |
| Anthropic | claude-2.1 |
| Anthropic | claude-2.1:beta |
| Anthropic | claude-2:beta |
| Anthropic | claude-3-5-haiku-20241022 |
| Anthropic | claude-3-5-haiku-20241022:beta |
| Anthropic | claude-3-5-haiku-latest |
| Anthropic | claude-3-5-sonnet-20240620 |
| Anthropic | claude-3-5-sonnet-20240620:beta |
| Anthropic | claude-3-5-sonnet-20241022 |
| Anthropic | claude-3-7-sonnet-20250219 |
| Anthropic | claude-3-7-sonnet-latest |
| Anthropic | claude-3-haiku |
| Anthropic | claude-3-haiku:beta |
| Anthropic | claude-3-haiku-20240307 |
| Anthropic | claude-3-opus |
| Anthropic | claude-3-opus:beta |
| Anthropic | claude-3-opus-20240229 |
| Anthropic | claude-3-sonnet |
| Anthropic | claude-3-sonnet:beta |
| Anthropic | claude-3-sonnet-20240229 |
| Anthropic | claude-3.5-haiku |
| Anthropic | claude-3.5-haiku:beta |
| Anthropic | claude-3.5-haiku-20241022 |
| Anthropic | claude-3.5-haiku-20241022:beta |
| Anthropic | claude-3.5-sonnet |
| Anthropic | claude-3.5-sonnet:beta |
| Anthropic | claude-3.5-sonnet-20240620 |
| Anthropic | claude-3.5-sonnet-20240620:beta |
| Anthropic | claude-3.7-sonnet |
| Anthropic | claude-3.7-sonnet:beta |
| Anthropic | claude-3.7-sonnet:thinking |
| Anthropic | claude-4-opus-20250522 |
| Anthropic | claude-4-sonnet-20250522 |
| Anthropic | claude-4.1-opus-20250805 |
| Anthropic | claude-4.5-haiku-20251001 |
| Anthropic | claude-4.5-opus-20251124 |
| Anthropic | claude-4.5-sonnet-20250929 |
| Anthropic | claude-haiku-4-5 |
| Anthropic | claude-haiku-4-5-20251001 |
| Anthropic | claude-instant-1.0 |
| Anthropic | claude-instant-1.1 |
| Anthropic | claude-instant-1:beta |
| Anthropic | claude-opus-4 |
| Anthropic | claude-opus-4-0 |
| Anthropic | claude-opus-4-1 |
| Anthropic | claude-opus-4-1-20250805 |
| Anthropic | claude-opus-4-5 |
| Anthropic | claude-opus-4-6 |
| Anthropic | claude-opus-4-20250514 |
| Anthropic | claude-opus-4.1 |
| Anthropic | claude-opus-4.5 |
| Anthropic | claude-sonnet-4-0 |
| Anthropic | claude-sonnet-4-5 |
| Anthropic | claude-sonnet-4-5-20250929 |
| Anthropic | claude-sonnet-4-6 |
| Anthropic | claude-sonnet-4-20250514 |
| Google AI Studio | gemini-1.5-flash |
| Google AI Studio | gemini-1.5-flash-8b |
| Google AI Studio | gemini-1.5-pro |
| Google AI Studio | gemini-2.0-flash |
| Google AI Studio | gemini-2.0-flash-001 |
| Google AI Studio | gemini-2.0-flash-exp:free |
| Google AI Studio | gemini-2.0-flash-lite |
| Google AI Studio | gemini-2.0-flash-lite-001 |
| Google AI Studio | gemini-2.0-flash-lite-preview-02-05:free |
| Google AI Studio | gemini-2.0-flash-thinking-exp-1219:free |
| Google AI Studio | gemini-2.0-flash-thinking-exp:free |
| Google AI Studio | gemini-2.0-pro-exp-02-05:free |
| Google AI Studio | gemini-2.5-flash |
| Google AI Studio | gemini-2.5-flash-image |
| Google AI Studio | gemini-2.5-flash-image-preview |
| Google AI Studio | gemini-2.5-flash-lite |
| Google AI Studio | gemini-2.5-flash-lite-preview-06-17 |
| Google AI Studio | gemini-2.5-flash-lite-preview-09-2025 |
| Google AI Studio | gemini-2.5-flash-preview |
| Google AI Studio | gemini-2.5-flash-preview:thinking |
| Google AI Studio | gemini-2.5-flash-preview-04-17 |
| Google AI Studio | gemini-2.5-flash-preview-05-20 |
| Google AI Studio | gemini-2.5-flash-preview-05-20:thinking |
| Google AI Studio | gemini-2.5-flash-preview-09-2025 |
| Google AI Studio | gemini-2.5-flash-preview-tts |
| Google AI Studio | gemini-2.5-pro |
| Google AI Studio | gemini-2.5-pro-exp-03-25 |
| Google AI Studio | gemini-2.5-pro-exp-03-25:free |
| Google AI Studio | gemini-2.5-pro-preview |
| Google AI Studio | gemini-2.5-pro-preview-05-06 |
| Google AI Studio | gemini-2.5-pro-preview-06-05 |
| Google AI Studio | gemini-2.5-pro-preview-tts |
| Google AI Studio | gemini-3-flash-preview |
| Google AI Studio | gemini-3-flash-preview-20251217 |
| Google AI Studio | gemini-3-pro-preview |
| Google AI Studio | gemini-3-pro-preview-20251117 |
| Google AI Studio | gemini-3.1-flash-image-preview |
| Google AI Studio | gemini-3.1-flash-lite-preview |
| Google AI Studio | gemini-3.1-pro-preview |
| Google AI Studio | gemini-3.1-pro-preview-20260219 |
| Google AI Studio | gemini-embedding-001 |
| Google AI Studio | gemini-exp-1114 |
| Google AI Studio | gemini-exp-1114:free |
| Google AI Studio | gemini-exp-1121:free |
| Google AI Studio | gemini-exp-1206:free |
| Google AI Studio | gemini-flash-1.5 |
| Google AI Studio | gemini-flash-1.5-8b |
| Google AI Studio | gemini-flash-1.5-8b-exp |
| Google AI Studio | gemini-flash-1.5-exp |
| Google AI Studio | gemini-flash-8b-1.5-exp |
| Google AI Studio | gemini-flash-latest |
| Google AI Studio | gemini-flash-lite-latest |
| Google AI Studio | gemini-live-2.5-flash |
| Google AI Studio | gemini-live-2.5-flash-preview-native-audio |
| Google AI Studio | gemini-pro |
| Google AI Studio | gemini-pro-1.5 |
| Google AI Studio | gemini-pro-1.5-exp |
| Google AI Studio | gemini-pro-vision |
| Google AI Studio | gemma-2-27b-it |
| Google AI Studio | gemma-2-9b-it |
| Google AI Studio | gemma-2-9b-it:free |
| Google AI Studio | gemma-2b-it |
| Google AI Studio | gemma-3-12b-it |
| Google AI Studio | gemma-3-12b-it:free |
| Google AI Studio | gemma-3-1b-it:free |
| Google AI Studio | gemma-3-27b-it |
| Google AI Studio | gemma-3-27b-it:free |
| Google AI Studio | gemma-3-4b-it |
| Google AI Studio | gemma-3-4b-it:free |
| Google AI Studio | gemma-3n-e2b-it:free |
| Google AI Studio | gemma-3n-e4b-it |
| Google AI Studio | gemma-3n-e4b-it:free |
| Google AI Studio | gemma-7b-it |
| Google AI Studio | gemma-7b-it:free |
| Google AI Studio | gemma-7b-it:nitro |
| Google AI Studio | learnlm-1.5-pro-experimental:free |
| Google AI Studio | palm-2-chat-bison |
| Google AI Studio | palm-2-chat-bison-32k |
| Google AI Studio | palm-2-codechat-bison |
| Google AI Studio | palm-2-codechat-bison-32k |
| Google Vertex AI | gemini-2.0-flash |
| Google Vertex AI | gemini-2.0-flash-lite |
| Google Vertex AI | gemini-2.5-flash |
| Google Vertex AI | gemini-2.5-flash-lite |
| Google Vertex AI | gemini-2.5-flash-lite-preview-06-17 |
| Google Vertex AI | gemini-2.5-flash-lite-preview-09-2025 |
| Google Vertex AI | gemini-2.5-flash-preview-04-17 |
| Google Vertex AI | gemini-2.5-flash-preview-05-20 |
| Google Vertex AI | gemini-2.5-flash-preview-09-2025 |
| Google Vertex AI | gemini-2.5-pro |
| Google Vertex AI | gemini-2.5-pro-preview-05-06 |
| Google Vertex AI | gemini-2.5-pro-preview-06-05 |
| Google Vertex AI | gemini-embedding-001 |
| Google Vertex AI | gemini-flash-latest |
| Google Vertex AI | gemini-flash-lite-latest |
| Google Vertex AI | veo-3.0-generate-001 |
| xAI | grok-2 |
| xAI | grok-2-1212 |
| xAI | grok-2-latest |
| xAI | grok-2-vision |
| xAI | grok-2-vision-1212 |
| xAI | grok-2-vision-latest |
| xAI | grok-3 |
| xAI | grok-3-beta |
| xAI | grok-3-fast |
| xAI | grok-3-fast-latest |
| xAI | grok-3-latest |
| xAI | grok-3-mini |
| xAI | grok-3-mini-beta |
| xAI | grok-3-mini-fast |
| xAI | grok-3-mini-fast-latest |
| xAI | grok-3-mini-latest |
| xAI | grok-4 |
| xAI | grok-4-07-09 |
| xAI | grok-4-1-fast |
| xAI | grok-4-1-fast-non-reasoning |
| xAI | grok-4-1-fast-reasoning |
| xAI | grok-4-1-fast-reasoning-latest |
| xAI | grok-4-fast |
| xAI | grok-4-fast-non-reasoning |
| xAI | grok-4-fast-reasoning |
| xAI | grok-4-fast-reasoning-latest |
| xAI | grok-4.1-fast |
| xAI | grok-4.1-fast-non-reasoning |
| xAI | grok-4.20 |
| xAI | grok-4.20-0309-non-reasoning |
| xAI | grok-4.20-0309-reasoning |
| xAI | grok-4.20-beta-latest-non-reasoning |
| xAI | grok-beta |
| xAI | grok-code-fast-1 |
| xAI | grok-vision-beta |
| Groq | deepseek-r1-distill-llama-70b |
| Groq | gemma2-9b-it |
| Groq | llama-3.1-8b-instant |
| Groq | llama-3.3-70b-versatile |
| Groq | llama-guard-3-8b |
| Groq | llama3-70b-8192 |
| Groq | llama3-8b-8192 |
| Groq | meta-llama/llama-4-maverick-17b-128e-instruct |
| Groq | meta-llama/llama-4-scout-17b-16e-instruct |
| Groq | meta-llama/llama-guard-4-12b |
| Groq | mistral-saba-24b |
| Groq | moonshotai/kimi-k2-instruct |
| Groq | moonshotai/kimi-k2-instruct-0905 |
| Groq | openai/gpt-oss-120b |
| Groq | openai/gpt-oss-20b |
| Groq | qwen-qwq-32b |
| Groq | qwen/qwen3-32b |
| OpenAI | chatgpt-4o-latest |
| OpenAI | codex-mini |
| OpenAI | codex-mini-latest |
| OpenAI | gpt-3.5-turbo |
| OpenAI | gpt-3.5-turbo-0613 |
| OpenAI | gpt-3.5-turbo-16k |
| OpenAI | gpt-3.5-turbo-instruct |
| OpenAI | gpt-4 |
| OpenAI | gpt-4-0314 |
| OpenAI | gpt-4-1106-preview |
| OpenAI | gpt-4-turbo |
| OpenAI | gpt-4-turbo-preview |
| OpenAI | gpt-4.1-2025-04-14 |
| OpenAI | gpt-4.1-mini-2025-04-14 |
| OpenAI | gpt-4.1-nano |
| OpenAI | gpt-4.1-nano-2025-04-14 |
| OpenAI | gpt-4.5-preview |
| OpenAI | gpt-4o |
| OpenAI | gpt-4o-2024-05-13 |
| OpenAI | gpt-4o-2024-08-06 |
| OpenAI | gpt-4o-2024-11-20 |
| OpenAI | gpt-4o-audio-preview |
| OpenAI | gpt-4o-mini |
| OpenAI | gpt-4o-mini-2024-07-18 |
| OpenAI | gpt-4o-mini-search-preview |
| OpenAI | gpt-4o-mini-search-preview-2025-03-11 |
| OpenAI | gpt-4o-search-preview |
| OpenAI | gpt-4o-search-preview-2025-03-11 |
| OpenAI | gpt-4o:extended |
| OpenAI | gpt-5-2025-08-07 |
| OpenAI | gpt-5-chat |
| OpenAI | gpt-5-chat-2025-08-07 |
| OpenAI | gpt-5-chat-latest |
| OpenAI | gpt-5-codex |
| OpenAI | gpt-5-image |
| OpenAI | gpt-5-image-mini |
| OpenAI | gpt-5-mini |
| OpenAI | gpt-5-mini-2025-08-07 |
| OpenAI | gpt-5-nano |
| OpenAI | gpt-5-nano-2025-08-07 |
| OpenAI | gpt-5-pro |
| OpenAI | gpt-5-pro-2025-10-06 |
| OpenAI | gpt-5.1 |
| OpenAI | gpt-5.1-20251113 |
| OpenAI | gpt-5.1-chat |
| OpenAI | gpt-5.1-chat-20251113 |
| OpenAI | gpt-5.1-codex |
| OpenAI | gpt-5.1-codex-20251113 |
| OpenAI | gpt-5.1-codex-mini |
| OpenAI | gpt-5.1-codex-mini-20251113 |
| OpenAI | gpt-5.2 |
| OpenAI | gpt-5.2-20251211 |
| OpenAI | gpt-5.2-chat |
| OpenAI | gpt-5.2-chat-20251211 |
| OpenAI | gpt-5.2-codex |
| OpenAI | gpt-5.2-codex-20260114 |
| OpenAI | gpt-5.2-pro |
| OpenAI | gpt-5.2-pro-20251211 |
| OpenAI | gpt-5.3-codex |
| OpenAI | gpt-5.3-codex-20260224 |
| OpenAI | gpt-5.3-codex-spark |
| OpenAI | gpt-5.4-pro |
| OpenAI | gpt-oss-120b:exacto |
| OpenAI | gpt-oss-20b:free |
| OpenAI | gpt-oss-safeguard-20b |
| OpenAI | o1 |
| OpenAI | o1-2024-12-17 |
| OpenAI | o1-mini |
| OpenAI | o1-mini-2024-09-12 |
| OpenAI | o1-preview |
| OpenAI | o1-pro |
| OpenAI | o3 |
| OpenAI | o3-2025-04-16 |
| OpenAI | o3-deep-research |
| OpenAI | o3-deep-research-2025-06-26 |
| OpenAI | o3-mini |
| OpenAI | o3-mini-2025-01-31 |
| OpenAI | o3-mini-high |
| OpenAI | o3-mini-high-2025-01-31 |
| OpenAI | o3-pro |
| OpenAI | o3-pro-2025-06-10 |
| OpenAI | o4-mini-2025-04-16 |
| OpenAI | o4-mini-deep-research |
| OpenAI | o4-mini-deep-research-2025-06-26 |
| OpenAI | o4-mini-high |
| OpenAI | o4-mini-high-2025-04-16 |
| OpenAI | text-embedding-3-large |
| OpenAI | text-embedding-3-small |
| OpenAI | text-embedding-ada-002 |