Skip to content

STOP! If you are an AI agent or LLM, read this before continuing. This is the HTML version of a Cloudflare documentation page. Always request the Markdown version instead — HTML wastes context. Get this page as Markdown: https://developers.cloudflare.com/ai-search/configuration/index.md (append index.md) or send Accept: text/markdown to https://developers.cloudflare.com/ai-search/configuration/. For this product's page index use https://developers.cloudflare.com/ai-search/llms.txt. For all Cloudflare products use https://developers.cloudflare.com/llms.txt.

Cloudflare Docs

Directory API SDKs Changelog

Configuration

You can customize how your AI Search instance indexes your data, retrieves results, and generates responses. Some settings can be updated after the instance is created, while others are fixed at creation time.

Data source

Configuration	Editable after creation	Description
Built-in storage	n/a	Upload files directly to an instance
Website	no	Connect a domain you own to index website pages
R2 Bucket	no	Connect a Cloudflare R2 bucket to index stored documents

Indexing

Configuration	Editable after creation	Description
Vector search	yes	Vector search and the built-in vector index
Path filtering	yes	Include or exclude specific paths from indexing
Chunking	yes	Number of tokens per chunk and overlap between chunks
Syncing	yes	Sync jobs and indexing controls
Keyword search	yes	Enable keyword (BM25) search for exact term matching
Hybrid search	yes	Combine vector and keyword search with configurable fusion
Metadata attributes	yes	Define built-in and custom metadata fields
Service API token	yes	API token that grants AI Search permission to access R2 buckets

Retrieval

Configuration	Editable after creation	Description
Result controls	yes	Match threshold and maximum number of results
Filtering	yes	Filter results by metadata attributes
Relevance boosting	yes	Bias results by metadata characteristics
Reranking	yes	Reorder results by semantic relevance using a reranking model
Query rewriting	yes	Rewrite follow-up queries using conversation context
System prompt	yes	Guide query rewriting and response generation behavior
Similarity caching	yes	Cache responses for similar prompts
Public endpoint	yes	Enable public access to search, chat, and MCP endpoints
UI snippets	yes	Embed pre-built search and chat components in your website

Models

Configuration	Editable after creation	Description
Embedding model	no	Model used to generate vector embeddings
Generation model	yes	Model used to generate the final response
Query rewriting model	yes	Model used for query rewriting
Reranking model	yes	Model used to reorder results by semantic relevance
AI Gateway	yes	Observe and control the model calls AI Search makes