Configuration

You can customize how your AI Search instance indexes your data, retrieves results, and generates responses to queries. Some settings can be updated after the instance is created, while others are fixed at creation time.

The table below lists all available configuration options:

Configuration	Editable after creation	Description
Data source	no	The source where your knowledge base is stored
Path filtering	yes	Include or exclude specific paths from indexing
Chunk size	yes	Number of tokens per chunk
Chunk overlap	yes	Number of overlapping tokens between chunks
Embedding model	no	Model used to generate vector embeddings
Query rewrite	yes	Enable or disable query rewriting before retrieval
Query rewrite model	yes	Model used for query rewriting
Query rewrite system prompt	yes	Custom system prompt to guide query rewriting behavior
Match threshold	yes	Minimum similarity score required for a vector match
Maximum number of results	yes	Maximum number of vector matches returned (`top_k`)
Reranking	yes	Reorder retrieved results by semantic relevance using a reranking model after initial retrieval
Generation model	yes	Model used to generate the final response
Generation system prompt	yes	Custom system prompt to guide response generation
Similarity caching	yes	Enable or disable caching of responses for similar (not just exact) prompts
Similarity caching threshold	yes	Controls how similar a new prompt must be to a previous one to reuse its cached response
AI Gateway	yes	AI Gateway for monitoring and controlling model usage
AI Search name	no	Name of your AI Search instance
Service API token	yes	API token that grants AI Search permission to configure resources on your account
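
To make the chunking and retrieval settings in the table more concrete, here is a minimal Python sketch of how chunk size, chunk overlap, match threshold, and maximum number of results (`top_k`) might interact. It is illustrative only and is not the AI Search implementation: the function names `chunk_tokens` and `select_matches` are hypothetical, plain list items stand in for model tokens, and treating the threshold as an inclusive lower bound is an assumption.

```python
from typing import List, Tuple


def chunk_tokens(tokens: List[str], chunk_size: int, chunk_overlap: int) -> List[List[str]]:
    """Split tokens into windows of chunk_size that share chunk_overlap tokens.

    Hypothetical helper: illustrates the two chunking settings only.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= chunk_overlap < chunk_size:
        raise ValueError("chunk_overlap must be >= 0 and smaller than chunk_size")

    step = chunk_size - chunk_overlap  # how far each new chunk advances
    chunks: List[List[str]] = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + chunk_size])
        if start + chunk_size >= len(tokens):
            break  # the last window already reached the end of the input
    return chunks


def select_matches(
    scored: List[Tuple[str, float]], match_threshold: float, top_k: int
) -> List[Tuple[str, float]]:
    """Keep matches scoring at least match_threshold, best first, at most top_k.

    Hypothetical helper: the inclusive comparison (>=) is an assumption.
    """
    kept = [match for match in scored if match[1] >= match_threshold]
    kept.sort(key=lambda match: match[1], reverse=True)
    return kept[:top_k]


if __name__ == "__main__":
    # Ten single-letter "tokens", chunk size 4, overlap 1.
    print(chunk_tokens(list("abcdefghij"), chunk_size=4, chunk_overlap=1))

    # Four candidate matches, threshold 0.5, at most 2 results.
    candidates = [("a", 0.9), ("b", 0.4), ("c", 0.7), ("d", 0.6)]
    print(select_matches(candidates, match_threshold=0.5, top_k=2))
```

In this sketch, a larger overlap repeats more context across neighboring chunks at the cost of producing more chunks, while a higher match threshold or a lower `top_k` narrows what reaches the generation step.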