Configuration
You can customize how your AI Search instance indexes your data, and retrieves and generates responses for queries. Some settings can be updated after the instance is created, while others are fixed at creation time.
The table below lists all available configuration options:
Configuration | Editable after creation | Description |
---|---|---|
Data source | no | The source where your knowledge base is stored |
Chunk size | yes | Number of tokens per chunk |
Chunk overlap | yes | Number of overlapping tokens between chunks |
Embedding model | no | Model used to generate vector embeddings |
Query rewrite | yes | Enable or disable query rewriting before retrieval |
Query rewrite model | yes | Model used for query rewriting |
Query rewrite system prompt | yes | Custom system prompt to guide query rewriting behavior |
Match threshold | yes | Minimum similarity score required for a vector match |
Maximum number of results | yes | Maximum number of vector matches returned (top_k ) |
Generation model | yes | Model used to generate the final response |
Generation system prompt | yes | Custom system prompt to guide response generation |
Similarity caching | yes | Enable or disable caching of responses for similar (not just exact) prompts |
Similarity caching threshold | yes | Controls how similar a new prompt must be to a previous one to reuse its cached response |
AI Gateway | yes | AI Gateway for monitoring and controlling model usage |
AI Search name | no | Name of your AI Search instance |
Service API token | yes | API token granted to AI Search to give it permission to configure resources on your account. |
Was this helpful?
- Resources
- API
- New to Cloudflare?
- Directory
- Sponsorships
- Open Source
- Support
- Help Center
- System Status
- Compliance
- GDPR
- Company
- cloudflare.com
- Our team
- Careers
- © 2025 Cloudflare, Inc.
- Privacy Policy
- Terms of Use
- Report Security Issues
- Trademark
-