Skip to content

Data source

You can upload files directly to an instance or connect an external data source.

Data SourceDescription
Built-in storageUpload files directly to an instance. Available by default on every new instance.
WebsiteConnect a domain you own to index website pages.
R2 BucketConnect a Cloudflare R2 bucket to index stored documents.

Supported file types

AI Search can ingest a variety of file types. The following plain text files and rich format files are supported.

Plain text file types

FormatFile extensionsMime Type
Text.txt, .rsttext/plain
Log.logtext/plain
Config.ini, .conf, .env, .properties, .gitignore, .editorconfig, .tomltext/plain, text/toml
Markdown.markdown, .md, .mdxtext/markdown
LaTeX.tex, .latexapplication/x-tex, application/x-latex
Script.sh, .bat, .ps1application/x-sh, application/x-msdos-batch, text/x-powershell
SGML.sgmltext/sgml
JSON.jsonapplication/json
YAML.yaml, .ymlapplication/x-yaml
CSS.csstext/css
JavaScript.jsapplication/javascript
PHP.phpapplication/x-httpd-php
Python.pytext/x-python
Ruby.rbtext/x-ruby
Java.javatext/x-java-source
C.ctext/x-c
C++.cpp, .cxxtext/x-c++
C Header.h, .hpptext/x-c-header
Go.gotext/x-go
Rust.rstext/rust
Swift.swifttext/swift
Dart.darttext/dart
EMACS Lisp.elapplication/x-elisp, text/x-elisp, text/x-emacs-lisp

Rich format file types

AI Search uses Markdown Conversion to convert rich format files to markdown. The following table lists the supported formats that will be converted to Markdown:

Format

File extensions

Mime Types

PDF Documents

.pdf

application/pdf

Images 1

.jpeg, .jpg, .png, .webp, .svg

image/jpeg, image/png, image/webp, image/svg+xml

HTML Documents

.html, .htm

text/html

XML Documents

.xml

application/xml

Microsoft Office Documents

.xlsx, .xlsm, .xlsb, .xls, .et, .docx

application/vnd.openxmlformats-officedocument.spreadsheetml.sheet, application/vnd.ms-excel.sheet.macroenabled.12, application/vnd.ms-excel.sheet.binary.macroenabled.12, application/vnd.ms-excel, application/vnd.openxmlformats-officedocument.wordprocessingml.document

Open Document Format

.ods, .odt

application/vnd.oasis.opendocument.spreadsheet, application/vnd.oasis.opendocument.text

CSV

.csv

text/csv

Apple Documents

.numbers

application/vnd.apple.numbers

1 Image conversion uses two Workers AI models for object detection and summarization. See Workers AI pricing for more details.

File limits

AI Search has a file size limit of up to 4 MB.

Files that exceed this limit will not be indexed and will show up in the error logs.