---
title: Data source
description: Connect a website, R2 bucket, or upload files directly to your AI Search instance for indexing.
image: https://developers.cloudflare.com/dev-products-preview.png
---

> Documentation Index  
> Fetch the complete documentation index at: https://developers.cloudflare.com/ai-search/llms.txt  
> Use this file to discover all available pages before exploring further.

[Skip to content](#%5Ftop) 

# Data source

You can upload files directly to an instance or connect an external data source.

| Data Source                                                                                                 | Description                                                                       |
| ----------------------------------------------------------------------------------------------------------- | --------------------------------------------------------------------------------- |
| [Built-in storage](https://developers.cloudflare.com/ai-search/configuration/data-source/built-in-storage/) | Upload files directly to an instance. Available by default on every new instance. |
| [Website](https://developers.cloudflare.com/ai-search/configuration/data-source/website/)                   | Connect a domain you own to index website pages.                                  |
| [R2 Bucket](https://developers.cloudflare.com/ai-search/configuration/data-source/r2/)                      | Connect a Cloudflare R2 bucket to index stored documents.                         |

## Supported file types

AI Search can ingest a variety of file types. The following plain text files and rich format files are supported.

### Plain text file types

| Format     | File extensions                                                  | Mime Type                                                      |
| ---------- | ---------------------------------------------------------------- | -------------------------------------------------------------- |
| Text       | .txt, .rst                                                       | text/plain                                                     |
| Log        | .log                                                             | text/plain                                                     |
| Config     | .ini, .conf, .env, .properties, .gitignore, .editorconfig, .toml | text/plain, text/toml                                          |
| Markdown   | .markdown, .md, .mdx                                             | text/markdown                                                  |
| LaTeX      | .tex, .latex                                                     | application/x-tex, application/x-latex                         |
| Script     | .sh, .bat, .ps1                                                  | application/x-sh, application/x-msdos-batch, text/x-powershell |
| SGML       | .sgml                                                            | text/sgml                                                      |
| JSON       | .json                                                            | application/json                                               |
| YAML       | .yaml, .yml                                                      | application/x-yaml                                             |
| CSS        | .css                                                             | text/css                                                       |
| JavaScript | .js                                                              | application/javascript                                         |
| PHP        | .php                                                             | application/x-httpd-php                                        |
| Python     | .py                                                              | text/x-python                                                  |
| Ruby       | .rb                                                              | text/x-ruby                                                    |
| Java       | .java                                                            | text/x-java-source                                             |
| C          | .c                                                               | text/x-c                                                       |
| C++        | .cpp, .cxx                                                       | text/x-c++                                                     |
| C Header   | .h, .hpp                                                         | text/x-c-header                                                |
| Go         | .go                                                              | text/x-go                                                      |
| Rust       | .rs                                                              | text/rust                                                      |
| Swift      | .swift                                                           | text/swift                                                     |
| Dart       | .dart                                                            | text/dart                                                      |
| EMACS Lisp | .el                                                              | application/x-elisp, text/x-elisp, text/x-emacs-lisp           |

### Rich format file types

AI Search uses [Markdown Conversion](https://developers.cloudflare.com/workers-ai/features/markdown-conversion/) to convert rich format files to markdown. The following table lists the supported formats that will be converted to Markdown:

| Format                     | File extensions                       | Mime Types                                                                                                                                                                                                                                                              | |  PDF Documents | .pdf | application/pdf |
| -------------------------- | ------------------------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ---------------- | ---- | --------------- |
| Images 1                   | .jpeg, .jpg, .png, .webp, .svg        | image/jpeg, image/png, image/webp, image/svg+xml                                                                                                                                                                                                                        |                  |      |                 |
| HTML Documents             | .html, .htm                           | text/html                                                                                                                                                                                                                                                               |                  |      |                 |
| XML Documents              | .xml                                  | application/xml                                                                                                                                                                                                                                                         |                  |      |                 |
| Microsoft Office Documents | .xlsx, .xlsm, .xlsb, .xls, .et, .docx | application/vnd.openxmlformats-officedocument.spreadsheetml.sheet,application/vnd.ms-excel.sheet.macroenabled.12,application/vnd.ms-excel.sheet.binary.macroenabled.12,application/vnd.ms-excel,application/vnd.openxmlformats-officedocument.wordprocessingml.document |                  |      |                 |
| Open Document Format       | .ods, .odt                            | application/vnd.oasis.opendocument.spreadsheet,application/vnd.oasis.opendocument.text                                                                                                                                                                                  |                  |      |                 |
| CSV                        | .csv                                  | text/csv                                                                                                                                                                                                                                                                |                  |      |                 |
| Apple Documents            | .numbers                              | application/vnd.apple.numbers                                                                                                                                                                                                                                           |                  |      |                 |

1 Image conversion uses two Workers AI models for object detection and summarization. See [Workers AI pricing](https://developers.cloudflare.com/workers-ai/features/markdown-conversion/#pricing) for more details.

## File limits

AI Search has a file size limit of **up to 4 MB**.

Files that exceed this limit will not be indexed and will show up in the error logs.

```json
{"@context":"https://schema.org","@type":"BreadcrumbList","itemListElement":[{"@type":"ListItem","position":1,"item":{"@id":"/directory/","name":"Directory"}},{"@type":"ListItem","position":2,"item":{"@id":"/ai-search/","name":"AI Search"}},{"@type":"ListItem","position":3,"item":{"@id":"/ai-search/configuration/","name":"Configuration"}},{"@type":"ListItem","position":4,"item":{"@id":"/ai-search/configuration/data-source/","name":"Data source"}}]}
```
