bge-base-en-v1.5

Text Embeddings • BAAI • Hosted

BAAI General Embedding (BGE) base model that transforms input text into a 768-dimensional vector.

Model Info
Context Window	153,600 tokens
Maximum Input Tokens	512
Output Dimensions	768
Batch	Yes
Unit Pricing	$0.067 per M input tokens
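
As a rough illustration of the unit pricing above, the following sketch estimates the cost of a job from its input token count. The function name `estimate_cost_usd` is hypothetical, and the sketch assumes billing is strictly proportional to input tokens with no minimums or free allowances, which this page does not state.

```python
# USD per one million input tokens, taken from the Unit Pricing row above.
PRICE_PER_MILLION_INPUT_TOKENS = 0.067


def estimate_cost_usd(input_tokens: int) -> float:
    """Estimate cost, assuming purely proportional per-token billing."""
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    return input_tokens / 1_000_000 * PRICE_PER_MILLION_INPUT_TOKENS


if __name__ == "__main__":
    # 2.5 million input tokens at $0.067 per million.
    print(f"{estimate_cost_usd(2_500_000):.4f}")  # prints 0.1675
```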

Usage

export interface Env {
  AI: Ai;
}

export default {
  async fetch(request, env): Promise<Response> {

    // Can be a string or an array of strings
    const stories = [
      "This is a story about an orange cloud",
      "This is a story about a llama",
      "This is a story about a hugging emoji",
    ];

    const embeddings = await env.AI.run(
      "@cf/baai/bge-base-en-v1.5",
      {
        text: stories,
      }
    );

    return Response.json(embeddings);
  },
} satisfies ExportedHandler<Env>;

import os
import requests


ACCOUNT_ID = "your-account-id"
AUTH_TOKEN = os.environ.get("CLOUDFLARE_AUTH_TOKEN")

stories = [
  'This is a story about an orange cloud',
  'This is a story about a llama',
  'This is a story about a hugging emoji'
]

response = requests.post(
  f"https://api.cloudflare.com/client/v4/accounts/{ACCOUNT_ID}/ai/run/@cf/baai/bge-base-en-v1.5",
  headers={"Authorization": f"Bearer {AUTH_TOKEN}"},
  json={"text": stories}
)

print(response.json())

curl https://api.cloudflare.com/client/v4/accounts/$CLOUDFLARE_ACCOUNT_ID/ai/run/@cf/baai/bge-base-en-v1.5  \
  -X POST  \
  -H "Authorization: Bearer $CLOUDFLARE_API_TOKEN"  \
  -d '{ "text": ["This is a story about an orange cloud", "This is a story about a llama", "This is a story about a hugging emoji"] }'

Parameters

Synchronous — Send a request and receive a complete response

Input
Output

▶text

one of • required

pooling

string • default: mean • enum: mean, cls
The pooling method used in the embedding process. `cls` pooling generates more accurate embeddings on larger inputs. However, embeddings created with `cls` pooling are not compatible with embeddings generated with `mean` pooling. The default remains `mean` so that existing integrations do not break, but `cls` pooling is recommended for better accuracy.

▶shape[]

array

▶data[]

array
Embeddings of the requested text values

pooling

string • enum: mean, cls
The pooling method used in the embedding process.
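
The output fields above can be consumed along these lines. This is a minimal sketch, not the official client: it assumes `shape` is `[number_of_embeddings, dimensions]`, which the schema does not spell out, and the helper names and the tiny 3-dimensional sample vectors are made up for illustration (real vectors from this model have 768 dimensions).

```python
import math


def check_embedding_response(result: dict) -> list[list[float]]:
    """Return the embedding rows after checking them against `shape`.

    Assumes `shape` is [rows, dimensions]; this is an assumption.
    """
    rows, dims = result["shape"]
    data = result["data"]
    if len(data) != rows or any(len(vec) != dims for vec in data):
        raise ValueError("data does not match the reported shape")
    return data


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    # Made-up response with tiny vectors, for illustration only.
    sample = {
        "shape": [2, 3],
        "data": [[0.1, 0.2, 0.3], [0.3, 0.2, 0.1]],
        "pooling": "mean",
    }
    first, second = check_embedding_response(sample)
    print(cosine_similarity(first, second))
```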

Batch — Send multiple requests in a single API call

Input
Output

▶requests[]

array • required
Batch of the embeddings requests to run using async-queue

▶shape[]

array

▶data[]

array
Embeddings of the requested text values

pooling

string • enum: mean, cls
The pooling method used in the embedding process.
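
A batch request body wraps one or more ordinary embedding requests in a `requests` array. The sketch below only builds that body; the helper name `build_batch_payload` is hypothetical, and how the body is submitted and how batch results are retrieved are not covered in this section, so they are left out.

```python
import json


def build_batch_payload(groups: list[list[str]], pooling: str = "mean") -> dict:
    """Wrap each group of texts as one entry in the `requests` array."""
    if pooling not in ("mean", "cls"):
        raise ValueError("pooling must be 'mean' or 'cls'")
    return {
        "requests": [{"text": texts, "pooling": pooling} for texts in groups]
    }


if __name__ == "__main__":
    payload = build_batch_payload(
        [
            ["This is a story about an orange cloud"],
            ["This is a story about a llama", "This is a story about a hugging emoji"],
        ],
        pooling="cls",
    )
    print(json.dumps(payload, indent=2))
```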

API Schemas (Raw)

Synchronous — Send a request and receive a complete response

Input
Output

{
  "properties": {
    "text": {
      "oneOf": [
        {
          "type": "string",
          "description": "The text to embed",
          "minLength": 1
        },
        {
          "type": "array",
          "description": "Batch of text values to embed",
          "items": {
            "type": "string",
            "description": "The text to embed",
            "minLength": 1
          },
          "maxItems": 100
        }
      ]
    },
    "pooling": {
      "type": "string",
      "enum": [
        "mean",
        "cls"
      ],
      "default": "mean",
      "description": "The pooling method used in the embedding process. `cls` pooling will generate more accurate embeddings on larger inputs - however, embeddings created with cls pooling are not compatible with embeddings generated with mean pooling. The default pooling method is `mean` in order for this to not be a breaking change, but we highly suggest using the new `cls` pooling for better accuracy."
    }
  },
  "required": [
    "text"
  ]
}
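
The input schema above limits `text` arrays to 100 items, requires each string to have at least one character, and restricts `pooling` to `mean` or `cls`. A client could enforce those constraints before sending anything, roughly as follows. The function name `build_payloads` is an assumption for illustration, and splitting a long list into several requests is one possible strategy rather than documented behavior.

```python
MAX_ITEMS = 100                    # "maxItems" in the input schema
ALLOWED_POOLING = ("mean", "cls")  # "enum" in the input schema


def build_payloads(texts: list[str], pooling: str = "mean") -> list[dict]:
    """Split `texts` into request bodies that satisfy the input schema."""
    if pooling not in ALLOWED_POOLING:
        raise ValueError(f"pooling must be one of {ALLOWED_POOLING}")
    if any(len(text) < 1 for text in texts):
        raise ValueError("every text must contain at least one character")
    return [
        {"text": texts[start:start + MAX_ITEMS], "pooling": pooling}
        for start in range(0, len(texts), MAX_ITEMS)
    ]


if __name__ == "__main__":
    bodies = build_payloads([f"story {i}" for i in range(250)], pooling="cls")
    print([len(body["text"]) for body in bodies])  # prints [100, 100, 50]
```

Each returned dictionary can be passed as the `json=` argument in the Python usage example earlier on this page.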

{
  "type": "object",
  "contentType": "application/json",
  "properties": {
    "shape": {
      "type": "array",
      "items": {
        "type": "number"
      }
    },
    "data": {
      "type": "array",
      "description": "Embeddings of the requested text values",
      "items": {
        "type": "array",
        "description": "Floating point embedding representation shaped by the embedding model",
        "items": {
          "type": "number"
        }
      }
    },
    "pooling": {
      "type": "string",
      "enum": [
        "mean",
        "cls"
      ],
      "description": "The pooling method used in the embedding process."
    }
  }
}
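
Because embeddings produced with `cls` pooling are not compatible with those produced with `mean` pooling, it can help to store the `pooling` value from the response next to each vector and refuse to compare mismatched ones. The sketch below shows one way to do that. The `StoredEmbedding` type and both helper functions are hypothetical, and falling back to `mean` when the response omits `pooling` is an assumption based on the documented input default.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StoredEmbedding:
    vector: tuple[float, ...]
    pooling: str  # "mean" or "cls", copied from the response


def from_response(result: dict) -> list[StoredEmbedding]:
    """Tag every vector in a response with its pooling method."""
    # Assumption: a missing `pooling` field means the default, "mean".
    pooling = result.get("pooling", "mean")
    return [StoredEmbedding(tuple(vec), pooling) for vec in result["data"]]


def dot_product(a: StoredEmbedding, b: StoredEmbedding) -> float:
    """Dot product that rejects vectors from different pooling methods."""
    if a.pooling != b.pooling:
        raise ValueError(
            f"cannot compare {a.pooling!r} embeddings with {b.pooling!r} embeddings"
        )
    return sum(x * y for x, y in zip(a.vector, b.vector))


if __name__ == "__main__":
    # Made-up 2-dimensional vectors, for illustration only.
    mean_vecs = from_response({"shape": [1, 2], "data": [[1.0, 2.0]], "pooling": "mean"})
    cls_vecs = from_response({"shape": [1, 2], "data": [[3.0, 4.0]], "pooling": "cls"})
    print(dot_product(mean_vecs[0], mean_vecs[0]))  # prints 5.0
    try:
        dot_product(mean_vecs[0], cls_vecs[0])
    except ValueError as err:
        print(err)
```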

Batch — Send multiple requests in a single API call

Input
Output

{
  "properties": {
    "requests": {
      "type": "array",
      "description": "Batch of the embeddings requests to run using async-queue",
      "items": {
        "properties": {
          "text": {
            "oneOf": [
              {
                "type": "string",
                "description": "The text to embed",
                "minLength": 1
              },
              {
                "type": "array",
                "description": "Batch of text values to embed",
                "items": {
                  "type": "string",
                  "description": "The text to embed",
                  "minLength": 1
                },
                "maxItems": 100
              }
            ]
          },
          "pooling": {
            "type": "string",
            "enum": [
              "mean",
              "cls"
            ],
            "default": "mean",
            "description": "The pooling method used in the embedding process. `cls` pooling will generate more accurate embeddings on larger inputs - however, embeddings created with cls pooling are not compatible with embeddings generated with mean pooling. The default pooling method is `mean` in order for this to not be a breaking change, but we highly suggest using the new `cls` pooling for better accuracy."
          }
        },
        "required": [
          "text"
        ]
      }
    }
  },
  "required": [
    "requests"
  ]
}

{
  "type": "object",
  "contentType": "application/json",
  "properties": {
    "shape": {
      "type": "array",
      "items": {
        "type": "number"
      }
    },
    "data": {
      "type": "array",
      "description": "Embeddings of the requested text values",
      "items": {
        "type": "array",
        "description": "Floating point embedding representation shaped by the embedding model",
        "items": {
          "type": "number"
        }
      }
    },
    "pooling": {
      "type": "string",
      "enum": [
        "mean",
        "cls"
      ],
      "description": "The pooling method used in the embedding process."
    }
  }
}