Create a fine-tuned OpenAI model with R2
In this tutorial, you will use the OpenAI ↗ API and Cloudflare R2 to create a fine-tuned model ↗.
This feature in OpenAI’s API allows you to derive a custom model from OpenAI’s various large language models based on a set of custom instructions and example answers. These instructions and example answers are written in a document, known as a fine-tune document. This document will be stored in R2 and dynamically provided to OpenAI’s APIs when creating a new fine-tune model.
In order to use this feature, you will do the following tasks:
- Upload a fine-tune document to R2.
- Read the R2 file and upload it to OpenAI.
- Create a new fine-tuned model based on the document.
To review the completed code for this application, refer to the GitHub repository for this tutorial ↗.
Prerequisites
Before you start, make sure you have:
- A Cloudflare account with access to R2. If you do not have a Cloudflare account, sign up ↗ before continuing. Then purchase R2 from your Cloudflare dashboard.
- An OpenAI API key.
- A fine-tune document, structured as JSON Lines ↗. Use the example document ↗ in the source code.
1. Create a Worker application
First, use the c3
CLI to create a new Cloudflare Workers project.
For setup, select the following options:
- For What would you like to start with?, choose
Hello World example
. - For Which template would you like to use?, choose
Hello World Worker
. - For Which language do you want to use?, choose
TypeScript
. - For Do you want to use git for version control?, choose
Yes
. - For Do you want to deploy your application?, choose
No
(we will be making some changes before deploying).
The above options will create the “Hello World” TypeScript project.
Move into your newly created directory:
2. Upload a fine-tune document to R2
Next, upload the fine-tune document to R2. R2 is a key-value store that allows you to store and retrieve files from within your Workers application. You will use Wrangler to create a new R2 bucket.
To create a new R2 bucket use the wrangler r2 bucket create
command. Note that you are logged in with your Cloudflare account. If not logged in via Wrangler, use the wrangler login
command.
Replace <BUCKET_NAME>
with your desired bucket name. Note that bucket names must be lowercase and can only contain dashes.
Next, upload a file using the wrangler r2 object put
command.
<PATH>
is the combined bucket and file path of the file you want to upload — for example, fine-tune-ai/finetune.jsonl
, where fine-tune-ai
is the bucket name. Replace <FILE_NAME>
with the local filename of your fine-tune document.
3. Bind your bucket to the Worker
A binding is how your Worker interacts with external resources such as the R2 bucket.
To bind the R2 bucket to your Worker, add the following to your wrangler.toml
file. Update the binding property to a valid JavaScript variable identifier. Replace <YOUR_BUCKET_NAME>
with the name of the bucket you created in step 2:
4. Initialize your Worker application
You will use Hono ↗, a lightweight framework for building Cloudflare Workers applications. Hono provides an interface for defining routes and middleware functions. Inside your project directory, run the following command to install Hono:
You also need to install the OpenAI Node API library ↗. This library provides convenient access to the OpenAI REST API in a Node.js project. To install the library, execute the following command:
Next, open the src/index.ts
file and replace the default code with the below code. Replace <MY_BUCKET>
with the binding name you set in wrangler.toml
file.
In the above code, you first import the required packages and define the types. Then, you initialize app
as a new Hono instance. Using the use
middleware function, you add the OpenAI API client to the context of all routes. This middleware function allows you to access the client from within any route handler. onError()
defines an error handler to return any errors as a JSON response.
5. Read R2 files and upload them to OpenAI
In this section, you will define the route and function responsible for handling file uploads.
In createFile
, your Worker reads the file from R2 and converts it to a File
object. Your Worker then uses the OpenAI API to upload the file and return the response.
The GET /files
route listens for GET
requests with a query parameter file
, representing a filename of an uploaded fine-tune document in R2. The function uses the createFile
function to manage the file upload process.
Replace <MY_BUCKET>
with the binding name you set in wrangler.toml
file.
6. Create fine-tuned models
This section includes the GET /models
route and the createModel
function. The function createModel
takes care of specifying the details and initiating the fine-tuning process with OpenAI. The route handles incoming requests for creating a new fine-tuned model.
7. List all fine-tune jobs
This section describes the GET /jobs
route and the corresponding getJobs
function. The function interacts with OpenAI’s API to fetch a list of all fine-tuning jobs. The route provides an interface for retrieving this information.
8. Deploy your application
After you have created your Worker application and added the required functions, deploy the application.
Before you deploy, you must set the OPENAI_API_KEY
secret for your application. Do this by running the wrangler secret put
command:
To deploy your Worker application to the Cloudflare global network:
- Make sure you are in your Worker project’s directory, then run the
wrangler deploy
command:
-
Wrangler will package and upload your code.
-
After your application is deployed, Wrangler will provide you with your Worker’s URL.
9. View the fine-tune job status and use the model
To use your application, create a new fine-tune job by making a request to the /files
with a file
query param matching the filename you uploaded earlier:
When the file is uploaded, issue another request to /models
, passing the file_id
query parameter. This should match the id
returned as JSON from the /files
route:
Finally, visit /jobs
to see the status of your fine-tune jobs in OpenAI. Once the fine-tune job has completed, you can see the fine_tuned_model
value, indicating a fine-tuned model has been created.
Visit the OpenAI Playground ↗ in order to use your fine-tune model. Select your fine-tune model from the top-left dropdown of the interface.
Use it in any API requests you make to OpenAI’s chat completions endpoints. For instance, in the below code example:
Next steps
To build more with Workers, refer to Tutorials.
If you have any questions, need assistance, or would like to share your project, join the Cloudflare Developer community on Discord ↗ to connect with other developers and the Cloudflare team.