How Hyperdrive works
Connecting to traditional centralized databases from Cloudflare's global network which consists of over 300 data center locations ↗ presents a few challenges as queries can originate from any of these locations.
If your database is centrally located, queries can take a long time to get to the database and back. Queries can take even longer in situations where you have to establish a connection and make multiple round trips.
Traditional databases usually handle a maximum number of connections. With any reasonably large amount of distributed traffic, it becomes easy to exhaust these connections.
Hyperdrive solves these challenges by managing the number of global connections to your origin database, selectively parsing and choosing which query response to cache while reducing loading on your database and accelerating your database queries.
Hyperdrive creates a global pool of connections to your database that can be reused as your application executes queries against your database.
When a query hits Hyperdrive, the request is routed to the nearest connection pool.
If the connection pool has pre-existing connections, the connection pool will try and reuse that connection.
If the connection pool does not have pre-existing connections, it will establish a new connection to your database and use that to route your query. This aims at reusing and creating the least number of connections possible as required to operate your application.
The Hyperdrive connection pooler operates in transaction mode, where the client that executes the query communicates through a single connection for the duration of a transaction. When that transaction has completed, the connection is returned to the pool.
Hyperdrive supports SET
statements ↗ for the duration of a transaction or a query. For instance, if you manually create a transaction with BEGIN
/COMMIT
, SET
statements within the transaction will take effect. Moreover, a query that includes a SET
command (SET X; SELECT foo FROM bar;
) will also apply the SET
command. When a connection is returned to the pool, the connection is RESET
such that the SET
commands will not take effect on subsequent queries.
This implies that a single Worker invocation may obtain multiple connections to perform its database operations and may need to SET
any configurations for every query or transaction. It is not recommended to wrap multiple database operations with a single transaction to maintain the SET
state. Doing so will affect the performance and scaling of Hyperdrive as the connection cannot be reused by other Worker isolates for the duration of the transaction.
Hyperdrive supports named prepared statements as implemented in the postgres.js
and node-postgres
drivers. Named prepared statements in other drivers may have worse performance.
Hyperdrive does not support the following PostgreSQL features:
- SQL-level management of prepared statements, such as using
PREPARE
,DISCARD
,DEALLOCATE
, orEXECUTE
. - Advisory locks (PostgreSQL documentation ↗).
LISTEN
andNOTIFY
.PREPARE
andDEALLOCATE
.- Any modification to per-session state not explicitly documented as supported elsewhere.
In cases where you need to issue these unsupported statements from your application, the Hyperdrive team recommends setting up a second, direct client without Hyperdrive.
Hyperdrive supports caching of non-mutating (read) queries to your database.
When queries are sent via Hyperdrive, Hyperdrive parses the query and determines whether the query is a mutating (write) or non-mutating (read) query.
For non-mutating queries, Hyperdrive will cache the response for the configured max_age
, and whenever subsequent queries are made that match the original, Hyperdrive will return the cached response, bypassing the need to issue the query back to the origin database.
Caching reduces the burden on your origin database and accelerates the response times for your queries.