Tunnel availability and failover

Our lightweight and open-source connector, cloudflared ↗, was built to be highly available without any additional configuration requirements. When you run a tunnel, cloudflared establishes four outbound-only connections between the origin server and the Cloudflare network. These four connections are made to four different servers spread across at least two distinct data centers. This model ensures high availability and mitigates the risk of individual connection failures. This means in event a single connection, server, or data center goes offline, your resources will remain available.

`cloudflared` replicas

Cloudflare Tunnel allows users to deploy additional instances of our connector, cloudflared, for availability and failover scenarios. We refer to these unique instances as replicas. Each replica establishes four new connections which serve as additional points of ingress to your origin, should you need them. Each of the replicas will point to the same tunnel. This ensures that your network remains up in the event a single host running cloudflared goes down.

graph LR
	C((Cloudflare))
	subgraph E[Private network]
			cf1["cloudflared <br> (Tunnel-1 replica)"]
			cf2["cloudflared <br> (Tunnel-1 replica)"]
			S1[Application]
			cf1-->S1
			cf2-->S1
	end
	C -- "Connections x 4 <br>"--> cf1
	C --> cf1
	C --> cf1
	C --> cf1
	C -- Connections x 4--> cf2
	C --> cf2
	C --> cf2
	C --> cf2

By design, replicas do not offer any level of traffic steering (random, hash, or round-robin). Instead, when a request arrives to Cloudflare, it will be forwarded to the replica that is geographically closest. If that distance calculation is unsuccessful or the connection fails, we will retry others, but there is no guarantee about which connection is chosen.

When to use `cloudflared` replicas

To provide additional points of availability for a single tunnel.
To allocate failover nodes within your network.
To update the configuration of a tunnel without downtime.

For setup instructions, refer to Deploy cloudflared replicas.

Cloudflare Load Balancers

Cloudflare Load Balancing proactively steers traffic away from unhealthy origins and intelligently distributes the traffic load based on your choice of steering algorithms. Unlike cloudflared replicas which all use the same tunnel, a typical load balancer setup requires creating multiple tunnels. Most customers will create one tunnel per data center and one load balancer pool per tunnel.

graph LR
		accTitle: Load balancing traffic to applications behind Cloudflare Tunnel

    A[Internet] --> C{Cloudflare <br> Load Balancer}
    B[WARP clients] --> C
		M[Magic WAN] --> C
    C -- Tunnel 1 --> cf1
    C -- Tunnel 2 --> cf2
		subgraph F[Data center 2]
				cf2[cloudflared <br> server]
       	S3[App server]
        S4[App server]
				cf2-->S3
				cf2-->S4
    end
		subgraph E[Data center 1]
				cf1[cloudflared <br> server]
        S1[App server]
        S2[App server]
				cf1-->S1
				cf1-->S2
    end

When to use load balancers

To intelligently steer traffic based on latency, geolocation, or other signals.
To implement failover logic if a tunnel reaches an inactive state.
To get a health alert when a tunnel reaches an inactive state.
To distribute traffic more evenly across your Cloudflare Tunnel-accessible origins or endpoints.

For setup instructions, refer to Public load balancers or Private Network Load Balancing depending on your use case.

Types of load balancers

There are two types of load balancers that you can use with Cloudflare Tunnel endpoints:

Public load balancers steer traffic from the Internet to applications published on a Cloudflare domain. Use this method if your service is served by Cloudflare Tunnel via a published application route.
Private load balancers steer traffic from WARP clients, Magic WAN, and other on-ramps to an internal IP on your private network. Use this method if your service is connected to Cloudflare Tunnel via a CIDR route.

Was this helpful?

Community
X
Discord
YouTube
GitHub