Understanding the Logpush API
Endpoints
The table below summarizes the job operations available.
The <zone_id>
argument is the zone id (hexadecimal string), which can be found using API's zones endpoint.
The <job>
argument is the numeric job id. The <dataset>
argument indicates the log category (such as http_requests
, spectrum_events
, or firewall_events
).
For concrete examples, see the tutorial Manage Logpush with cURL.
Connecting
The Logpush API requires credentials like any other Cloudflare API.
$ curl -s -H "X-Auth-Email: <REDACTED>" -H "X-Auth-Key: <REDACTED>" \ 'https://api.cloudflare.com/client/v4/zones/<ZONE_ID>/logpush/jobs'
Ownership
Before creating a new job, ownership of the destination must be proven.
To issue an ownership challenge token to your destination:
$ curl -s -XPOST https://api.cloudflare.com/client/v4/zones/<ZONE_ID>/logpush/ownership -d '{"destination_conf":"s3://<BUCKET_PATH>?region=us-west-2"}' | jq .
A challenge file will be written to the destination, and the filename will be in the response (the filename may be expressed as a path if appropriate for your destination):
{ "errors": [], "messages": [], "result": { "valid": true, "message": "", "filename": "<path-to-challenge-file>.txt" }, "success": true}
You will need to provide the token contained in the file when creating a job.
Destination
You can specify your cloud service provider destination via the required destination_conf
parameter.
- AWS S3: bucket + optional directory + region + optional encryption parameter (if required by your policy); for example:
s3://bucket/[dir]?region=<region>[&sse=AES256]
- Datadog: Datadog endpoint URL + Datadog API key + optional parameters; for example:
datadog://<DATADOG-ENDPOINT-URL>?header_DD-API_KEY=<DATADOG-API-KEY>&service=<SERVICE>&host=<HOST>&ddsource=<SOURCE>
- Google Cloud Storage: bucket + optional directory; for example:
gs://bucket/[dir]
- Microsoft Azure: service-level SAS URL with
https
replaced byazure
+ optional directory added before query string; for example:azure://<BlobContainerPath>/[dir]?<QueryString>
- Splunk: Splunk endpoint URL + Splunk channel ID + insecure-skip-verify flag + Splunk sourcetype + Splunk authorization token; for example:
splunk://<SPLUNK-ENDPOINT-URL>?channel=<SPLUNK-CHANNEL-ID>&insecure-skip-verify=<INSECURE-SKIP-VERIFY>&sourcetype=<SOURCE-TYPE>&header_Authorization=<SPLUNK-AUTH-TOKEN>
- Sumo Logic: HTTP source address URL with
https
replaced bysumo
; for example:sumo://<SumoEndpoint>/receiver/v1/http/<UniqueHTTPCollectorCode>
For S3, Google Cloud Storage, and Azure, logs can be separated into daily subdirectories by using the special string {DATE}
in the URL path; for example: s3://mybucket/logs/{DATE}?region=us-east-1&sse=AES256
or azure://myblobcontainer/logs/{DATE}?[QueryString]
. It will be substituted with the date in YYYYMMDD
format, like 20180523
.
For more information on the value for your cloud storage provider, consult the following conventions:
- AWS S3 CLI (S3Uri path argument type)
- Google Cloud Storage CLI (Syntax for accessing resources)
- Microsoft Azure Shared Access Signature
- Sumo Logic HTTP Source
To check if a destination is already in use:
$ curl -s -XPOST https://api.cloudflare.com/client/v4/zones/<ZONE_ID>/logpush/validate/destination/exists -d '{"destination_conf":"s3://foo"}' | jq .
Response
{ "errors": [], "messages": [], "result": { "exists": false }, "success": true}
There can be only 1 job writing to each unique destination. For S3 and GCS, a destination is defined as bucket + path. This means two jobs can write to the same bucket, but must write to different subdirectories in that bucket.
Job object
Options
Logpush repeatedly pulls logs on your behalf and uploads them to your destination.
Log options, such as fields or sampling rate, are configured in the logpull_options
job parameter (see Logpush job object schema). For example, the following query gets data from the Logpull API:
curl -sv \ -H'X-Auth-Email: <REDACTED>' \ -H'X-Auth-Key: <REDACTED>' \ "https://api.cloudflare.com/client/v4/zones/<ZONE_ID>/logs/received?start=2018-08-02T10:00:00Z&end=2018-08-02T10:01:00Z&fields=RayID,EdgeStartTimestamp"
In Logpush, the Logpull options would be: "logpull_options": "fields=RayID,EdgeStartTimestamp"
. See Logpull API parameters for more info.
If you don't change any options, you will receive logs with default fields that are unsampled (i.e., sample=1
).
The three options that you can customize are:
- Fields: See Log fields for the currently available fields. The list of fields is also accessible directly from the API:
https://api.cloudflare.com/client/v4/zones/<zone_id>/logpush/datasets/<dataset>/fields
. Default fields:https://api.cloudflare.com/client/v4/zones/<zone_id>/logpush/datasets/<dataset>/fields/default
. - Sampling rate: Value can range from 0.001 to 1.0 (inclusive).
sample=0.1
means return 10% (1 in 10) of all records. - Timestamp format: The format in which timestamp fields will be returned. Value options: unixnano (default), unix, rfc3339.
To check if logpull_options
is valid:
$ curl -s -XPOST https://api.cloudflare.com/client/v4/zones/<ZONE_ID>/logpush/validate/origin -d '{"logpull_options":"fields=RayID,ClientIP,EdgeStartTimestamp×tamps=rfc3339","dataset": "http_requests"}' | jq .
Response
{ "errors": [], "messages": [], "result": { "valid": true, "message": "", }, "success": true}
Audit
The following actions are recorded in Cloudflare Audit Logs: create, update, and delete job.