Connectors
Files & Object Storage

Cloudflare R2

5min

Cloudflare R2 or R2 is a service offered by Cloudflare, a well know web enablement company that has extended its offerings with object storage through a web service interface. Cloudflare R2 uses the same scalable storage infrastructure that it uses to run its infrastructure provided already to tens of thousands of customers. Cloudflare R2 can store any type of object, which allows uses like storage for Internet applications, backups, disaster recovery, data archives, data lakes for analytics, and hybrid cloud storage

Our Cloudflare R2 Storage DataLakeHouse.io integration:

  • replicates files stored in your R2 buckets to your Cloud Data Warehouse target
  • synchronizes to your target destination at a scheduled frequency

It allows you to replicate/synchronize your data, including capturing snapshots of data at any point int time, and keep it up-to-date with little to no configuration efforts. You don’t even need to prepare the target schema — DataLakeHouse.io will automatically handle all the heavy lifting for you.

All you need is to specify the connection to your R2 bucket, point to your target system, or use a DataLakeHouse.io managed Data Warehouse and DataLakeHouse.io does the rest. Our support team can even help you set it up for you during a short technical on-boarding session.

Setup Instructions

DataLakeHouse.io securely connects to your Cloudflare R2 Storage bucket. Using the form in the DataLakeHouse.io portal please complete the following basic steps.

  1. Enter a Name or Alias for this connection, in the 'Name/Alias' field, that is unique from other connectors
  2. Enter a 'Target Schema Prefix', which will be the prefix for the schema at the target you will sync your data files into
  3. Enter a 'Bucket' name, where your files are stored
    • Typically starts with https://, so enter just the name without the prefix.
  4. Select your 'Region' as Default (Global/Auto)
  5. Enter your 'Access Key', credentials to access the bucket
  6. Enter your 'Secret Key', credentials to access the bucket
  7. Enter any other optional details in the available fields (See the setup video if you need help or contact support)
    • Folder Path, is a path on the root bucket from where desired files will be retrieved
    • File Pattern, is a regular expression (RegEx) used to isolated only certain files to be retrieved
    • File Type, allows for a pre-determined type of file extension to be retreived
  8. Click the Save & Test button. Once your credentials are accepted you should be able to see a successful connection.

How to Instructions



Control Each Column Data Type

SQL Transformations allow logic to be executed against a target connection based on a scheduled frequency or triggered event of new data on tables updated via DataLakeHouse.io (DLH.io). This especially helps when you want to control the data type set in your Target Connection since all columns are set as VARCHAR(16777216).

Issue Handling

If any issues occur with the authorization simply return to the sources page in DataLakeHouse.io, edit the source details and click the 'Save & Test' button to confirm connectivity. If any issues persist please contact our support team via the DataLakeHouse Support Portal.

Updated 19 Jun 2024
Doc contributor
Doc contributor
Did this page help you?