website logo
⌘K
Getting Started 🚀
What is DataLakeHouse.io?
Our Business-Value Focus
Learn the Basic Concepts
Connectors
Operations Applications
Asana
Aloha POS
Baremetrics
Beans Route
BILL
Bloom Growth
Bullhorn
Calendly
Ceridian Dayforce
ClinicalTrials.gov
ConnectWise
DBT Cloud
DBT Cloud Log Stream
Facebook Ads
Food Delivery Service Connector
Google Analytics 4
Google Play
Harvest
Hubspot
Jira
MailChimp
McLeod Transportation
Microsoft Teams
NetSuite (Oracle NetSuite)
NetSuite SuiteAnalytics
Optimum HRIS
QuickBooks Online
Salesforce
Salesloft
Shift4 Payments
Shopify
Square
Square Marketplace
Stripe
Toast
TriNet
Verizon Wireless Business
Workday HCM
Xero
Zendesk Sell
Zoom
Databases
Files & Object Storage
SQL Data Query
SSH Tunnel Setup for Hosted Database Systems
SQL Playground Editor
SQL Transformations
DBT Cloud Transformations
Terraform: Reverse Terraforming
Sync Bridge (Data Pipelines)
Create a Sync Bridge
Manually Run a Sync Bridge
Deleting a Sync Bridge
Historical Re-sync
Analytics
Access Analytics
Snowflake Usage Analytics
Data Catalog
Create the Catalog
Populate the Catalog
Access the Catalog
Data Warehouse Clouds
❄️Snowflake
Open Source DW Models
Alerts & Notifications
Slack Notifications
Logs & Monitoring
Security
Callback Links
Service Level Agreement (SLA)
Release Notes
July 2023
June 2023
May 2023
April 2023
Q3 2022
Q4 2022
Community Overview
Contributor Agreements
Code Contribution Guide
About
Customer Support
License
Viewpoint
Credit Consumption Breakdown
Docs powered by
Archbee
website logo

Data Warehouse Clouds

3min

Cloud Data Warehousing is a means for an organization to store and retrieve curated business decision making information, quickly, accurately, and with high levels of confidence, usually for reporting or visualization. It’s pre-defined structure allows for cross-departmental analysis of business data, that is usually pre-aggregated with already applied business logic and rules, with security provisioned profiles to enable a more data governed approach for scaling and enabling self-service analytics across the organization. Lastly, it makes any analysis a team already does in an Excel Spreadsheet or elsewhere exponentially faster and more consistent. A Data Warehouse is commonly referred to as the “single source of truth” for the organization. We recommend that it is always part of an organizations data-value-chain regardless of an organization’s understanding of a ‘Data Lake’. An “Enterprise Data Warehouse” (EDW) is a common term, and highest level of achievement, for most Data Warehouse implementations as it implies a holistic representation of the organization’s information.

  • Snowflake Data Cloud : Snowflake
  • BigQuery (Google)
  • FireBolt
  • Oracle Cloud Data Warehouse
  • Teradata Vantage
  • ClickHouse
  • Our Open Source Data Warehouse Models : Open Source DW Models

Why have a Cloud Data Warehouse?

In our opinion, and as a fundamental approach to building a DataLakeHouse, every organization should have a Data Warehouse as part of the data-value-chain, regardless of the idea or understanding of a ‘Data Lake’.

A (Enterprise) Data Warehouse enables:

  • A single-source-of-the-truth for operational and managerial information and data, with curated perspectives on the data that represent the organization, including at a departmental owner perspective
  • Well-defined Key Performance Indicators (KPIs) that drive, and provide the pulse of the business, to track how the organization/business is doing and enables alerting, thresholds, and other measurement abilities
  • Corporate and business hierarchy representation of information – this can can be a direct abstract from operational systems (i.e.: ERP, CRM, POS) to codify and provide taxonomy that represents the organization not the default vendor representation of the transactional data. A common example for this is org structure, alternative chart of accounts or P&L structure, alternative business calendars, etc.
  • Historical data storage and trend analytics (thus how the organization is performing over time?)
  • Master Data Management (MDM) integration and validation, even potentially a “poor-man’s” MDM solution (depending on the organization’s business case requirements)
  • Consolidation of multiple data source systems into a single data repository for analytics (for example, multiple GL systems, and/or HR systems) to gain a single source holistic view of the organization (perfect for acquisitions or mergers where decisions need to be made regarding the new formed organization as a whole)



Updated 28 Jul 2023
Did this page help you?
PREVIOUS
Access the Catalog
NEXT
Snowflake
Docs powered by
Archbee
TABLE OF CONTENTS
Why have a Cloud Data Warehouse?
Docs powered by
Archbee