Databricks as Data Platform

Category: Data Platform
Platform: Databricks
Tags: platform, architecture decision, databricks

Context

A data mesh requires a self-serve data platform. We don't want to build it entirely ourselves; instead, we want to tailor an existing data platform to our data mesh needs.

We use Microsoft Azure for our operational systems. We are a regulated company in the finance sector. We rely heavily on Tableau for our reports.

Decision

We use Azure Databricks as our central data platform, similar to the tech stack described on datamesh-architecture.com.

Consequences

  • Our business partner is Microsoft; no separate contract with Databricks, Inc. is required.
  • Expected costs: XXX USD/month
  • A data platform team needs to manage the Databricks account(s).
  • Software developers will mostly write transformations as notebooks using PySpark, SQL, or Scala.

Considered Alternatives

  • Azure Synapse Analytics
  • Snowflake deployed on Azure

Automation