Skip to content

Add Lakebase ETL and Reverse ETL recipes (CLI-focused)#6

Closed
sav-maya wants to merge 1 commit intodatabricks:mainfrom
sav-maya:savannah/streamline-lakebase-recipes
Closed

Add Lakebase ETL and Reverse ETL recipes (CLI-focused)#6
sav-maya wants to merge 1 commit intodatabricks:mainfrom
sav-maya:savannah/streamline-lakebase-recipes

Conversation

@sav-maya
Copy link
Collaborator

Summary

  • ETL recipe (etl-lakehouse-sync-autoscaling.md): Replicate Lakebase Postgres tables into Unity Catalog as SCD Type 2 Delta history tables using Lakehouse Sync (CDC-based, native Lakebase feature)
  • Reverse ETL recipe (reverse-etl-synced-tables-autoscaling.md): Sync Unity Catalog tables into Lakebase Postgres for sub-10ms application reads using databricks database create-synced-database-table

Both recipes are CLI-first with explicit "not yet available via CLI" callouts (with condensed UI instructions) where no CLI alternative exists yet.

What was validated

  • All CLI commands verified against databricks database --help and databricks postgres --help (CLI v0.295.0)
  • All 6 external doc links confirmed live
  • Tested end-to-end against a live Lakebase-enabled workspace:
    • ETL: replica identity check, unsupported data type check, wal2delta monitoring query
    • Reverse ETL: create-synced-database-tableget-synced-database-table → queried synced data in Postgres via psql (all 5 rows returned)
    • Confirmed the /postgres/ API caveat (project created via postgres API correctly returns "Database instance is not found" for synced table creation)

Test plan

  • CLI commands validated against --help
  • External doc links verified
  • ETL recipe SQL queries tested against live Lakebase Postgres
  • Reverse ETL synced table created, synced, and queried end-to-end
  • Cleanup completed (synced table, database instance, UC table deleted)

Two new recipes for Lakebase Autoscaling data movement:

- ETL (Lakehouse Sync): Replicate Lakebase Postgres tables into Unity
  Catalog as SCD Type 2 Delta history tables via CDC
- Reverse ETL (Synced Tables): Sync Unity Catalog tables into Lakebase
  Postgres for sub-10ms app reads via CLI

Both recipes are CLI-first with explicit callouts where steps still
require the Databricks UI. Tested end-to-end against a live workspace.
@sav-maya sav-maya closed this Mar 20, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant