Skip to content

Support for Delta Live Tables #213

@gerson23

Description

@gerson23

Describe the feature
In our project, we are starting to use Delta Live Tables and we need to publish the lineage information to Purview. However, the solution accelerator doesn't seem to support DLT pipelines yet.

Detailed Example
Ideally, we should be able to see the DLT pipeline lineage in Purview, connecting the assets DLT is reading from up to the assets created as an output of the pipeline.

flowchart LR;
  raw["raw asset"];
  bronze_table["bronze table"];
  silver_table["silver table"];
  gold_table["gold table"];
  asset["aggregated asset"];
  raw --> bronze_table;
  subgraph DLT Pipeline
  bronze_table --> silver_table;
  silver_table --> gold_table;
  end
  gold_table --> asset;
Loading

Another simpler option would be to have lineage information similar to notebooks, hiding the internals of the pipeline.

flowchart LR
  raw["raw asset"]
  dlt["DLT Pipeline"]
  asset["aggregate asset"]
  raw --> dlt
  dlt --> asset
Loading

Issues that this feature solves
N/A

Suggested Implementation
Not sure how to implement this. I know that, if going this way, the pipeline information itself is available in the event logs: https://learn.microsoft.com/en-us/azure/databricks/delta-live-tables/observability#--query-lineage-information-from-the-event-log

Additional context
N/A

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or requesthelp wantedExtra attention is neededopenlineageRequires support from OpenLineage to address

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions