Skip to content

A data pipeline project to practice hadoop, spark, kafka, dbt and airflow

Notifications You must be signed in to change notification settings

Codilis/financial_data_pipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

  • Dataset is downloaded from https://relational-data.org/dataset/Financial website in csv format.
  • Dataset is about financial transactions.
  • Will create dim and fact tables using dbt.
  • Data from trans.csv file will be used in kafka as streaming data.
  • Will use Apache airflow to create dags.
  • Data will be stored in local sqlite db.

About

A data pipeline project to practice hadoop, spark, kafka, dbt and airflow

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published