- Dataset is downloaded from https://relational-data.org/dataset/Financial website in csv format.
- Dataset is about financial transactions.
- Will create dim and fact tables using dbt.
- Data from trans.csv file will be used in kafka as streaming data.
- Will use Apache airflow to create dags.
- Data will be stored in local sqlite db.
-
Notifications
You must be signed in to change notification settings - Fork 0
Codilis/financial_data_pipeline
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
A data pipeline project to practice hadoop, spark, kafka, dbt and airflow
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published