alchayward/qsic_task

Qsic Coding Test

running

To run the pipeline and generate the sales profiles for sales_data.tsv:

Create a Python environment and install the requirements.

Run:

python sales_data_processor.py

The resulting sales profile is written to sales_profile.json.

coverage report

Coverage report is found here.

To generate (and run tests):

python -m pytest --cov-report=html:coverage_re --cov sales_data_processor

The tests cover the whole pipeline but not the entire Python file, because the script execution section is not tested explicitly.

notes

I chose to 1) drop rows with missing values (logging a warning for each) and 2) filter out rows with 0 units.
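As a rough illustration of those two choices, here is a minimal sketch using only the standard library. It is not the code in sales_data_processor.py: the function name clean_rows and the column names store, date and units are hypothetical and chosen only for the example.

```python
import csv
import io
import logging

logger = logging.getLogger("sales_cleaning_sketch")


def clean_rows(tsv_text, required=("store", "date", "units")):
    """Keep rows that have every required field and a non-zero unit count.

    The column names are assumptions made for this sketch, not the real schema.
    """
    kept = []
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    for line_no, row in enumerate(reader, start=2):  # line 1 is the header
        # Choice 1: drop rows with a missing value, but log a warning.
        if any(not (row.get(col) or "").strip() for col in required):
            logger.warning("Dropping line %d: missing value(s)", line_no)
            continue
        # Choice 2: silently filter out rows with 0 units.
        # A non-numeric units value would raise ValueError here.
        if float(row["units"]) == 0:
            continue
        kept.append(row)
    return kept


sample = (
    "store\tdate\tunits\n"
    "A\t2020-01-01\t3\n"
    "B\t\t5\n"            # missing date: dropped with a warning
    "C\t2020-01-02\t0\n"  # zero units: filtered out
)
cleaned = clean_rows(sample)
```

In this sample only the row for store A survives: the row for store B is dropped with a warning, and the row for store C is filtered out without one.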

Only limited data validation is done here. More is always possible, but how much is appropriate depends on the provenance of the data, among other things.

What to do in the various edge cases (e.g. no data for a specified store) depends on the business case at hand and can't be decided without wider consultation.

About

Coding test for Data engineer role at Qsic
