File Ingestion and Schema Validation

Take any csv/text file of 2+ GB of your choice.
Read the file ( Present approach of reading the file )
Try different methods of file reading eg: Dask, Modin, Ray, pandas and present your findings in term of computational efficiency
Perform basic validation on data columns : eg: remove special character , white spaces from the col name
As you already know the schema hence create a YAML file and write the column name in YAML file. --define separator of read and write file, column name in YAML
Validate number of columns and column name of ingested file with YAML.
Write the file in pipe separated text file (|) in gz format.
Create a summary of the file: Total number of rows, total number of columns, file size

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
File ingestion and schema validation.ipynb		File ingestion and schema validation.ipynb
README.md		README.md
file.yaml		file.yaml
utility.py		utility.py

Provide feedback