This repo for analysis on the corona virus. This includes:
- Script for extracting google sheet data and preprocessing it into a time series dataset
- Do data exploration using tableau
- Do data exploration using pandas
- Implement time series analysis and apply machine learning to predict the potential confirmed cases in the future. (waiting on more data for the days to come)
- The data come from the Novel Coronavirus (2019-nCoV) Cases, which is a live dataset provided by JHU CSSE.
- Data available here.
- In
data_prep.py
the data extracts the latest entry for each date, and aggregates all the records into a time series dataset. - The script is reproducible, but you will need to enable Google API Access to run it yourself. Follow this tutorial.