Commit 9c80efe

repo update
1 parent 5ad5182 commit 9c80efe

2 files changed: +11 -4 lines changed

README.md

Lines changed: 11 additions & 4 deletions
@@ -1,11 +1,18 @@
-# interpolationStudy_toHandleMissingData_BTCdataset
-Imputing the Time-Series Using Python
+# Interpolation Study to Handle Missing Data on BTC dataset :chart_with_upwards_trend:
 
 Time series are an important form of indexed data found specially in stocks data. Due to its time dependency, time series are subject to have missing points due to problems in reading or recording the data.
+
 To apply machine learning models effectively, the time series has to be continuous, as most of the ML models are not designed to deal with missing values. Hence, the rows with missing data should be either dropped or filled with appropriate values.
 
 To prove this assumption, let’s take an example and solve it in python.
+
 I generated an OHLCV candle dataset from a bitcoin exchange.
-I took a small range without NaN and randomly added a percentage of it. I then tried various interpolation techniques in order to see: as the percentage of NaN varies, which interpolation method is the best in the case of stocks data.
 
-Thanks to Dr Mohammad El-Nesr and his work here https://drnesr.medium.com/filling-gaps-of-a-time-series-using-python-d4bfddd8c460
+I took a small range without NaN and randomly added a percentage of it.
+
+**I then tried various interpolation techniques in order to see: as the percentage of NaN varies, which interpolation method is the best in the case of stocks data**.
+
+<img src="imgs/chart.png" width="100%" />
+
+#### **References**
+Thanks to *Dr Mohammad El-Nesr* and his work [here](https://drnesr.medium.com/filling-gaps-of-a-time-series-using-python-d4bfddd8c460).
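
The experiment this README describes (take a complete range, blank out a random share of it with NaN, fill the gaps with several methods, and compare the results as the NaN share grows) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the repository's code: the synthetic random-walk prices stand in for the real BTC candles, the helper names (`inject_nan`, `fill_linear`, `fill_forward`, `fill_mean`, `score_methods`) are hypothetical, and both the three fill methods and RMSE as the error metric are choices made here for illustration. It uses only the standard library so that it runs anywhere.

```python
import math
import random


def inject_nan(values, fraction, rng):
    """Copy `values` and set about `fraction` of the points to NaN.

    Both endpoints are kept so that every gap has a known value on each side.
    """
    out = list(values)
    interior = range(1, len(out) - 1)
    k = min(round(fraction * len(out)), len(interior))
    for i in rng.sample(interior, k):
        out[i] = math.nan
    return out


def fill_linear(values):
    """Fill each run of NaN along the straight line between its neighbours."""
    out = list(values)
    i = 0
    while i < len(out):
        if math.isnan(out[i]):
            left, right = i - 1, i
            while math.isnan(out[right]):
                right += 1
            step = (out[right] - out[left]) / (right - left)
            for j in range(i, right):
                out[j] = out[left] + step * (j - left)
            i = right
        else:
            i += 1
    return out


def fill_forward(values):
    """Fill each NaN with the last known value before it."""
    out = list(values)
    for i in range(1, len(out)):
        if math.isnan(out[i]):
            out[i] = out[i - 1]
    return out


def fill_mean(values):
    """Fill every NaN with the mean of the known values."""
    known = [v for v in values if not math.isnan(v)]
    mean = sum(known) / len(known)
    return [mean if math.isnan(v) else v for v in values]


def rmse(truth, filled, masked):
    """Root-mean-square error, measured only at the masked positions."""
    return math.sqrt(sum((truth[i] - filled[i]) ** 2 for i in masked) / len(masked))


def score_methods(truth, fractions, seed=0):
    """Return {fraction: {method name: RMSE}} for each NaN fraction."""
    rng = random.Random(seed)
    methods = {"linear": fill_linear, "forward_fill": fill_forward, "mean": fill_mean}
    results = {}
    for frac in fractions:
        gappy = inject_nan(truth, frac, rng)
        masked = [i for i, v in enumerate(gappy) if math.isnan(v)]
        results[frac] = {name: rmse(truth, fn(gappy), masked) for name, fn in methods.items()}
    return results


# Assumption: a synthetic random walk replaces the real "close" column.
walk = random.Random(42)
close = [30000.0]
for _ in range(499):
    close.append(close[-1] + walk.gauss(0, 50))

for frac, scores in score_methods(close, [0.05, 0.20, 0.50]).items():
    best = min(scores, key=scores.get)
    detail = ", ".join(f"{name}={err:.1f}" for name, err in scores.items())
    print(f"{frac:.0%} NaN: best={best} ({detail})")
```

With real candles loaded into a pandas `Series`, the hand-written fill functions would normally give way to `Series.interpolate(method=...)`, `Series.ffill()`, and `Series.fillna(...)`; the masking and scoring loop stays the same. Scoring only at the masked positions matters, because the untouched points would otherwise dilute the error and hide the differences between methods.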

imgs/chart.png

71.8 KB
