Skip to content

Commit d74d00c

Browse files
authored
Updated test accuracy to 85.84 with batch size 20
1 parent 7b1d477 commit d74d00c

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

README.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -55,11 +55,11 @@ python train.py --dataset rocstories --desc rocstories --submit --analysis --dat
5555
#### First experiments on the ROCStories test set
5656
Finetuning the PyTorch model for 3 Epochs on ROCStories takes 10 minutes to run on a single NVidia K-80.
5757

58-
The test accuracy of this PyTorch version is 85.25%, while the authors reports a median accuracy with the TensorFlow code of 85.8% and the paper reports a best single run accuracy of 86.5%.
58+
The single run test accuracy of this PyTorch version is 85.84%, while the authors reports a median accuracy with the TensorFlow code of 85.8% and the paper reports a best single run accuracy of 86.5%.
5959

60-
The reason for this 0.7-1.3% discrepency is likely that the authors implementations uses 8 GPU and can thus accomodate a batch of 64 samples while the present implementation is single GPU and is in consequence limited to 16 instances on a K80 for memory reasons. In our test, increasing the batch size from 8 to 16 samples increased the test accuracy by 2 points.
60+
The authors implementations uses 8 GPU and can thus accomodate a batch of 64 samples while the present implementation is single GPU and is in consequence limited to 20 instances on a K80 for memory reasons. In our test, increasing the batch size from 8 to 20 samples increased the test accuracy by 2.5 points. A better accuracy may be obtained by using a multi-GPU setting (on the TO-DO list).
6161

62-
The previous SOTA on the ROCStories dataset is 77.6% ("Hidden Coherence Model" of Chaturvedi et al. published in "Story Comprehension for Predicting What Happens Next" EMNLP 2017. Which is a very nice paper by the way, you should check it out)
62+
The previous SOTA on the ROCStories dataset is 77.6% ("Hidden Coherence Model" of Chaturvedi et al. published in "Story Comprehension for Predicting What Happens Next" EMNLP 2017, which is a very nice paper too!)
6363

6464
### TO-DO list
6565
- [ ] Add Multi-GPU training logic

0 commit comments

Comments
 (0)