small tweaks

thomwolf · web-flow · commit 7b1d4775d914 · 2018-06-19T22:02:52.000+02:00
diff --git a/README.md b/README.md
@@ -55,11 +55,9 @@ python train.py --dataset rocstories --desc rocstories --submit --analysis --dat
 #### First experiments on the ROCStories test set
 Finetuning the PyTorch model for 3 Epochs on ROCStories takes 10 minutes to run on a single NVidia K-80.
 
-The test accuracy of this PyTorch version is 85.25%, while the authors reports a median accuracy with the TensorFlow code of 85.8%.
+The test accuracy of this PyTorch version is 85.25%, while the authors reports a median accuracy with the TensorFlow code of 85.8% and the paper reports a best single run accuracy of 86.5%.
 
-The reason for this small discrepency is likely that the authors implementations uses 8 GPU and can thus accomodate a batch of 64 samples while the present implementation is single GPU and is thus limited to 16 instance on a K80 for memory reasons.
-
-The paper reports a best single run accuracy of 86.5%.
+The reason for this 0.7-1.3% discrepency is likely that the authors implementations uses 8 GPU and can thus accomodate a batch of 64 samples while the present implementation is single GPU and is in consequence limited to 16 instances on a K80 for memory reasons. In our test, increasing the batch size from 8 to 16 samples increased the test accuracy by 2 points.
 
 The previous SOTA on the ROCStories dataset is 77.6% ("Hidden Coherence Model" of Chaturvedi et al. published in "Story Comprehension for Predicting What Happens Next" EMNLP 2017. Which is a very nice paper by the way, you should check it out)