README.md (6 additions & 5 deletions)
@@ -52,15 +52,16 @@ As with the [TensorFlow code](https://github.com/openai/finetune-transformer-lm)
 python train.py --dataset rocstories --desc rocstories --submit --analysis --data_dir [path to data here]
 ```
 
-#### Accuracy on the ROCStories test set
+#### First experiments on the ROCStories test set
 Finetuning the PyTorch model for 3 Epochs on ROCStories takes 10 minutes to run on a single NVidia K-80.
 
-The test accuracy of this PyTorch version (with the default TensorFlow hyper-parameters not finetuned for the differences between PyTorch and TensorFlow internal operations) is 83.43%.
+The test accuracy of this PyTorch version is 83.43% (with the default TensorFlow hyper-parameters not finetuned on the PyTorch model to take into account the differences between PyTorch and TensorFlow internals).
+
 The authors reports a median accuracy with the TensorFlow code of 85.8%.
-The paper reports a best accuracy of 86.5%.
-The previous SOTA on the ROCStories dataset is 77.6 (Hidden Coherence Model of Chaturvedi et al. in "Story Comprehension for Predicting What Happens Next" EMNLP 2017. Which is a very nice paper by the way, you should check it out)
 
-As noted by the author, the code can be non-deterministic due to various GPU ops.
+The paper reports a best single run accuracy of 86.5%.
+
+The previous SOTA on the ROCStories dataset is 77.6% ("Hidden Coherence Model" of Chaturvedi et al. published in "Story Comprehension for Predicting What Happens Next" EMNLP 2017. Which is a very nice paper by the way, you should check it out)
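
The accuracies quoted in this README change are easier to compare as gaps in percentage points. The following is a minimal Python sketch that only restates the numbers from the text above. The variable names and the `gap` helper are illustrative and are not part of the repository.

```python
# Accuracies (in %) quoted in the README text of this change.
pytorch_port = 83.43   # this PyTorch version, default TensorFlow hyper-parameters
tf_median = 85.8       # median accuracy reported for the TensorFlow code
paper_best = 86.5      # best single run accuracy reported in the paper
previous_sota = 77.6   # "Hidden Coherence Model", Chaturvedi et al., EMNLP 2017


def gap(a, b):
    """Return a - b in percentage points, rounded to 2 decimals."""
    return round(a - b, 2)


# Hypothetical labels, chosen here only for readability.
gaps = {
    "below TensorFlow median": gap(tf_median, pytorch_port),
    "below paper best single run": gap(paper_best, pytorch_port),
    "above previous SOTA": gap(pytorch_port, previous_sota),
}

for label, points in gaps.items():
    print(f"{label}: {points:.2f} points")
```

Because the README states that the TensorFlow hyper-parameters were not re-tuned for the PyTorch model, the first two gaps may partly reflect that choice and not a difference in the model itself.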