Skip to content

Commit

Permalink
Adjust baselines for lower number of epochs improved perplexity, lowe…
Browse files Browse the repository at this point in the history
…r throughput (#1748)
  • Loading branch information
emascarenhas authored Feb 7, 2025
1 parent a0d14d2 commit f75b6bd
Showing 1 changed file with 6 additions and 6 deletions.
12 changes: 6 additions & 6 deletions tests/baselines/falcon_40b.json
Original file line number Diff line number Diff line change
Expand Up @@ -7,9 +7,9 @@
"multi_card": {
"learning_rate": 4e-4,
"train_batch_size": 1,
"perplexity": 4.0893,
"train_runtime": 360,
"train_samples_per_second": 28.162,
"perplexity": 4.0,
"train_runtime": 550,
"train_samples_per_second": 15.0,
"extra_arguments": [
"--bf16",
"--gradient_accumulation_steps 16",
Expand Down Expand Up @@ -42,9 +42,9 @@
"multi_card": {
"learning_rate": 4e-4,
"train_batch_size": 1,
"perplexity": 4.0893,
"train_runtime": 470,
"train_samples_per_second": 28.162,
"perplexity": 1.6,
"train_runtime": 710,
"train_samples_per_second": 15.0,
"extra_arguments": [
"--bf16",
"--gradient_accumulation_steps 16",
Expand Down

0 comments on commit f75b6bd

Please sign in to comment.