Skip to content

Actions: allenai/reward-bench

Actions

Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
574 workflow runs
574 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Best of N pipeline + tests
Tests #49: Pull request #30 synchronize by natolambert
February 15, 2024 22:00 3m 49s b_o_n
February 15, 2024 22:00 3m 49s
Best of N pipeline + tests
Tests #48: Pull request #30 opened by natolambert
February 15, 2024 21:41 4m 0s b_o_n
February 15, 2024 21:41 4m 0s
Per token multiple rms
Tests #47: Pull request #29 opened by khyathiraghavi
February 15, 2024 21:31 3m 43s per-token-multiple-rms
February 15, 2024 21:31 3m 43s
Per token multiple rms
Tests #46: Pull request #28 opened by khyathiraghavi
February 15, 2024 21:25 3m 45s per-token-multiple-rms
February 15, 2024 21:25 3m 45s
visualizing multiple rewards
Tests #45: Pull request #27 opened by khyathiraghavi
February 15, 2024 21:19 4m 23s per-token-multiple-rms
February 15, 2024 21:19 4m 23s
Add model type to results
Tests #44: Pull request #26 synchronize by natolambert
February 15, 2024 17:45 4m 6s model_type
February 15, 2024 17:45 4m 6s
Add model type to results
Tests #43: Pull request #26 opened by natolambert
February 15, 2024 17:44 4m 36s model_type
February 15, 2024 17:44 4m 36s
Update per token reward
Tests #42: Pull request #25 opened by ljvmiranda921
February 14, 2024 20:32 4m 5s update/per-token-reward
February 14, 2024 20:32 4m 5s
Clean repo (#23)
Tests #41: Commit 84c0a9b pushed by natolambert
February 14, 2024 16:49 4m 27s main
February 14, 2024 16:49 4m 27s
Clean repo
Tests #40: Pull request #23 opened by natolambert
February 13, 2024 22:22 4m 57s clean
February 13, 2024 22:22 4m 57s
Merge pull request #21 from allenai/save_scores
Tests #39: Commit f441f9c pushed by natolambert
February 13, 2024 01:01 3m 51s main
February 13, 2024 01:01 3m 51s
Save scores per prompt
Tests #38: Pull request #21 synchronize by natolambert
February 12, 2024 23:46 4m 59s save_scores
February 12, 2024 23:46 4m 59s
Save scores per prompt
Tests #37: Pull request #21 synchronize by natolambert
February 12, 2024 23:27 4m 4s save_scores
February 12, 2024 23:27 4m 4s
Save scores per prompt
Tests #36: Pull request #21 opened by natolambert
February 12, 2024 23:08 3m 54s save_scores
February 12, 2024 23:08 3m 54s
Merge pull request #20 from allenai/data_formatting
Tests #35: Commit a4eec4a pushed by natolambert
February 12, 2024 22:18 4m 2s main
February 12, 2024 22:18 4m 2s
Change data storage location
Tests #34: Pull request #20 opened by natolambert
February 12, 2024 21:58 4m 22s data_formatting
February 12, 2024 21:58 4m 22s
Merge pull request #18 from allenai/docker-eval
Tests #33: Commit 2f7a287 pushed by jacob-morrison
February 12, 2024 19:57 4m 37s main
February 12, 2024 19:57 4m 37s
Add docker image and script for submitting eval jobs
Tests #32: Pull request #18 synchronize by jacob-morrison
February 12, 2024 19:18 3m 47s docker-eval
February 12, 2024 19:18 3m 47s
Add docker image and script for submitting eval jobs
Tests #31: Pull request #18 synchronize by jacob-morrison
February 12, 2024 19:15 4m 5s docker-eval
February 12, 2024 19:15 4m 5s
Add docker image and script for submitting eval jobs
Tests #30: Pull request #18 synchronize by jacob-morrison
February 12, 2024 18:58 4m 11s docker-eval
February 12, 2024 18:58 4m 11s
Add function to get subtoken statistics (#17)
Tests #29: Commit 85b6a4a pushed by ljvmiranda921
February 9, 2024 22:15 4m 8s main
February 9, 2024 22:15 4m 8s
Merge pull request #13 from allenai/beaver_fix
Tests #28: Commit 0299429 pushed by natolambert
February 9, 2024 19:21 3m 59s main
February 9, 2024 19:21 3m 59s
Beaver fix; working towards another model
Tests #27: Pull request #13 synchronize by natolambert
February 9, 2024 19:15 3m 43s beaver_fix
February 9, 2024 19:15 3m 43s
Add function to get subtoken statistics
Tests #26: Pull request #17 opened by ljvmiranda921
February 9, 2024 18:38 3m 52s add/subtoken-counter
February 9, 2024 18:38 3m 52s
Fix code formatting (#15)
Tests #25: Commit a4a5f38 pushed by ljvmiranda921
February 9, 2024 04:47 4m 41s main
February 9, 2024 04:47 4m 41s
ProTip! You can narrow down the results and go further in time using created:<2024-02-09 or the other filters available.