tester update
MikeGu721 committed Jun 28, 2023
1 parent f1d0e24 commit 9f0f65d
Showing 7 changed files with 789 additions and 406 deletions.
7 changes: 6 additions & 1 deletion .gitignore
@@ -1,3 +1,8 @@
./venv
./resources/data/xiezhi_all
.idea
.idea
./Tester/tasks/ceval/
./Tester/tasks/ceval_train/
./Tester/tasks/m3ke/
./Tester/tasks/mmlu/
./Tester/tasks/mmlu_train/
4 changes: 4 additions & 0 deletions README.md
@@ -14,6 +14,7 @@ We hope Xiezhi could help developers track the progress and analyze the importan
- [Leaderboard](#leaderboard)
- [Experiment Setting](#experiment-setting)
- [Data](#data)
- [How To Run Your Own Test](#how-to-run-your-own-test)
- [TODO](#todo)
- [Licenses](#licenses)
- [Citation](#citation)
@@ -61,6 +62,9 @@ Example of question in Xiezhi Interdiscipline:
Example of our few-shot learning setting:
<p align="center"> <img src="resources/question-3shot.png" style="width: 100%;" id="question-3shot"></p>

## How To Run Your Own Test
-

## TODO
- [ ] add results of traditional 4 options experiments setting
- [ ] add results of more API-based models
773 changes: 773 additions & 0 deletions Tester/model_test.py

Large diffs are not rendered by default.


File renamed without changes.
1 change: 1 addition & 0 deletions Tester/test.sh
@@ -0,0 +1 @@
python -u model_test.py
