Support long context dataset accuracy measurement. #230

Merged: 1 commit, Mar 25, 2025

Conversation

Collaborator

@Lumosis Lumosis commented Mar 21, 2025

The result should look like this:


Results

{'rougeL': 6.165264881957181, 'exact_match': 0.0, 'gen_len': 59242, 'gen_num': 50}
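For readers unfamiliar with this result shape, here is a minimal sketch of how three of the fields might be aggregated from per-sample outputs. It is not the code from this PR: the function name `summarize`, the plain string comparison, and the choice to count `gen_len` in characters are all assumptions made for illustration. `rougeL` is left out because it would need a ROUGE implementation.

```python
def summarize(preds, labels):
    """Aggregate predictions into a result dict (hypothetical helper).

    Assumptions, not taken from the PR:
      - exact_match is the percentage of predictions equal to their label
        after stripping surrounding whitespace;
      - gen_len is the total number of characters generated;
      - gen_num is the number of generated samples.
    """
    if len(preds) != len(labels):
        raise ValueError("preds and labels must have the same length")

    matches = sum(p.strip() == l.strip() for p, l in zip(preds, labels))
    score = 100.0 * matches / len(preds) if preds else 0.0

    return {
        "exact_match": round(score, 2),
        "gen_len": sum(len(p) for p in preds),
        "gen_num": len(preds),
    }


if __name__ == "__main__":
    print(summarize(["a", "b "], ["a", "c"]))
```

The rounding to two decimals mirrors the `round(score, 2)` call visible in the diff context further down; everything else is a guess at the semantics.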

@vipannalla
Collaborator

The results you pasted -- are they from an actual benchmark run on 405b? Can you paste the full results (as a screenshot or paste link)?

@Lumosis
Collaborator Author

Lumosis commented Mar 21, 2025

> The results you pasted -- are they from an actual benchmark run on 405b? Can you paste the full results (as a screenshot or paste link)?

No, this is from a mock run. I am working on the actual benchmarking.

@vipannalla
Collaborator

sounds good

@Lumosis Lumosis force-pushed the lihao/accuracy branch 5 times, most recently from 91fbb46 to 877f8b2 on March 21, 2025 20:44
return {"exact_match": round(score, 2)}


def qa_em(label, pred):
Collaborator

Please add a docstring to each function.

Collaborator Author

Done
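As an illustration of what a documented exact-match helper could look like, here is a sketch that keeps the `qa_em(label, pred)` signature from the diff context. The body is an assumption, not the merged code: it uses a common SQuAD-style normalization (lowercasing, stripping punctuation and English articles), and `normalize_answer` is a hypothetical helper name.

```python
import re
import string


def normalize_answer(text):
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Hypothetical helper; the normalization used in the PR is not shown.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def qa_em(label, pred):
    """Score a single question-answering prediction by exact match.

    Args:
        label: Reference answer string.
        pred: Model-generated answer string.

    Returns:
        1.0 if the normalized strings are identical, otherwise 0.0.
    """
    return float(normalize_answer(label) == normalize_answer(pred))


if __name__ == "__main__":
    print(qa_em("The Eiffel Tower", "eiffel tower."))
    print(qa_em("Paris", "London"))
```

Normalizing before comparing keeps the metric from penalizing differences in case or punctuation; whether that is appropriate depends on the dataset's own scoring rules.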

@Lumosis Lumosis force-pushed the lihao/accuracy branch 2 times, most recently from 4a56d13 to d0ef346 on March 24, 2025 23:43
Collaborator

@vipannalla vipannalla left a comment

Looks good

@Lumosis Lumosis merged commit 351462e into main Mar 25, 2025
2 of 3 checks passed
@Lumosis Lumosis deleted the lihao/accuracy branch March 25, 2025 21:33
3 participants