Adding a range of multilingual evals #832

Draft: wants to merge 15 commits into base: main

clefourrier
Member

No description provided.

@clefourrier marked this pull request as draft June 25, 2025 16:02

BELEBELE_TASKS = [
    LightevalTaskConfig(
        name=f"belebele_instruct_{lang}_Latn",
Member

Maybe we should call this `belebele_instruct_5_{lang}_Latn` or `belebele_instruct_smollm_{lang}_Latn` to distinguish it from the general case with more languages?

The alternative would be to have a separate `belebele_instruct_en_{lang}_{script}` for the full set of languages, but with English instructions.

Member Author

@clefourrier Jun 26, 2025

Will add the latter, and have `belebele_native_inst_{lang}` vs `belebele_en_inst_{lang}` :)
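
To make the naming proposal concrete, here is a minimal sketch of how the two name families could be generated side by side. It is plain Python and does not touch lighteval; the language codes in `LANGS` and the helper `belebele_task_names` are hypothetical and only illustrate the naming pattern from the comment above.

```python
# Hypothetical language codes; the PR's actual language list is not shown in this thread.
LANGS = ["fra", "spa", "deu"]


def belebele_task_names(lang: str) -> dict[str, str]:
    """Return one task name per instruction language for a target language.

    "native" means the instructions are written in the target language,
    "english" means the instructions are written in English.
    """
    return {
        "native": f"belebele_native_inst_{lang}",
        "english": f"belebele_en_inst_{lang}",
    }


# Flatten into a single list and check that the two families never collide.
ALL_NAMES = [name for lang in LANGS for name in belebele_task_names(lang).values()]
assert len(ALL_NAMES) == len(set(ALL_NAMES))
```

Putting the instruction language in the name keeps both variants selectable by name without relying on a separate flag.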

clefourrier and others added 5 commits June 26, 2025 09:23
* too many false positives with the current gpqa metric extraction, making it more strict

* fixing whitespace and instruction in prompt

* better to have a strict extraction for index extraction in general actually

* added comment

* fix tests, need to invert condition

3 participants