How to set generation sampling parameters? #328

Open
rawsh opened this issue Feb 14, 2025 · 4 comments

Comments

@rawsh

rawsh commented Feb 14, 2025

I need to use the DeepSeek reference settings: temperature=0.6, top_p=0.95.

Greedy sampling does poorly on AIME:

r1-1.5B

  • AIME24: 23.33%
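
For readers unfamiliar with the two settings, here is a minimal, library-free sketch of what temperature and top_p do at a single decoding step, with greedy decoding as the temperature=0 case. It is only an illustration under stated assumptions, not how lighteval or vLLM implement sampling; the function name `sample_token` and the toy logits are hypothetical.

```python
import math
import random


def sample_token(logits, temperature=0.6, top_p=0.95, rng=random):
    """Pick one token index from raw logits using temperature + nucleus (top-p) sampling."""
    if temperature <= 0:
        # Greedy decoding: always take the highest-scoring token.
        return max(range(len(logits)), key=lambda i: logits[i])

    # Temperature-scaled softmax (subtract the max for numerical stability).
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    weights = [math.exp(x - peak) for x in scaled]
    total = sum(weights)
    probs = [w / total for w in weights]

    # Keep the smallest set of most-likely tokens whose cumulative mass reaches top_p.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    nucleus, mass = [], 0.0
    for i in ranked:
        nucleus.append(i)
        mass += probs[i]
        if mass >= top_p:
            break

    # Sample within the nucleus; random.choices renormalizes the weights.
    return rng.choices(nucleus, weights=[probs[i] for i in nucleus], k=1)[0]


toy_logits = [2.0, 1.0, 0.1, -3.0]  # hypothetical scores for a 4-token vocabulary
greedy_pick = sample_token(toy_logits, temperature=0.0)
sampled_picks = [sample_token(toy_logits, rng=random.Random(seed)) for seed in range(200)]
```

With these toy logits, greedy decoding always returns index 0, while the sampled picks vary between the two most likely tokens; the low-probability tail is cut off by top_p.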

I tried following the lighteval docs but ran into issues using a model config file:

model: # Model specific parameters
  base_params:
    model_args: "pretrained=Qwen/Qwen2.5-7B-Instruct,dtype=bfloat16,max_model_length=768,gpu_memory_utilisation=0.7" # Model args that you would pass in the command line
  generation: # Generation specific parameters
    temperature: 1.0
    stop_tokens: null
    truncate_prompt: false

run with:

TASK=aime24 lighteval vllm \
    "config.yaml" \
    "custom|$TASK|0|0" \
    --custom-tasks tasks.py \
    --use-chat-template \
    --output-dir ./results/

hitting:

TypeError: expected str, bytes or os.PathLike object, not dict

ref

@rawsh
Author

rawsh commented Feb 15, 2025

OK, figured this out. Would a PR for a Python pipeline with optional generation params be helpful?

@rawsh
Author

rawsh commented Feb 17, 2025

r1-1.5B (temp=0.6, top_p=0.95)

AVG: 44.98%

  • AIME25: 20%
  • AIMO-AIME: 21.11%
  • AIMO-AMC: 67.47%
  • MATH_500: 82%
  • GPQA Diamond: 34.34%
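
As a quick check, the reported AVG appears to be the unweighted mean of the five benchmark scores listed above. The snippet below only reproduces that arithmetic; treating the average as unweighted is an assumption, and the variable names are illustrative.

```python
# Scores copied from the comment above (percent).
scores = {
    "AIME25": 20.0,
    "AIMO-AIME": 21.11,
    "AIMO-AMC": 67.47,
    "MATH_500": 82.0,
    "GPQA Diamond": 34.34,
}

# Unweighted mean across benchmarks (assumed to be how AVG was computed).
average = sum(scores.values()) / len(scores)
print(f"AVG: {average:.2f}%")  # prints "AVG: 44.98%"
```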

@oasis-0927

So how did you manage to set those parameters? I tried to change MODEL_ARGS in the evaluation script but it didn't work. Thx.

@oasis-0927

> So how did you manage to set those parameters? I tried to change MODEL_ARGS in the evaluation script but it didn't work. Thx.

NVM, I found that using commit d4e6f59bd106d4cf2fda0f8fae3dadafe557515b works fine.
