[acc fix]Change the environment set of DeepSeek-R1 FP4 scripts. #813

ZhiweiYan-96 · 2025-11-19T06:15:21Z

This PR fix the Deepseek-r1 MXFP4 accuracy issue. It is based on the fix in PR #809, and uses triton mha for MLA.

Purpose

Test Plan

bash evaluation/deepseek_fp4/launch_deepseekr1_fp4_TP.sh

Test Result

aiter commit： d0a40f55ca1d552f20f2dd55741e7309c936a9d1 (Branch dev/perf)
Eager mode

FULL_AND_PIECEWISE mode

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

zejunchen-zejun · 2025-11-19T06:17:53Z

evaluation/deepseek_fp4/launch_deepseekr1_fp4_TP.sh

 vllm serve $model_path \
  --host localhost \
-  --port 9000 \
+  --port 6789 \


can we remain the port number

zejunchen-zejun · 2025-11-19T06:19:07Z

We need rebase the latest origin/dev/perf

zejunchen-zejun · 2025-11-19T06:19:37Z

evaluation/deepseek_fp4/launch_deepseekr1_fp4_TP.sh

-  --seed 123 2>&1 | tee log.server.log &
+  --gpu_memory_utilization 0.7 \
+  --block-size 1 \
+  --seed 123 2>&1 | tee log.server.log


can we add back the &

Signed-off-by: zhuyuhua-v <[email protected]>

zejunchen-zejun · 2025-11-20T01:02:00Z

Is this PR ok to merge? We plan to merge it today.

Signed-off-by: ZhiweiYan-96 <[email protected]>

ZhiweiYan-96 requested review from kliuae-amd, tjtanaavllm, wuhuikx and zejunchen-zejun as code owners November 19, 2025 06:15

zejunchen-zejun reviewed Nov 19, 2025

View reviewed changes

ZhiweiYan-96 changed the title ~~Change the environment set of DeepSeek-R1 FP4 scripts.~~ [acc fix]Change the environment set of DeepSeek-R1 FP4 scripts. Nov 19, 2025

ZhiweiYan-96 changed the title ~~[acc fix]Change the environment set of DeepSeek-R1 FP4 scripts.~~ [wip][acc fix]Change the environment set of DeepSeek-R1 FP4 scripts. Nov 19, 2025

zhuyuhua-v and others added 3 commits November 19, 2025 07:54

use aiter triton kernel as triton mha fallback path

1d7fa5d

Signed-off-by: zhuyuhua-v <[email protected]>

Update environment for fixing deepseek fp4 acc

e54682c

[ds fp4] set block-size to 16

670b9a4

ZhiweiYan-96 force-pushed the zhiwei/fp4_acc branch from df11d7a to 670b9a4 Compare November 19, 2025 08:01

lint

82080ea

Signed-off-by: ZhiweiYan-96 <[email protected]>

ZhiweiYan-96 changed the title ~~[wip][acc fix]Change the environment set of DeepSeek-R1 FP4 scripts.~~ [acc fix]Change the environment set of DeepSeek-R1 FP4 scripts. Nov 20, 2025

zejunchen-zejun merged commit d73b302 into dev/perf Nov 20, 2025
3 of 4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[acc fix]Change the environment set of DeepSeek-R1 FP4 scripts. #813

[acc fix]Change the environment set of DeepSeek-R1 FP4 scripts. #813

Uh oh!

ZhiweiYan-96 commented Nov 19, 2025 •

edited by github-actions bot

Loading

Uh oh!

zejunchen-zejun Nov 19, 2025

Uh oh!

ZhiweiYan-96 Nov 19, 2025

Uh oh!

zejunchen-zejun commented Nov 19, 2025

Uh oh!

zejunchen-zejun Nov 19, 2025

Uh oh!

ZhiweiYan-96 Nov 19, 2025

Uh oh!

zejunchen-zejun commented Nov 20, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[acc fix]Change the environment set of DeepSeek-R1 FP4 scripts. #813

[acc fix]Change the environment set of DeepSeek-R1 FP4 scripts. #813

Uh oh!

Conversation

ZhiweiYan-96 commented Nov 19, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

zejunchen-zejun Nov 19, 2025

Choose a reason for hiding this comment

Uh oh!

ZhiweiYan-96 Nov 19, 2025

Choose a reason for hiding this comment

Uh oh!

zejunchen-zejun commented Nov 19, 2025

Uh oh!

zejunchen-zejun Nov 19, 2025

Choose a reason for hiding this comment

Uh oh!

ZhiweiYan-96 Nov 19, 2025

Choose a reason for hiding this comment

Uh oh!

zejunchen-zejun commented Nov 20, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ZhiweiYan-96 commented Nov 19, 2025 •

edited by github-actions bot

Loading