Conversation


@YanhuiDua YanhuiDua commented Nov 7, 2025

Users configure concurrency with rollout_max_batch_size_per_instance as the baseline, from which the dataflow and other concurrency settings are derived. See the documentation in xtuner/docs/zh_cn/rl/advanced_tutorial/efficiency.md.

The possible cases are:

  1. rollout_max_batch_size_per_instance is not provided: xtuner derives a recommended inference-engine concurrency from context_length.
  2. DataFlowConfig.max_concurrent is not provided: xtuner computes max_concurrent from rollout_max_batch_size_per_instance, the number of inference-engine instances, prompt_repeat_k, and related hyperparameters.
  3. The user provides both rollout_max_batch_size_per_instance and DataFlowConfig.max_concurrent: the user-provided values are used.
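
Case 2 above (deriving DataFlowConfig.max_concurrent when only the per-instance batch size is given) can be sketched roughly as follows. This is a minimal illustration, not xtuner's actual implementation: the function name, the num_instances parameter, and the exact formula are assumptions based on the factors the PR description lists.

```python
def derive_max_concurrent(rollout_max_batch_size_per_instance: int,
                          num_instances: int,
                          prompt_repeat_k: int) -> int:
    """Hypothetical derivation of DataFlowConfig.max_concurrent.

    Each dataflow sample fans out into prompt_repeat_k rollout requests,
    so the number of concurrent samples is the total engine capacity
    divided by the per-prompt fan-out.
    """
    total_capacity = rollout_max_batch_size_per_instance * num_instances
    return max(1, total_capacity // prompt_repeat_k)

# 4 engine instances, 64 requests each, 8 rollouts per prompt
print(derive_max_concurrent(64, 4, 8))  # → 32
```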

@YanhuiDua YanhuiDua force-pushed the update_rollout_worker branch from 7a90af8 to f0bc0f8 Compare November 7, 2025 10:03
@YanhuiDua YanhuiDua changed the title from "[feat] update "rollout_max_batch_size" to replace "max_concurrent" for user settings" to "update "rollout_max_batch_size" to replace "max_concurrent" for user settings" Nov 7, 2025
    ]
    self.dataloader_cfg = DataloaderConfig(
-       pack_max_length=self.max_prompt_length,
+       pack_max_length=self.max_prompt_length + self.max_response_length,
Remove this line.
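
If the reviewer's request targets the duplicated pack_max_length keyword in the diff above, the reason is mechanical: Python rejects a repeated keyword argument at compile time. A minimal demonstration (the call below is hypothetical, not xtuner code):

```python
# A repeated keyword argument is a SyntaxError in Python, so the two
# pack_max_length= lines cannot coexist in one call.
src = "DataloaderConfig(pack_max_length=1, pack_max_length=2)"
try:
    compile(src, "<snippet>", "eval")
    print("compiled")
except SyntaxError:
    print("SyntaxError: keyword argument repeated")
```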

@YanhuiDua YanhuiDua force-pushed the update_rollout_worker branch from 2890ff6 to e7b7f5b Compare November 7, 2025 12:09

RangiLyu commented Nov 7, 2025

The rollout worker's http_concurrency needs to be able to exceed rollout_max_batch_size, but it is still being computed from rollout_max_batch_size.


RangiLyu commented Nov 7, 2025

> The rollout worker's http_concurrency needs to be able to exceed rollout_max_batch_size, but it is still being computed from rollout_max_batch_size.

max_concurrent and http_concurrency can be tied together: both are client-side request counts, whereas rollout_max_batch_size is an inference-engine-side parameter. The two must be configured separately; otherwise GPU power cannot be fully utilized.
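
The separation RangiLyu describes can be sketched as a small config object: the engine-side batch cap and the client-side request concurrency are distinct knobs, with the client allowed to over-subscribe so new requests are already queued when a slot frees up. The class and the over_concurrency_ratio default below are illustrative assumptions, not xtuner's real API:

```python
from dataclasses import dataclass

@dataclass
class ConcurrencySketch:
    """Hypothetical sketch separating engine-side and client-side knobs."""
    rollout_max_batch_size: int          # engine side: max in-flight batch
    over_concurrency_ratio: float = 1.2  # client may over-subscribe

    @property
    def http_concurrency(self) -> int:
        # Client side, tied to max_concurrent; deliberately allowed to
        # exceed the engine batch size so the GPU stays saturated.
        return int(self.rollout_max_batch_size * self.over_concurrency_ratio)

cfg = ConcurrencySketch(rollout_max_batch_size=128)
print(cfg.http_concurrency)  # → 153
```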

-    api_key (Optional[Union[List[str], str]]): API keys for rollout service. åSupports single key or list of keys. Defaults to None.
+    api_key (Optional[Union[List[str], str]]): API keys for rollout service.
+        Supports single key or list of keys. Defaults to None.

Fix the typo (the stray "å") before "Supports".

@YanhuiDua YanhuiDua merged commit 0b9f1e8 into InternLM:main Nov 10, 2025
4 of 5 checks passed
@YanhuiDua
Collaborator Author

> The rollout worker's http_concurrency needs to be able to exceed rollout_max_batch_size, but it is still being computed from rollout_max_batch_size.

done, configured via http_concurrency = config.rollout_max_batch_size_per_instance * config.allow_over_concurrency_ratio
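
A worked instance of the merged formula, using the parameter names from the comment above; the numeric values are illustrative, not defaults from the PR:

```python
# http_concurrency is derived from the engine-side batch cap times an
# over-subscription ratio, so the client can queue more requests than
# the engine admits at once (per RangiLyu's review).
rollout_max_batch_size_per_instance = 256
allow_over_concurrency_ratio = 1.5

http_concurrency = int(rollout_max_batch_size_per_instance
                       * allow_over_concurrency_ratio)
print(http_concurrency)  # → 384
```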

