[SOT] Mark dynamic dims by type annotations #2771
Conversation
Thanks for your contribution!
How should I understand that the Tensor type carries an implicit dynamic dim (0,)? Is that a rule?
fastdeploy/model_executor/graph_optimization/graph_optimization_backend.py
Yes, it is a rule. Essentially every Tensor's batch dim is dynamic, so marking each one explicitly would be tedious; this default rule was therefore added (the same implicit rule as in vLLM).
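To make the rule concrete, here is a minimal, hypothetical sketch of how a resolver could apply it; `dynamic_dims_of` and the simplified `DynamicDims` stand-in below are illustrative only, not the project's actual API:

```python
from typing import Annotated, Any, get_args, get_origin

import paddle


class DynamicDims:
    """Simplified stand-in for the Annotated metadata marker in this PR."""

    def __init__(self, dims: tuple[int, ...]) -> None:
        self.dims = dims


def dynamic_dims_of(annotation: Any) -> tuple[int, ...]:
    """Hypothetical helper: which dims of a Tensor argument are dynamic."""
    if get_origin(annotation) is Annotated:
        for meta in get_args(annotation)[1:]:
            if isinstance(meta, DynamicDims):
                return meta.dims
    # Implicit default rule (same as vLLM): a bare Tensor annotation is
    # treated as dynamic on the batch dim.
    return (0,)


assert dynamic_dims_of(paddle.Tensor) == (0,)
assert dynamic_dims_of(Annotated[paddle.Tensor, DynamicDims((1, 2))]) == (1, 2)
```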
LGTM
```diff
@@ -406,7 +406,7 @@ def load_state_dict(self, state_dict):
     def forward(
         self,
         ids_remove_padding: paddle.Tensor,
-        image_features: paddle.Tensor,
+        image_features: Optional[paddle.Tensor],
```
Please write Optional[paddle.Tensor] in the other forward functions as well.
Done, updated.
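For reference, a sketch of the resulting signature (simplified; the None-handling branch is an assumption for illustration, the diff itself only changes the annotation):

```python
from typing import Optional

import paddle


class DummyMultiModalModel:  # illustrative container, not the real class
    def forward(
        self,
        ids_remove_padding: paddle.Tensor,
        image_features: Optional[paddle.Tensor],
    ) -> None:
        if image_features is None:
            # e.g. a text-only batch (assumption for illustration)
            ...
```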
```diff
@@ -32,4 +33,5 @@
     "FlashAttentionBackend",
     "IluvatarAttnBackend",
     "BlockAttentionBackend",
+    "Attention",
```
If I remember correctly, adding this causes a circular import?
It won't, unless an earlier design flaw had already introduced one.
```python
def extract_inner_types(self, data, data_name, tp) -> list[tuple[Accessor[Any, Any], str, type[Any]]]:
    raise NotImplementedError


def resolve(self, data, data_name, tp) -> None:
```
All function parameters need type annotations.
Added.
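A sketch of what the fully annotated signatures could look like; the concrete types for `data` and `tp` are assumptions inferred from the return annotation above, and `Accessor` is stubbed out so the sketch is self-contained:

```python
from typing import Any, Generic, TypeVar

T = TypeVar("T")
R = TypeVar("R")


class Accessor(Generic[T, R]):
    """Placeholder for the project's Accessor type."""


class TypeResolverSketch:
    def extract_inner_types(
        self, data: Any, data_name: str, tp: type[Any]
    ) -> list[tuple[Accessor[Any, Any], str, type[Any]]]:
        raise NotImplementedError

    def resolve(self, data: Any, data_name: str, tp: type[Any]) -> None:
        raise NotImplementedError
```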
```diff
@@ -937,11 +937,11 @@ def _setting_environ_variables(self):
     "SOT_LOG_LEVEL": os.getenv("SOT_LOG_LEVEL", default="0"),
     "SOT_UNSAFE_CACHE_FASTPATH": os.getenv("SOT_UNSAFE_CACHE_FASTPATH", default="1"),
     "SOT_ENABLE_0_SIZE_FALLBACK": os.getenv("SOT_ENABLE_0_SIZE_FALLBACK", default="0"),
+    "SOT_SPECIALIZED_DIM_NUMBERS": os.getenv("SOT_SPECIALIZED_DIM_NUMBERS", default="no"),
```
Is the default here "no"?
In the framework the default is to specialize dims of size 1, i.e. "1"; in FD the default is no specialization, i.e. "no".
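So a deployment that wants the framework's behavior back would override the FD default before the workers start; a minimal sketch:

```python
import os

# FD defaults to "no" (no dim-number specialization); PaddleSOT's own default
# would specialize dims of size 1 ("1"). An explicit environment setting wins
# over the FD default shown in the diff above.
os.environ.setdefault("SOT_SPECIALIZED_DIM_NUMBERS", "1")
```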
```diff
@@ -144,7 +147,7 @@ def get_kv_cache_shape(
             self.head_dim,
         )

-    def init_attention_metadata(self, forward_meta: ForwardMeta):
+    def init_attention_metadata(self, forward_meta: "ForwardMeta"):
```
"ForwardMeta" 包字符串的作用是什么?
String annotations are commonly used to
- resolve circular imports
- reduce runtime annotation-resolution overhead

But all of these files use PEP 563, so here it makes no difference whether the quotes are added or not; they were restored just to improve readability.
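A minimal sketch of the equivalence under PEP 563 (the import path for ForwardMeta is an assumption, and the method is shown as a free function for brevity):

```python
from __future__ import annotations  # PEP 563: annotations are kept as strings

from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Only imported for type checking, so no circular import at runtime.
    # Assumed path; the real module may differ.
    from fastdeploy.model_executor.forward_meta import ForwardMeta


def init_attention_metadata(forward_meta: "ForwardMeta") -> None:
    # With PEP 563 the quotes are redundant: `forward_meta: ForwardMeta`
    # would produce exactly the same (string) annotation.
    ...
```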
This PR uses type hints to mark dynamic dimensions, so that dynamic-to-static conversion converges in a single pass and repeated graph construction is avoided. Container annotations such as dataclass and Optional[T] are supported. A plain Tensor annotation implicitly carries the dynamic dims (0,); the remaining dims are marked explicitly via Annotated[Tensor, DynamicDims((1, 2))]. An example of marking the remaining dims is shown below.
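Putting the pieces together, the marking could look like the following sketch (the DynamicDims import path and the hidden_states parameter are assumptions for illustration):

```python
from typing import Annotated, Optional

import paddle

# Assumed import path for the marker introduced by this PR.
from fastdeploy.model_executor.graph_optimization.dynamic_dims_marker import DynamicDims


class ExampleModel:
    def forward(
        self,
        # Bare Tensor: implicitly dynamic on the batch dim (0,).
        ids_remove_padding: paddle.Tensor,
        # Optional[T] is unwrapped before the inner Tensor type is inspected.
        image_features: Optional[paddle.Tensor],
        # Explicit marking: dims 1 and 2 are dynamic (hypothetical parameter).
        hidden_states: Annotated[paddle.Tensor, DynamicDims((1, 2))],
    ) -> None:
        ...
```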
In addition, the warmup phase checks whether any graph break occurred and whether the graph was built more than once to decide whether a faster implementation can be used that executes the compiled code directly; when it cannot, a warning is reported.