[Model] Upstream Deepseek-OCR model #27247
base: main
Conversation
Signed-off-by: Isotr0py <[email protected]>
Documentation preview: https://vllm--27247.org.readthedocs.build/en/27247/
The model should work now, but we still need a full cleanup of the commented-out debug code carried over from the upstream codebase. 😅
I'll clean it up - thanks for your work!
Signed-off-by: Roger Wang <[email protected]>
💡 Codex Review
Here are some automated review suggestions for this pull request.
```python
if self.text_config.topk_method == "noaux_tc":
    architectures = ["DeepseekV3ForCausalLM"]
elif not self.text_config.use_mla:
    architectures = ["DeepseekForCausalLM"]
```
Why can't we use `DeepseekV2ForCausalLM` in the non-MLA scenario? `DeepseekV2ForCausalLM` also supports non-MLA; see vllm/vllm/model_executor/models/deepseek_v2.py, line 1001 in 09a7e6f: `if model_config.use_mla:`
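The cited check picks the attention path from a config flag, so one causal-LM class can serve both modes. A minimal sketch of that dispatch pattern, with purely illustrative class names (not vLLM's actual classes):

```python
class MLAAttention:
    """Illustrative stand-in for a multi-head latent attention implementation."""

    def name(self) -> str:
        return "mla"


class MHAttention:
    """Illustrative stand-in for a standard multi-head attention implementation."""

    def name(self) -> str:
        return "mha"


def build_attention(use_mla: bool):
    # One model class can branch on the config flag instead of requiring a
    # separate architecture for the non-MLA case.
    return MLAAttention() if use_mla else MHAttention()
```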
Currently `DeepseekForCausalLM` is hard-coded to use the Triton version of `fused_experts` and `fused_topk` with the all-reduce path of the MoE implementation, which fails to utilize `SharedFusedMoE`, which supports `CustomOp`. So if there is a reason that `DeepseekForCausalLM` must be used, I suggest upgrading it to reuse `SharedFusedMoE` too.
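For context, here is a plain-Python sketch of what top-k expert routing (the job `fused_topk` performs in a fused kernel) computes: softmax over router logits, keep the k highest-weight experts, renormalize. This is illustrative only, not vLLM's implementation:

```python
import math


def topk_route(logits: list[float], k: int) -> tuple[list[int], list[float]]:
    """Pick the k highest-scoring experts and renormalize their softmax weights."""
    # Numerically stable softmax over the router logits.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Indices of the k largest routing probabilities.
    idx = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    # Renormalize so the selected experts' weights sum to 1.
    denom = sum(probs[i] for i in idx)
    weights = [probs[i] / denom for i in idx]
    return idx, weights
```

A fused kernel does this per token in one pass; the point of `SharedFusedMoE` in the discussion above is that the routing and expert computation go through a `CustomOp`-compatible path rather than a hard-coded Triton one.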
I believe this was needed back when our DSv2 implementation didn't support non-MLA (you can see the same code in dsvl2):
vllm/vllm/model_executor/models/deepseek_vl2.py, lines 403 to 409 in ab3e800:

```python
if self.text_config.topk_method == "noaux_tc":
    architectures = ["DeepseekV3ForCausalLM"]
elif not self.text_config.use_mla:
    architectures = ["DeepseekForCausalLM"]
else:
    architectures = ["DeepseekV2ForCausalLM"]
```
But you're right, this should no longer be needed.
Great work! I have tested it.
Purpose
Test Plan
Test Result
Essential Elements of an Effective PR Description Checklist
`supported_models.md` and `examples` for a new model.