Update speculator config & converter to support hidden states indexing #142

shanjiaz · 2025-09-29T21:11:10Z

Changes:

Added support for optional arguments eagle_aux_hidden_state_layer_ids and inference_type.
Added more robust logic for target_vocab_size. We default on using "t2d" length, if not available, load the config file of verifier model, recursively search the dict for vocab_size. (The search is needed for nested dict. e.g. target_config_dict["text_config"]["vocab_size"] )
Removed tests for adding verifier embeddings as it's handled on the vllm side now.
Removed forward pass tests since forward function is defined on the vllm side.

Command used:

speculators convert nvidia/Llama-4-Maverick-17B-128E-Eagle3 \
  --algorithm eagle3 \
  --verifier RedHatAI/Llama-4-Maverick-17B-128E-Instruct-quantized.w4a16 \
  --output-path Llama4-Maverick-Eagle3-Speculators \
  --validate-device cuda:0 \
  --algorithm-kwargs '{"eagle_aux_hidden_state_layer_ids": [1,23,44], "inference_type": "text"}'

Converted checkpoint:

shanjiaz/Llama4-Maverick-Eagle3-Speculators-converted

Signed-off-by: shanjiaz <[email protected]>

github-actions · 2025-09-29T21:13:35Z

📦 Build Artifacts Available
The build artifacts (`.whl` and `.tar.gz`) have been successfully generated and are available for download: https://github.com/vllm-project/speculators/actions/runs/18381300407/artifacts/4227990755.
They will be retained for up to 30 days.
Commit: 1f84913

Signed-off-by: shanjiaz <[email protected]>

…culators into hz-update-config

Signed-off-by: shanjiaz <[email protected]>

src/speculators/models/eagle3.py

src/speculators/convert/eagle/eagle3_converter.py

examples/convert/eagle3/apply_eagle3_llama4_maverick.sh

Signed-off-by: shanjiaz <[email protected]>

rahul-tuli

The PR looks good, but now since we are removing the forward pass through the model, does it still make sense to keep the --validate/ --validate-device arguments?

src/speculators/convert/eagle/eagle3_converter.py

examples/convert/eagle3/apply_eagle3_llama4_maverick.sh

Signed-off-by: shanjiaz <[email protected]>

fynnsu

Added a couple comments

src/speculators/models/eagle3.py

Signed-off-by: shanjiaz <[email protected]>

rahul-tuli

few questions/nits which can be addressed in a follow up, good work on this, LGTM once we raise the NotImplementedError for forward passes

src/speculators/models/eagle3.py

src/speculators/convert/eagle/eagle3_converter.py

Signed-off-by: shanjiaz <[email protected]>

rahul-tuli

LGTM!

dsikka

Do we have test cases for multiple decoder layers?

Signed-off-by: shanjiaz <[email protected]>

fynnsu

One question below which might require a fix.

src/speculators/convert/eagle/eagle3_converter.py

Signed-off-by: shanjiaz <[email protected]>

rahul-tuli · 2025-10-09T14:46:49Z

LGTM pending quality!

Signed-off-by: shanjiaz <[email protected]>

rahul-tuli

LGTM!

Added tests and review has been addressed.

shanjiaz added 2 commits September 29, 2025 17:09

update speculator config & converter to support new models

969e87a

Signed-off-by: shanjiaz <[email protected]>

minimal change

5966d6e

Signed-off-by: shanjiaz <[email protected]>

shanjiaz and others added 4 commits September 30, 2025 11:34

fix pre-commit

1db8a42

Signed-off-by: shanjiaz <[email protected]>

end of lline and type fix

1cc670d

Signed-off-by: shanjiaz <[email protected]>

type update

ce4a2c3

Signed-off-by: shanjiaz <[email protected]>

Merge branch 'main' into hz-update-config

76daa73

shanjiaz marked this pull request as ready for review October 1, 2025 01:28

shanjiaz added 2 commits October 2, 2025 11:53

remove unused t2d and d2t references

1d2b10c

Signed-off-by: shanjiaz <[email protected]>

Merge branch 'hz-update-config' of https://github.com/neuralmagic/spe…

6948c4a

…culators into hz-update-config

shanjiaz requested review from dsikka, fynnsu and rahul-tuli October 2, 2025 15:56

shanjiaz added 3 commits October 2, 2025 15:27

make t2d and d2t optional

cc52456

Signed-off-by: shanjiaz <[email protected]>

removed unused functions

0ffbddc

Signed-off-by: shanjiaz <[email protected]>

removed unused imports

1d2494c

Signed-off-by: shanjiaz <[email protected]>

rahul-tuli requested changes Oct 3, 2025

View reviewed changes

shanjiaz added 8 commits October 3, 2025 09:32

remove inference type

53ee14a

Signed-off-by: shanjiaz <[email protected]>

fix examples

58b83dd

Signed-off-by: shanjiaz <[email protected]>

fix examples

1a650a1

Signed-off-by: shanjiaz <[email protected]>

minimum change

2d575be

Signed-off-by: shanjiaz <[email protected]>

make embed_tokens optional

a7537b2

Signed-off-by: shanjiaz <[email protected]>

fix test

040b5d4

Signed-off-by: shanjiaz <[email protected]>

fix references

51d29d6

Signed-off-by: shanjiaz <[email protected]>

quality

c4cfbb6

Signed-off-by: shanjiaz <[email protected]>

shanjiaz requested a review from rahul-tuli October 3, 2025 17:01

rahul-tuli reviewed Oct 6, 2025

View reviewed changes

src/speculators/convert/eagle/eagle3_converter.py Show resolved Hide resolved

rahul-tuli reviewed Oct 6, 2025

View reviewed changes

src/speculators/convert/eagle/eagle3_converter.py Outdated Show resolved Hide resolved

rahul-tuli reviewed Oct 6, 2025

View reviewed changes

examples/convert/eagle3/apply_eagle3_llama4_maverick.sh Show resolved Hide resolved

remove convert weights function

059b6eb

Signed-off-by: shanjiaz <[email protected]>

shanjiaz requested a review from rahul-tuli October 6, 2025 15:12

fynnsu previously approved these changes Oct 6, 2025

View reviewed changes

src/speculators/models/eagle3.py Outdated Show resolved Hide resolved

src/speculators/models/eagle3.py Show resolved Hide resolved

added doc-string and renamed variable

c8fcc5e

Signed-off-by: shanjiaz <[email protected]>

shanjiaz dismissed fynnsu’s stale review via c8fcc5e October 6, 2025 17:16

rahul-tuli previously approved these changes Oct 7, 2025

View reviewed changes

src/speculators/models/eagle3.py Show resolved Hide resolved

src/speculators/convert/eagle/eagle3_converter.py Outdated Show resolved Hide resolved

src/speculators/convert/eagle/eagle3_converter.py Show resolved Hide resolved

raise notImplemented error

e018cab

Signed-off-by: shanjiaz <[email protected]>

shanjiaz dismissed rahul-tuli’s stale review via e018cab October 7, 2025 13:17

shanjiaz requested review from fynnsu and rahul-tuli October 7, 2025 14:42

rahul-tuli previously approved these changes Oct 7, 2025

View reviewed changes

dsikka previously requested changes Oct 7, 2025

View reviewed changes

Add unit tests

82dbcc1

Signed-off-by: shanjiaz <[email protected]>

shanjiaz dismissed rahul-tuli’s stale review via 82dbcc1 October 7, 2025 18:18

fix precommit

c4e5bd7

Signed-off-by: shanjiaz <[email protected]>

shanjiaz requested review from dsikka and rahul-tuli October 8, 2025 16:03

Merge branch 'main' into hz-update-config

164fb99

fynnsu previously approved these changes Oct 8, 2025

View reviewed changes

src/speculators/convert/eagle/eagle3_converter.py Outdated Show resolved Hide resolved

raise error

604b937

Signed-off-by: shanjiaz <[email protected]>

shanjiaz dismissed fynnsu’s stale review via 604b937 October 9, 2025 14:34

fix precommit

1f84913

Signed-off-by: shanjiaz <[email protected]>

shanjiaz requested a review from fynnsu October 9, 2025 16:26

rahul-tuli approved these changes Oct 9, 2025

View reviewed changes

fynnsu approved these changes Oct 9, 2025

View reviewed changes

shanjiaz merged commit 8af566f into main Oct 9, 2025
12 checks passed

shanjiaz deleted the hz-update-config branch October 9, 2025 19:44

Update speculator config & converter to support hidden states indexing #142

Update speculator config & converter to support hidden states indexing #142

Uh oh!

Conversation

shanjiaz commented Sep 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes:

Command used:

Converted checkpoint:

Uh oh!

github-actions bot commented Sep 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rahul-tuli left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

fynnsu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

rahul-tuli left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rahul-tuli left a comment

Choose a reason for hiding this comment

Uh oh!

dsikka left a comment

Choose a reason for hiding this comment

Uh oh!

fynnsu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rahul-tuli commented Oct 9, 2025

Uh oh!

rahul-tuli left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

shanjiaz commented Sep 29, 2025 •

edited

Loading

github-actions bot commented Sep 29, 2025 •

edited

Loading