Add Cache Miss/Hit Test #1

alex-jw-brooks · 2025-07-30T21:05:35Z

Same as foundation-model-stack#97 on top of foundation-model-stack#93 (don't have permission to make a branch in the upstream repo yet.

This PR builds on top of foundation-model-stack#20 and foundation-model-stack#93 to add a cache for testing using the refactored version of the test to allow some code reuse. foundation-model-stack#93 should probably be merged first (splitting this out for readability).

Summary of changes (wrt the original cache test PR)
- Makes sure gptq kwargs are passed through to the AIU model
- Makes sure options={"sendnn.dynamic": COMPILE_DYNAMIC_SENDNN} is passed consistently
- Clears the torch sendnn .cache - the current PR can break if the cache test runs second since the cache paths aren't actually reset in torch sendnn. We reset the compiler settings and clear the directory, but don't clear the spyre cache object in the current PR, which causes alignment issues if the cache test doesn't run first
- The current PR runs the check as two tests (cache miss -> cache hit); moves the cache miss test to run as a fixture to set things up so that we can just run cache hit as the test

Note that there is still some weirdness around how micro models are handled, mostly due to the way we configure common models paths / micro model usage and also check thresholds based on whether micro models exist.

…_default_validation_prefix, enable sample_key Signed-off-by: kcirred <[email protected]>

…uad_v2 sampler Signed-off-by: kcirred <[email protected]>

Signed-off-by: kcirred <[email protected]>

…oo long Signed-off-by: kcirred <[email protected]>

Signed-off-by: kcirred <[email protected]>

…m // model.config.nheads ) Signed-off-by: Rashed Z. Bhatti, PhD <[email protected]>

…g.emb_dim // model.config.nheads )" This reverts commit b3d0f9b.

Signed-off-by: kcirred <[email protected]>

[dpp] store enforce_sizes in log name and added generic kwargs to get_default_validation_prefix

… as modeling code changed Signed-off-by: Joshua Rosenkranz <[email protected]>

…tack/update_llama_model_expectation update llama model expectations tests

…le names now sorted, testing of file names modified for new order Signed-off-by: kcirred <[email protected]>

…rite Prefix rewrite

Signed-off-by: Joshua Rosenkranz <[email protected]>

…tack/fix_test_scripts_assertions fixed test_scripts program assertions

…_cache_refactor Refactor Decoder Tests

Signed-off-by: Alex-Brooks <[email protected]>

alex-jw-brooks changed the title ~~Add cache tests back~~ Add Cache Miss/Hit Test Jul 30, 2025

alex-jw-brooks force-pushed the test_cache_refactor branch 2 times, most recently from b6e36d4 to d2551b9 Compare August 12, 2025 13:56

alex-jw-brooks force-pushed the test_cache_refactor branch 2 times, most recently from 2e42e7c to 1e369b2 Compare September 11, 2025 12:10

kcirred added 3 commits September 30, 2025 15:24

[dpp] store enforce_sizes in log name and added generic kwargs to get…

051c18f

…_default_validation_prefix, enable sample_key Signed-off-by: kcirred <[email protected]>

[utils] added doc string, refactor sample_key, added return_key to sq…

f3613ba

…uad_v2 sampler Signed-off-by: kcirred <[email protected]>

[dpp/validation] restore sample_key in logic after rebase of main

2f69fa3

Signed-off-by: kcirred <[email protected]>

alex-jw-brooks force-pushed the test_cache_refactor branch from 1e369b2 to dd355c1 Compare October 3, 2025 13:24

kcirred and others added 14 commits October 3, 2025 17:56

[validation] Modified final file string to hash due to OSError name t…

34587fe

…oo long Signed-off-by: kcirred <[email protected]>

[test_validation] remove unused line

a4a89b9

Signed-off-by: kcirred <[email protected]>

[validation] removed enforce_sizes from find_validation_info_path

525b006

Signed-off-by: kcirred <[email protected]>

paged head_size getattr(model.config, "head_dim", model.config.emb_di…

b3d0f9b

…m // model.config.nheads ) Signed-off-by: Rashed Z. Bhatti, PhD <[email protected]>

Revert "paged head_size getattr(model.config, "head_dim", model.confi…

e56c686

…g.emb_dim // model.config.nheads )" This reverts commit b3d0f9b.

[dpp] added handling of return_key for __custom_line_sampler

1eac366

Signed-off-by: kcirred <[email protected]>

Merge pull request foundation-model-stack#136 from kcirred/enforce_log

992e612

[dpp] store enforce_sizes in log name and added generic kwargs to get_default_validation_prefix

updated llama model expectation tests using v1.0.0 aiu software stack…

2b3028f

… as modeling code changed Signed-off-by: Joshua Rosenkranz <[email protected]>

Merge pull request foundation-model-stack#150 from foundation-model-s…

abe35d3

…tack/update_llama_model_expectation update llama model expectations tests

[testing] changed get_default_validation_prefix to generic kwargs, fi…

fcf950f

…le names now sorted, testing of file names modified for new order Signed-off-by: kcirred <[email protected]>

Merge pull request foundation-model-stack#148 from kcirred/prefix_rew…

99e6bd1

…rite Prefix rewrite

fixed test_scripts program assertion

adc276e

Signed-off-by: Joshua Rosenkranz <[email protected]>

Merge pull request foundation-model-stack#151 from foundation-model-s…

f6c9a8b

…tack/fix_test_scripts_assertions fixed test_scripts program assertions

Merge pull request foundation-model-stack#93 from alex-jw-brooks/test…

281ff22

…_cache_refactor Refactor Decoder Tests

alex-jw-brooks force-pushed the rebased_cache_tests branch from ad3073c to fa5bd38 Compare October 13, 2025 09:43

alex-jw-brooks added 6 commits October 13, 2025 09:44

Add cache test

18bbf01

Signed-off-by: Alex-Brooks <[email protected]>

use tmp_path fixture for cache test

42305bb

Signed-off-by: Alex-Brooks <[email protected]>

fix cache_dir in cache checks

fe8c61f

Signed-off-by: Alex-Brooks <[email protected]>

only warmup on cache tests

630fe39

Signed-off-by: Alex-Brooks <[email protected]>

parametrize use_cache

fd1c20f

Signed-off-by: Alex-Brooks <[email protected]>

use request param for setting up use_cache

1566583

Signed-off-by: Alex-Brooks <[email protected]>

alex-jw-brooks force-pushed the rebased_cache_tests branch from fa5bd38 to 1566583 Compare October 13, 2025 09:46

alex-jw-brooks added 2 commits October 26, 2025 11:57

reuse aiu/cpu models from cache miss fixture

c33072e

Signed-off-by: Alex-Brooks <[email protected]>

remove duplicate code

b8181ac

Signed-off-by: Alex-Brooks <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Cache Miss/Hit Test #1

Add Cache Miss/Hit Test #1

Uh oh!

alex-jw-brooks commented Jul 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Add Cache Miss/Hit Test #1

Are you sure you want to change the base?

Add Cache Miss/Hit Test #1

Uh oh!

Conversation

alex-jw-brooks commented Jul 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants