Skip to content

Conversation

@alex-jw-brooks
Copy link
Owner

Same as foundation-model-stack#97 on top of foundation-model-stack#93 (don't have permission to make a branch in the upstream repo yet.


This PR builds on top of foundation-model-stack#20 and foundation-model-stack#93 to add a cache for testing using the refactored version of the test to allow some code reuse. foundation-model-stack#93 should probably be merged first (splitting this out for readability).

Summary of changes (wrt the original cache test PR)
- Makes sure gptq kwargs are passed through to the AIU model
- Makes sure options={"sendnn.dynamic": COMPILE_DYNAMIC_SENDNN} is passed consistently
- Clears the torch sendnn .cache - the current PR can break if the cache test runs second since the cache paths aren't actually reset in torch sendnn. We reset the compiler settings and clear the directory, but don't clear the spyre cache object in the current PR, which causes alignment issues if the cache test doesn't run first
- The current PR runs the check as two tests (cache miss -> cache hit); moves the cache miss test to run as a fixture to set things up so that we can just run cache hit as the test

Note that there is still some weirdness around how micro models are handled, mostly due to the way we configure common models paths / micro model usage and also check thresholds based on whether micro models exist.

@alex-jw-brooks alex-jw-brooks changed the title Add cache tests back Add Cache Miss/Hit Test Jul 30, 2025
@alex-jw-brooks alex-jw-brooks force-pushed the test_cache_refactor branch 2 times, most recently from b6e36d4 to d2551b9 Compare August 12, 2025 13:56
@alex-jw-brooks alex-jw-brooks force-pushed the test_cache_refactor branch 2 times, most recently from 2e42e7c to 1e369b2 Compare September 11, 2025 12:10
kcirred and others added 14 commits October 3, 2025 17:56
…m // model.config.nheads )

Signed-off-by: Rashed Z. Bhatti, PhD <[email protected]>
…g.emb_dim // model.config.nheads )"

This reverts commit b3d0f9b.
[dpp] store enforce_sizes in log name and added generic kwargs to get_default_validation_prefix
… as modeling code changed

Signed-off-by: Joshua Rosenkranz <[email protected]>
…tack/update_llama_model_expectation

update llama model expectations tests
…le names now sorted, testing of file names modified for new order

Signed-off-by: kcirred <[email protected]>
Signed-off-by: Joshua Rosenkranz <[email protected]>
…tack/fix_test_scripts_assertions

fixed test_scripts program assertions
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants