Add test case for cache #20

avery-blanchard · 2025-04-08T20:13:42Z

This PR adds a test case for caching.

The test case runs twice with a single shape. The caching feature is enabled during the test and disabled after on reset.

cc: @JRosenkranz

JRosenkranz · 2025-04-14T13:16:47Z

tests/models/test_decoders.py

@@ -287,5 +289,56 @@ def _metric_calculator(r: torch.Tensor, t: torch.Tensor):
    else:
        print("passed validation level 0")

+@pytest.mark.parametrize("model_path,batch_size,seq_length,max_new_tokens,cache_status", cache_params)
+def test_cache(model_path, batch_size, seq_length, max_new_tokens, cache_status):


I don't think we need to parametrize everything here other than the miss and hit. Just specify them inside the test.

JRosenkranz · 2025-04-14T13:18:56Z

tests/models/test_decoders.py

+        None,
+        only_last_token=True,
+        **padding_kwargs
+    )


Is there something we need to assert for miss/hit?

JRosenkranz

We may want to create a fixture which turns caching on, and caches the compilation. Then in the test we can re-run with caching on/off to make sure we see a hit or no-cache used

Signed-off-by: Avery Blanchard <[email protected]>

avery-blanchard force-pushed the cache-test branch from 9a85871 to 0e07855 Compare April 8, 2025 20:17

JRosenkranz reviewed Apr 14, 2025

View reviewed changes

Add test case for caching

87b3ac8

Signed-off-by: Avery Blanchard <[email protected]>

avery-blanchard force-pushed the cache-test branch from 0e07855 to 87b3ac8 Compare July 18, 2025 16:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add test case for cache #20

Add test case for cache #20

Uh oh!

avery-blanchard commented Apr 8, 2025 •

edited

Loading

Uh oh!

JRosenkranz Apr 14, 2025

Uh oh!

JRosenkranz Apr 14, 2025

Uh oh!

JRosenkranz left a comment

Uh oh!

Uh oh!

Add test case for cache #20

Are you sure you want to change the base?

Add test case for cache #20

Uh oh!

Conversation

avery-blanchard commented Apr 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

JRosenkranz Apr 14, 2025

Choose a reason for hiding this comment

Uh oh!

JRosenkranz Apr 14, 2025

Choose a reason for hiding this comment

Uh oh!

JRosenkranz left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

avery-blanchard commented Apr 8, 2025 •

edited

Loading