Fix parameter name for values tensor in run_scaled_dot_product_attention #35

NiccoloSacchi · 2025-09-15T20:52:42Z

I believe this dimension should be the same in V and K: shouldn't the V matrix contain the value embedding for each key?

I believe this dimension should be the same in V and K: the V matrix must contain the value embedding for each key.

marcelroed · 2025-09-15T22:58:31Z

Yep, good catch, not sure how that slipped through. Will fix this on our end and update this repo before I close the issue.

NiccoloSacchi · 2025-09-17T18:34:24Z

tests/adapters.py

    K: Float[Tensor, " ... keys d_k"],
-    V: Float[Tensor, " ... values d_v"],
+    V: Float[Tensor, " ... keys d_v"],
    mask: Bool[Tensor, " ... queries keys"] | None = None,


Also, shouldn't mask be of size "queries keys" instead of "... queries keys"?

The masks can differ for each batch element, so ... is correct.

Ah, I considered that but thought not really necessary (thought either all batch samples attend to everything or all batch samples only attend to past tokens). Anyway, thanks for confirming! :)

Fix parameter name for values tensor in run_scaled_dot_product_attention

b38a366

I believe this dimension should be the same in V and K: the V matrix must contain the value embedding for each key.

NiccoloSacchi commented Sep 17, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix parameter name for values tensor in run_scaled_dot_product_attention #35

Fix parameter name for values tensor in run_scaled_dot_product_attention #35

Uh oh!

NiccoloSacchi commented Sep 15, 2025 •

edited

Loading

Uh oh!

marcelroed commented Sep 15, 2025

Uh oh!

NiccoloSacchi Sep 17, 2025

Uh oh!

marcelroed Sep 17, 2025

Uh oh!

NiccoloSacchi Sep 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix parameter name for values tensor in run_scaled_dot_product_attention #35

Are you sure you want to change the base?

Fix parameter name for values tensor in run_scaled_dot_product_attention #35

Uh oh!

Conversation

NiccoloSacchi commented Sep 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

marcelroed commented Sep 15, 2025

Uh oh!

NiccoloSacchi Sep 17, 2025

Choose a reason for hiding this comment

Uh oh!

marcelroed Sep 17, 2025

Choose a reason for hiding this comment

Uh oh!

NiccoloSacchi Sep 17, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

NiccoloSacchi commented Sep 15, 2025 •

edited

Loading