
Conversation

@hazemessamm commented Sep 18, 2025

Hi, thanks for the great work!

I was working with some evaluation models that use the ProtT5 and Ankh PLMs. These models do not have a BOS/CLS token; they only have an EOS (`</s>`) token, so I handled their case so that they work properly with EvoProtGrad.

Decoded example for both ProtT5 and Ankh: `MQMLKMGLV</s>`
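For context, a minimal sketch of that tokenizer behavior (the checkpoint name is just for illustration; Ankh's tokenizer behaves the same way):

```python
from transformers import T5Tokenizer

# ProtT5-style tokenizers append an EOS token but prepend nothing.
tokenizer = T5Tokenizer.from_pretrained("Rostlab/prot_t5_xl_uniref50")

# ProtT5 expects space-separated residues.
ids = tokenizer("M Q M L K M G L V").input_ids
print(tokenizer.convert_ids_to_tokens(ids))
# e.g. ['▁M', '▁Q', '▁M', '▁L', '▁K', '▁M', '▁G', '▁L', '▁V', '</s>']
# -> one extra position at the end, and no BOS/CLS at the front
```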

```python
# This checks whether the gradient sequence length
# is exactly one more than the input sequence length.
elif oh_grad.shape[1] == self.chains_oh.shape[1] + 1:
    oh_grad = oh_grad[:, :-1]
```
Collaborator commented on this diff:

I noticed that we can't guarantee that the last element of the gradient sequence is what should be removed (why not the first element? a tokenizer that prepends its special token instead would also satisfy this length check).

@pemami4911 (Collaborator) commented

Hi, thank you for this pull request!

Having this logic (for instances with BOS and EOS tokens) in `_compute_gradients` might be problematic: https://github.com/NREL/EvoProtGrad/pull/16/files#r2442682916

I'm thinking that we could move this logic out of `_compute_gradients` and into the expert's code, so that each expert can handle the removal of its own special tokens (BOS, EOS, etc.). This could be done in the Protein LM expert's `__call__` function.

I think we would also need to add custom expert code for T5/Ankh, though!
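Something like this rough sketch is what I have in mind (the `has_bos`/`has_eos` flags and the helper name below are hypothetical, not EvoProtGrad's current API):

```python
import torch

class T5LikeExpert:
    """Hypothetical expert for models with T5/Ankh-style tokenizers."""

    has_bos = False  # ProtT5/Ankh prepend no BOS/CLS token...
    has_eos = True   # ...but do append an EOS token

    def _trim_special_token_grads(self, oh_grad: torch.Tensor) -> torch.Tensor:
        # Drop the gradient positions for whichever special tokens this
        # expert's tokenizer adds, so the gradient lines up with the
        # raw protein sequence.
        if self.has_bos:
            oh_grad = oh_grad[:, 1:]
        if self.has_eos:
            oh_grad = oh_grad[:, :-1]
        return oh_grad

    def __call__(self, inputs):
        # ...compute oh_grad with the wrapped PLM as usual, then:
        # oh_grad = self._trim_special_token_grads(oh_grad)
        ...
```

That way `_compute_gradients` never has to guess which end of the sequence to trim.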
