Labels: docs (Documentation related), needs triage (Waiting to be triaged by maintainers)
Description
📚 Documentation
Hi,
In src/lightning/pytorch/demos/transformer.py, an encoder-decoder transformer is used for next-token prediction. However, as in the conventional encoder-decoder setup, the model sees the entire src because no src mask is applied, and the prediction target is simply the source shifted right by one token. Wouldn't this leak future information, since the model can just copy the (i+1)-th token of the src when predicting the i-th token of the target? A minimal sketch of the concern follows.
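To make the concern concrete, here is a minimal sketch of the setup being described. This is not the demo's actual code: the names, sizes, and plain `nn.Transformer` usage are assumptions chosen to mirror the pattern (causal mask on the decoder only, full source visible, target = source shifted by one).

```python
import torch
from torch import nn

# Illustrative "language modeling" pair: tgt is src shifted right by one
# token, so tgt[i] == src[i + 1]. (Assumed shapes, not the demo's code.)
vocab, d_model, t = 1000, 32, 8
data = torch.randint(0, vocab, (1, t + 1))  # (batch, time) token ids
src, tgt = data[:, :-1], data[:, 1:]

emb = nn.Embedding(vocab, d_model)
model = nn.Transformer(d_model=d_model, nhead=4, batch_first=True)

# Causal mask on the decoder's self-attention only -- the conventional setup.
tgt_mask = model.generate_square_subsequent_mask(t)

# No src_mask / memory_mask: encoder self-attention and decoder
# cross-attention attend over the whole source. Encoder position i + 1
# encodes src[i + 1], which is exactly the label tgt[i], so decoder step i
# can copy its answer out of the encoder memory instead of predicting it.
out = model(emb(src), emb(tgt), tgt_mask=tgt_mask)
print(out.shape)  # (1, 8, 32)
```

If the analysis above is right, passing a causal `src_mask` (and a matching `memory_mask` for cross-attention), or switching the demo to a decoder-only setup, would close the leak.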