Mask loss on padded actions by as821 · Pull Request #894 · Physical-Intelligence/openpi

as821 · 2026-03-03T22:15:53Z

LeRobotDataset pads action sequences at the end of an episode out to the action horizon length using the final action in the episode. If the loss from these padded actions is applied, the policy may get stuck at the place where demonstrations typically terminate as discussed in #681.

This PR masks the flow matching loss from these padded actions in both the JAX and PyTorch implementations. DataLoaderImpl now yields a third element: action_pad_mask (or None if absent) when the policy transforms (repack, data, model transforms) pass through the “actions_is_pad” key from LeRobotDataset. LeRobotLiberoDataConfig and LiberoInputs are updated as an example.

Armstrong added 4 commits March 3, 2026 12:03

mask loss on padded actions

63e90b7

formatting

3ae80f6

padded action masking for pytorch

e29848d

handle none

7cc30a8

as821 requested review from Michael-Equi, jimmyt857 and kvablack as code owners March 3, 2026 22:15

jimmyt857 removed their request for review March 3, 2026 22:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mask loss on padded actions#894

Mask loss on padded actions#894
as821 wants to merge 4 commits intoPhysical-Intelligence:mainfrom
as821:action_padding

as821 commented Mar 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

as821 commented Mar 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant