Skip to content

Mask loss on padded actions#894

Open
as821 wants to merge 4 commits intoPhysical-Intelligence:mainfrom
as821:action_padding
Open

Mask loss on padded actions#894
as821 wants to merge 4 commits intoPhysical-Intelligence:mainfrom
as821:action_padding

Conversation

@as821
Copy link

@as821 as821 commented Mar 3, 2026

LeRobotDataset pads action sequences at the end of an episode out to the action horizon length using the final action in the episode. If the loss from these padded actions is applied, the policy may get stuck at the place where demonstrations typically terminate as discussed in #681.

This PR masks the flow matching loss from these padded actions in both the JAX and PyTorch implementations. DataLoaderImpl now yields a third element: action_pad_mask (or None if absent) when the policy transforms (repack, data, model transforms) pass through the “actions_is_pad” key from LeRobotDataset. LeRobotLiberoDataConfig and LiberoInputs are updated as an example.

@jimmyt857 jimmyt857 removed their request for review March 3, 2026 22:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant