implement contrastive loss #365

tashapais · 2025-09-26T23:42:26Z

I've successfully implemented contrastive loss for PufferLib with the following key components:

Core Implementation (pufferlib/contrastive_loss.py):

ContrastiveLoss class with InfoNCE loss using geometric future sampling
Functional interface compute_contrastive_loss_pufferlib() for easy integration
Proper handling of episode boundaries using terminal/truncation flags
Metrics tracking for logging (positive/negative similarities, number of pairs, etc.)

Integration Example (pufferlib/pufferl_with_contrastive.py):

Shows how to extend PufferLib's training loop to include contrastive loss
Demonstrates proper embedding extraction and loss computation
Maintains compatibility with existing PPO losses

Key Features:

✅ Samples random (st, at) pairs from replay buffer
✅ Creates positive examples sf^(1) using geometric distribution Δ ~ GEOM(1-γ)
✅ Generates negatives by shuffling future states from other trajectories
✅ Uses unnormalized representations as specified
✅ Integrates seamlessly with PufferLib's architecture
✅ Tested and working with synthetic data

The implementation is ready to use - you can integrate it into your training by importing the contrastive loss
function and adding it to your training loop as shown in the integration example.

…example is pufferl_with_contrastive.py

you can integrate loss into your training by importing the function, …

b8d9455

…example is pufferl_with_contrastive.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

implement contrastive loss #365

implement contrastive loss #365

Uh oh!

tashapais commented Sep 26, 2025

Uh oh!

Uh oh!

implement contrastive loss #365

Are you sure you want to change the base?

implement contrastive loss #365

Uh oh!

Conversation

tashapais commented Sep 26, 2025

Uh oh!

Uh oh!