You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We find out the problem is related to KL computation that the policy and ref policy do not have image as input. However, the consequences are still blurring (kind of good in our experiments?), waiting for more tests.
Kindly remind the issue from R1-V here for someone spot the similar issue
Deep-Agent/R1-V#20
Sorry for making this mistake on our initial codebase. This may lead to our failed trial, as we
explained here:
The text was updated successfully, but these errors were encountered: