
Clamping in Affine Transform #253

Closed
vpratz opened this issue Nov 21, 2024 · 10 comments
Labels: discussion (Discuss a topic or question not necessarily with a clear output in mind.)

Comments

@vpratz (Collaborator) commented Nov 21, 2024

The changes in clamping have led to worse performance in the two moons test. Reverting the changes in eb5c446 and 7906c1e fixed this, but the reversion was not based on extensive testing.

@stefanradev93 Could you please evaluate which form is the best in practice and, in case the changes are the best in practical applications, adjust the test settings accordingly?
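For context, a common way to stabilize affine coupling transforms is to soft-clamp the predicted log-scale with a bounded function such as a scaled tanh, so that the multiplicative factor exp(log_scale) can neither vanish nor explode. The sketch below is hypothetical and not the actual BayesFlow implementation; the clamp value 1.9 and the function names are illustrative choices only.

```python
import numpy as np

def soft_clamp(log_scale, clamp=1.9):
    # Bounded reparameterization: output lies in (-clamp, clamp) and is
    # approximately the identity near zero, so exp() stays in a safe range.
    return clamp * np.tanh(log_scale / clamp)

def affine_forward(x, log_scale, shift):
    # Affine transform z = x * exp(s) + shift with a clamped log-scale s.
    s = soft_clamp(log_scale)
    z = x * np.exp(s) + shift
    log_det = np.sum(s)  # log |det J| of the elementwise affine map
    return z, log_det
```

Without such a clamp, a single large network output feeds directly into exp(), which is a typical source of NaN losses and NaN samples like those discussed below.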

@vpratz vpratz added the discussion Discuss a topic or question not necessarily with a clear output in mind. label Nov 21, 2024
@vpratz vpratz added the v2 label Nov 21, 2024
@stefanradev93 (Contributor)

That was also my experience, so I picked the settings that stabilized training on the regression and two moons examples (basically the old settings from 1.5x).

However, using my "stable" settings, @elseml got NaNs in his experiments. It seems that stability is currently a bit dependent on the test case, which needs to be solved.

My remaining suspicion is the residual net, which we previously didn't use.

@stefanradev93 (Contributor)

@vpratz @elseml
Could you test the clamping in eb5c446?
I also fixed the previously wrong position of the actnorm, so the arcsin^-1 may work. However, testing just the loss is not enough: the regression test bed was sometimes producing NaNs in the sampling results.
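For reference, actnorm (Glow-style activation normalization) initializes a per-feature shift and scale from the first batch so that activations start out standardized; its position relative to the coupling layers changes the range of inputs the scale network sees, which is why moving it can affect stability. A minimal standalone sketch, hypothetical and not the BayesFlow code:

```python
import numpy as np

class ActNorm:
    # Glow-style activation normalization with data-dependent initialization:
    # the first batch is mapped to (approximately) zero mean and unit variance
    # per feature; afterwards shift and log_scale would act as learnable
    # parameters in a real flow implementation.
    def __init__(self):
        self.initialized = False

    def forward(self, x):
        if not self.initialized:
            self.shift = -x.mean(axis=0)
            self.log_scale = -np.log(x.std(axis=0) + 1e-12)
            self.initialized = True
        return (x + self.shift) * np.exp(self.log_scale)
```

Placed before the coupling blocks, this keeps the inputs to the scale/shift networks in a moderate range from the first training step onward.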

@vpratz (Collaborator, Author) commented Nov 21, 2024

Thanks for the additional information. I just reran pytest tests/test_two_moons a few times on eb5c446 and now it passes for me locally. When I ran it earlier it failed, producing a value that was too high.

The problem with the NaNs seems to persist, see #254.

What would be your preferred configuration until we have figured this out? Using the code from eb5c446?

@stefanradev93 (Contributor)

I am on it.

@stefanradev93 (Contributor)

I have a setting that produced no NaNs in loss or samples over three runs across:

  • Two moons
  • SIR
  • Regression

I will push my commit, and @elseml and @vpratz can decide whether to keep it or not upon further testing.

@stefanradev93 (Contributor)

Let me know if you still observe some problems after my recent push.

@vpratz (Collaborator, Author) commented Nov 21, 2024

Thanks a lot! The tests look stable now. @elseml If you don't encounter problems, I think we can close this issue.

@elseml (Member) commented Nov 21, 2024

Quick timeline of my experiences (note that my setup is more prone to NaNs in general):

  • Commit aeee627 introduced major instabilities (NaNs in ~60% of runs)
  • Reverting its changes to affine_transform.py in commit eb5c446 fixed the issue
  • The changes to affine_transform.py in commit 7906c1e led to NaNs in ~10% of runs (quite hard to test due to their rare occurrence) and an extremely high loss at the beginning of training (larger by a factor of ~4000 compared to before), but not later on
  • I have encountered no NaNs since commit 3878625 (they cannot be ruled out due to their rare occurrence; I will update if that changes), and it fixed the exploding loss at the beginning of training

The issue can thus be closed for now from my side - we will have to see in the long run if the occasional instabilities in #254 were sufficiently addressed.

@vpratz (Collaborator, Author) commented Nov 21, 2024

Thanks, let's keep an eye on this and reopen the issue if needed.

@vpratz vpratz closed this as completed Nov 21, 2024
@github-project-automation github-project-automation bot moved this from Future to Done in bayesflow development Nov 21, 2024
@paul-buerkner (Contributor)

Great! Thank you all for helping to fix this issue!
