Fix Z_Normalization #551

philip30 · 2018-11-20T23:13:21Z

This is a very minor change to correct the policy gradient when calculating z_normalization. I think Rewards should be normalized not only per sequence but also per item in the minibatch. So, the number of items in a minibatch will really impact the learning behaviour of the policy gradient.

…gmentation

Philip Arthur and others added 4 commits November 13, 2018 14:19

Fixed the z-norm

fa8e7cf

Fixed calc

11c673e

Merge branch 'fix_segmentation' of github.com:neulab/xnmt into fix_se…

a81ca68

…gmentation

Removed todo

469ac03

philip30 merged commit 2f64d67 into master Nov 21, 2018

philip30 deleted the fix_segmentation branch November 21, 2018 01:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Z_Normalization #551

Fix Z_Normalization #551

philip30 commented Nov 20, 2018

Fix Z_Normalization #551

Fix Z_Normalization #551

Conversation

philip30 commented Nov 20, 2018