Implementation of loss function implementation. #50

Open
KennyWu opened this issue Nov 9, 2024 · 3 comments

KennyWu commented Nov 9, 2024

I am reading about the pretraining and fine-tuning loss function and am trying to understand how this loss is implemented:
[Image: loss function equation]

I think I have a general idea of the implementation, but I would like to confirm it in case I am missing any details.
My understanding is as follows:

1. For each variable, calculate the MAE loss.
2. Multiply each variable's loss by its variable-specific weight.
3. Sum the weighted losses separately for the surface variables and the atmospheric variables.
4. Multiply the surface and atmospheric losses by the alpha and beta weights, respectively.
5. Finally, sum the two losses and multiply by (dataset weight) / (total number of variables in the dataset).
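
For reference, the steps above can be written as a single expression. This is a sketch reconstructed from the description in this thread, not copied from the paper, so every symbol that does not appear elsewhere in the thread is an assumed name: $V_S$ and $V_A$ for the number of surface and atmospheric variables, $w_k^{S}$ for the surface variable weights, $\hat{S}^t$, $S^t$ and $\hat{A}^t$, $A^t$ for the surface and atmospheric parts of $\hat{X}^t$ and $X^t$, and $\gamma$ for the dataset weight. The MAE of a variable is taken as a mean over the $H \times W$ grid (and over the $C$ levels for atmospheric variables).

```latex
\mathcal{L}(\hat{X}^t, X^t) = \frac{\gamma}{V_S + V_A} \left[
  \alpha \sum_{k=1}^{V_S} \frac{w_k^{S}}{H \times W}
    \sum_{i=1}^{H} \sum_{j=1}^{W} \left| \hat{S}^t_{k,i,j} - S^t_{k,i,j} \right|
  + \beta \sum_{k=1}^{V_A} \frac{1}{C \times H \times W}
    \sum_{c=1}^{C} w_{k,c}^{A}
    \sum_{i=1}^{H} \sum_{j=1}^{W} \left| \hat{A}^t_{k,c,i,j} - A^t_{k,c,i,j} \right|
\right]
```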

Contributor

wesselb commented Dec 3, 2024

Hey @KennyWu! I think your interpretation of the equation is correct. :) Also don't forget the divisions by H x W (for the surface-level terms) and C x H x W (for the atmospheric terms), which are important for balancing the atmospheric and surface-level variables.
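
To make the steps and the two divisions concrete, here is a minimal NumPy sketch. It is not the repository's implementation: the function name `weighted_mae_loss`, the array layouts (`(V_S, H, W)` for surface fields, `(V_A, C, H, W)` for atmospheric fields), and all weight values in the usage lines are assumptions chosen for illustration.

```python
import numpy as np


def weighted_mae_loss(surf_pred, surf_true, atmos_pred, atmos_true,
                      w_surf, w_atmos, alpha, beta, gamma):
    """Weighted MAE over surface and atmospheric variables (illustrative sketch).

    Assumed shapes:
      surf_pred, surf_true:   (V_S, H, W)
      atmos_pred, atmos_true: (V_A, C, H, W)
      w_surf:  (V_S,)    per-variable surface weights
      w_atmos: (V_A, C)  per-variable, per-level atmospheric weights
    alpha, beta, gamma: surface weight, atmospheric weight, dataset weight.
    """
    v_s, h, w = surf_pred.shape
    v_a, c = atmos_pred.shape[:2]

    # Surface: absolute error summed over the grid, divided by H x W,
    # then weighted per variable and summed over variables.
    surf_abs = np.abs(surf_pred - surf_true).sum(axis=(1, 2))        # (V_S,)
    surf_term = np.sum(w_surf * surf_abs) / (h * w)

    # Atmospheric: absolute error summed over the grid for each level,
    # weighted per variable and level, divided by C x H x W.
    atmos_abs = np.abs(atmos_pred - atmos_true).sum(axis=(2, 3))     # (V_A, C)
    atmos_term = np.sum(w_atmos * atmos_abs) / (c * h * w)

    # Combine both groups and scale by dataset weight / total variable count.
    return gamma / (v_s + v_a) * (alpha * surf_term + beta * atmos_term)


# Usage with random data; all sizes and weights below are hypothetical.
rng = np.random.default_rng(0)
V_S, V_A, C, H, W = 4, 5, 13, 8, 16
loss = weighted_mae_loss(
    rng.normal(size=(V_S, H, W)), rng.normal(size=(V_S, H, W)),
    rng.normal(size=(V_A, C, H, W)), rng.normal(size=(V_A, C, H, W)),
    w_surf=np.ones(V_S), w_atmos=np.ones((V_A, C)),
    alpha=0.5, beta=1.0, gamma=1.0,
)
print(float(loss))
```

With unit weights and a constant absolute error of 1 everywhere, each variable contributes exactly 1 regardless of whether it has levels, which is the balancing effect of the two divisions.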

@firatozdemir

Hey @wesselb, I have a follow-up question. It looks like the loss is applied to unnormalised predictions, but the dynamic ranges of the different variables differ too much to be compensated for by the small range of the weights w_{k,c}^{A}.

Can you confirm that \hat{X}^t and X^t are not normalised?

Contributor

wesselb commented Jan 20, 2025

Hey @firatozdemir! The loss should be computed over the normalised observations. Otherwise you're completely right that the different magnitudes of the variables would be an issue.
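
A small sketch of why normalisation matters here. It assumes per-variable standardisation with a mean and standard deviation, which is one common choice and is not confirmed in this thread; the helper `normalise` and the statistics below (a temperature-like and a humidity-like variable) are hypothetical.

```python
import numpy as np


def normalise(x, mean, std):
    """Standardise x of shape (V, H, W) with per-variable mean and std of shape (V,)."""
    mean = np.asarray(mean, dtype=float)[:, None, None]
    std = np.asarray(std, dtype=float)[:, None, None]
    return (x - mean) / std


# Hypothetical statistics: variable 0 is temperature-like (K),
# variable 1 is specific-humidity-like (kg/kg).
mean = np.array([280.0, 0.005])
std = np.array([20.0, 0.004])

target = np.stack([np.full((2, 3), 285.0), np.full((2, 3), 0.006)])
# Give each variable an error of half a standard deviation.
pred = target + 0.5 * std[:, None, None]

# Per-variable MAE on raw values: magnitudes differ by orders of magnitude.
raw_mae = np.abs(pred - target).mean(axis=(1, 2))

# Per-variable MAE on normalised values: both errors are comparable.
norm_mae = np.abs(
    normalise(pred, mean, std) - normalise(target, mean, std)
).mean(axis=(1, 2))

print(raw_mae, norm_mae)
```

On raw values the temperature-like variable would dominate the sum unless the variable weights spanned several orders of magnitude; after normalisation, equally "bad" predictions contribute equally before weighting.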
