Add a user namelist for reduced output settings #498

JorgSchwinger · 2025-02-17T12:04:19Z

This PR add a file user_nl_blom_outvol_reduced to the directory cime_config. The purpose of this file is to provide a namelist with reduced output settings for the NorESM3 spin-up simulations or test simulations. The following general rules are applied:

no daily output
no output on isopycnic layers
no (HAMOCC) or minimal (BLOM) monthly 3d-output on levels
reduced annual 3d output

For convenience, and since it increases the output volume only little, I have repeated the output of 2d fields also in annual files.

@matsbn, @TomasTorsvik, @jmaerz , @milicak , @AleksiNummelin please review these output settings and let me know if you see the need for changes (please put a comment at the corresponding line).

@matsbn, one could further reduce the BLOM output by using 2-byte compressed output. After my experience this results in in ~20-30% smaller files after compression to netCDF4 and the accuracy is still good enough for most applications.

jmaerz · 2025-02-17T12:15:26Z

Hi @JorgSchwinger , is there any good reason to provide this as a user_nl_blom and not similar to/in line with #478?

JorgSchwinger · 2025-02-17T12:29:45Z

This was discussed here NorESMhub/noresm3_dev_simulations#88 and the conclusion was that it is easier to do it as user-mods_directory

jmaerz · 2025-02-17T14:26:35Z

Hi @JorgSchwinger , apologize my ignorance, but are user-mods equivalent to the source-mods directory? - if so, I really feel that this is a step backward rather than forward and I would advocate for introducing ONE xml-switch name at the NorESM level for let say CMIP7-spinup (and one for CMIP7-production runs or alike plus other switches needed) that switches on all components output via namelists. Admittedly, that's what I read when looking into issue NorESMhub/noresm3_dev_simulations#88 - the latter has never really conclusively decided to my understanding (and use-mods escaped my understanding...).

JorgSchwinger · 2025-02-17T15:45:48Z

user-mods are activated with an option when you create the case. The corresponding namelist files are maintained in cime (we could have a copy in BLOM's cime_config, as suggested by this PR, but that's not strictly necessary).

It is much easier, particularly for the other components that are not exclusively used for NorESM. If you don't agree with this solution, you should bring this up for the other components here NorESMhub/noresm3_dev_simulations#88

jmaerz · 2025-02-18T08:50:02Z

Hi @JorgSchwinger , in any case, many thanks for providing this - let's see, where NorESMhub/noresm3_dev_simulations#88 leads to. In the case that user-mods will be used, I would suggest i) to push this PR ONLY to NorESM (not to BLOM) and ii) potentially include your settings rather with a switch into the currently used namelist_definition_blom.xml-file.

JorgSchwinger · 2025-03-06T09:35:31Z

Hi all, regardless what we do about this, it would be good to review the content of the user_nl file. Particularly, @matsbn could you review the physical output setting, and @jmaerz could you make sure you are ok with the outputs for the extended N-cycle (and M4AGO).

@matsbn what do you think about using 2-byte compressed output also for BLOM as done for HAMOCC in this namelist? My experience is that the (compressed) files are significantly smaller.

Please let me know any changes.

JorgSchwinger · 2025-03-06T09:40:26Z

@jmaerz I am also totally fine with having a switch in env_run.xml, which then controls the namelist generation "online". To activate, this would then involve a separate step (an xmlchange command after create_case), but that would be probably also acceptable. I would not have the time right now to do this, though.

jmaerz · 2025-03-06T09:47:40Z

Hi @JorgSchwinger , thanks for picking up on this - I'll check this tmw. Other than that: if there is an easy possibility via usermods folder+script in BLOM/iHAMOCC, I would suggest to go that route and to put this eventually in the namelist_definition_blom.xml file. Once your current file is properly filled, we can easily translate it via a slightly modified version of the script that I sent you into the namelist_definition_blom.xml file (with a switch in the config_component.xml as done in #478 - essentially combining the two PRs).

JorgSchwinger · 2025-03-06T09:55:09Z

@jmaerz this would be possible, if cime would pick up user-mods-dirs on a component level (instead of or in addition to at the NorESM level), that is what you are saying, right? This would of course be a nice.

jmaerz · 2025-03-06T09:57:42Z

Hi @JorgSchwinger , that's at least, how I understood Matvey this morning. Let's see - I asked for advice in NorESMhub/noresm3_dev_simulations#88

matsbn · 2025-03-06T09:59:27Z

Hi all, regardless what we do about this, it would be good to review the content of the user_nl file. Particularly, @matsbn could you review the physical output setting, and @jmaerz could you make sure you are ok with the outputs for the extended N-cycle (and M4AGO).

@matsbn what do you think about using 2-byte compressed output also for BLOM as done for HAMOCC in this namelist? My experience is that the (compressed) files are significantly smaller.

Please let me know any changes.

I will have a look at this and sorry I was not able to attend to it sooner. I would like to keep some of the daily output BLOM output and a minimal set of output in model layers (LYR).

Using 2-byte output is not ideal for variables like salinity where the global value range is large, but the majority of values are within a narrow range (low accuracy for the bulk of the ocean volume). I felt that when we started to use the netCDF4 compression of output, the size benefit of 2-byte output was reduced (cannot recall numbers, though). Putting in automatic netCDF4 compression in the short term archiving process for NorESM3 (as we did it in NorESM2) was discussed at a recent NorESM tag team meeting. This gives a significant space saving and must be automatic, since experience show that users typically don't do this themselves afterwards.

JorgSchwinger · 2025-03-06T10:18:00Z

Using 2-byte output is not ideal for variables like salinity where the global value range is large, but the majority of values are within a narrow range (low accuracy for the bulk of the ocean volume). I felt that when we started to use the netCDF4 compression of output, the size benefit of 2-byte output was reduced (cannot recall numbers, though). Putting in automatic netCDF4 compression in the short term archiving process for NorESM3 (as we did it in NorESM2) was discussed at a recent NorESM tag team meeting. This gives a significant space saving and must be automatic, since experience show that users typically don't do this themselves afterwards.

Files are about 20-30% smaller with the 2-byte option (after compression, recently tested) - so it is not a huge saving (but still something...)

jmaerz · 2025-03-06T12:43:58Z

... maybe worth considering something like a spinup and a production run flag for output?

mvdebolskiy · 2025-03-07T09:58:00Z

make a directory: cime_config/usermods_dirs/reduced_output
mv cime_config/user_nl_blom_outvol_reduced cime_config/usermods_dirs/reduced_output/user_nl_blom
It will allow to have a usermods_dir in coupled model to pick this user_nl_blom up for coupled simulations by adding ../../../components/blomcime_config/usermods_dirs/reduced_output in cime_config/usermods_dirs/reduced_out_devsim/include_user_mods
See Add reduced output usermod for dev simulations NorESM#654

jmaerz · 2025-03-07T15:10:27Z