Skip to content

Handling missing records AssignPSUSamplingParameters #170

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
edvinf opened this issue Feb 4, 2025 · 2 comments · Fixed by #192
Closed

Handling missing records AssignPSUSamplingParameters #170

edvinf opened this issue Feb 4, 2025 · 2 comments · Fixed by #192
Assignees
Labels
analytical issues for new analytical estimation workflow enhancement New feature or request release1.6 Issues that must be solved before release of 1.6 unstableRepo proposed solution to issue is in the develop branch and the unstable repository

Comments

@edvinf
Copy link
Contributor

edvinf commented Feb 4, 2025

AssignPSUSamplingParameters halts with an error if data records are not provided for all PSUs that are sampled in the PSU-sampling parameters. This is meant to ensure that correct data-sets have been obtained.

Some complications arise, however with partial measurements. An example is catchlottery-sampling of Blue Whiting in 2023. Ages have been selectively obtained, and some hauls have no age-records. So, while the PSU has indeed been sampled, it would be reasonable to assume that it is missing for purposes of estimation in age-domains.

This is currently rather cumbersome to deal with. It would be preferable to have a mechanism in StoX, for assuming these as missing at random. One suggestion would be to have an option to AssignPSUSamplingParameters which allows one to assume any missing data-records as missing at random. With such an option, unwanted records could be filtered in StoXBiotic in usual ways.

If age-less records are left in, estimation for each PSU proceeds as usual, but some PSUs will have NA estimates for some needed parameters, so that population-estimation will halt with an error. The issue could potentially be addressed also at this stage.

A StoX project encountering the issue is archived in imr internal gitlab: https://git.imr.no/a5362/samplesize-sensitivity-commercial-sampling

@edvinf edvinf added analytical issues for new analytical estimation workflow enhancement New feature or request release1.6 Issues that must be solved before release of 1.6 labels Feb 4, 2025
@edvinf edvinf self-assigned this Feb 4, 2025
@edvinf
Copy link
Contributor Author

edvinf commented Feb 7, 2025

For now, updated guidance for a simplified approach is incorporated into documentation in (#177 )

@edvinf
Copy link
Contributor Author

edvinf commented Feb 17, 2025

This will probably be solved by removing missing serialnumbers from the sampling parameters, and issue a message or warning.

@edvinf edvinf linked a pull request Mar 17, 2025 that will close this issue
@edvinf edvinf added the unstableRepo proposed solution to issue is in the develop branch and the unstable repository label Mar 18, 2025
@edvinf edvinf closed this as completed Mar 18, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
analytical issues for new analytical estimation workflow enhancement New feature or request release1.6 Issues that must be solved before release of 1.6 unstableRepo proposed solution to issue is in the develop branch and the unstable repository
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant