Estimate final size of a dataset #9954

Answered by TomNicholas
uriii3 asked this question in General

we use this code to evaluate the estimated size

Is there a reason why you don't just use ds.nbytes?
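
For reference, a minimal sketch of what `ds.nbytes` reports. The dataset below is hypothetical (the variable name, shape, and dtype are assumptions for illustration, not the asker's data); `nbytes` gives the uncompressed size of the arrays, which serves as an upper-bound style baseline before any compression is applied.

```python
import numpy as np
import xarray as xr

# Hypothetical dataset: a single float64 variable of shape (100, 200),
# with no coordinate variables attached.
ds = xr.Dataset(
    {"temperature": (("x", "y"), np.zeros((100, 200), dtype="float64"))}
)

# Uncompressed size of all variables: 100 * 200 elements * 8 bytes each.
uncompressed_bytes = ds.nbytes
print(f"{uncompressed_bytes / 1e6:.2f} MB before any compression")
```

This number says nothing about the size of a compressed file on disk; it only describes the raw array data.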

we want to estimate the size but we are compressing the final file

Knowing the compression factor exactly without looking at all the data is impossible by definition, because the size of the final file depends entirely on the actual values in your arrays. If your arrays all contained the same value repeated over and over, any decent compression algorithm should compress that down to almost nothing. If your data contained random uncorrelated noise, lossless compression won't make it smaller at all (and may even make it bigger).
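
The two extremes described above can be illustrated with a small sketch. It uses `zlib` from the Python standard library as a stand-in lossless compressor, which is an assumption: the codec actually used for the final file may behave differently, and the helper name `compression_ratio` is made up for this example.

```python
import os
import zlib

N_BYTES = 1_000_000

# Extreme 1: the same value repeated over and over (all zero bytes).
constant = bytes(N_BYTES)
# Extreme 2: random, uncorrelated noise.
noise = os.urandom(N_BYTES)


def compression_ratio(raw: bytes, level: int = 6) -> float:
    """Return uncompressed size divided by zlib-compressed size."""
    return len(raw) / len(zlib.compress(raw, level))


ratio_constant = compression_ratio(constant)
ratio_noise = compression_ratio(noise)

print(f"constant data: ratio = {ratio_constant:.1f}")
print(f"random noise:  ratio = {ratio_noise:.4f}")
```

The constant buffer shrinks by a large factor, while the noise buffer does not shrink and typically comes out marginally larger because of the compressor's framing overhead. Real data falls somewhere between these extremes, which is why the ratio cannot be known from shapes and dtypes alone.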

some sort of 'predicting' the factor wit…

Answer selected by uriii3