(feat): remove dtype + fill val handling per chunk #124

ilan-gold · 2025-11-03T14:49:37Z

Calling to_native_dtype + __str__ came up as one of the only python-CPU-bound things when doing some benchmarking. My use-case is quite contrived (generating thousands of WithSubset objects) but I think it's probably worth investigating getting rid of these calls. Some observations:

I wonder if all getting the dtype and fill_val be wrapped up in just relying on https://docs.rs/zarrs/latest/zarrs/array/struct.Array.html#method.open and then using the values directly (there are probably other benefits of doing this) but I think this is a separate PR
Regardless, most of this refactor is around removing Basic anyway so that chunk handling is independent of the ability. I noticed that ChunkRepresentation requires ownership over its arguments which means we copy per-chunk. Not sure what would go into making that a reference, but it's no worse than the previous situation where I think we were generating copies repeatedly, but from PyO3 calling python

The benefit wasn't crazy ~5% but I think going in this direction is good (see point 1)

TODO:

Understand our vlen test error messages / warnings re: what we support.

(feat): first pass remove dtype + fill val handling per chunk

697eb52

ilan-gold marked this pull request as draft November 3, 2025 14:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

(feat): remove dtype + fill val handling per chunk #124

(feat): remove dtype + fill val handling per chunk #124

ilan-gold commented Nov 3, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

(feat): remove dtype + fill val handling per chunk #124

Are you sure you want to change the base?

(feat): remove dtype + fill val handling per chunk #124

Conversation

ilan-gold commented Nov 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ilan-gold commented Nov 3, 2025 •

edited

Loading