Skip to content

This contribution provides an interface in MATLAB to read and write Zarr files. Zarr is a file format for scientific data (like HDF5 and netCDF) which is optimized for cloud storage and parallel computing environments.

License

Notifications You must be signed in to change notification settings

mathworks/MATLAB-support-for-Zarr-files

Repository files navigation

MATLAB Support for Zarr files

Zarr® is a chunked, compressed, N-dimensional array storage format optimized for performance and scalability. It is widely used in scientific computing for handling large datasets efficiently. This repository provides an interface to read and write Zarr arrays and metadata from MATLAB®.

For complete documentation, refer to the documentation.md file, or refer to the help section of each function.

Status

  • Supports only Zarr v2.
  • Supports reading and writing of Zarr arrays from local storage and Amazon S3.
  • Supports reading and writing of Zarr metadata from local storage and Amazon S3.

Setup

To use this repository, clone the repo to your local folder and add it to your MATLAB using addpath. For example,

>> addpath("C:\<username>\support-Zarr-in-MATLAB\")

MathWorks Products (https://www.mathworks.com)

Requires MATLAB release R2022b or newer

3rd Party Products:

  • Python - v3.10 or newer
  • tensorstore - v0.1.71 or newer
  • numpy - v1.26.4 or newer

See the link here for the Python versions compatible with different MATLAB releases.

Installation

Before proceeding, please ensure that you have a supported version of Python installed on your machine. Please refer to the following links to configure your system to use Python with MATLAB:

Make sure that the Python path is included in your system path environment variable. To verify that you have a supported version of Python, type (in MATLAB Command Window):

>> pyenv

ans = 

  PythonEnvironment with properties:

          Version: "3.11"
       Executable: "C:\Users\<username>\AppData\Local\Programs\Python\Python311\pythonw.exe"
          Library: "C:\Users\<username>\AppData\Local\Programs\Python\Python311\python311.dll"
             Home: "C:\Users\<username>\AppData\Local\Programs\Python\Python311"
           Status: NotLoaded
    ExecutionMode: OutOfProcess

If the value of the Version property is empty, then you do not have a supported version available.

Once Python is installed, install the Python packages tensorstore and numpy.

Getting Started

  1. Clone the github repo to your local drive.
  2. Start MATLAB.
  3. Add the parent cloned directory to your MATLAB path:
>> addpath ("C:\<username>\support-Zarr-in-MATLAB\")

Examples

Read a Zarr array

filepath = "group1\dset1";
data     = zarrread(filepath);

Create and write to a Zarr array

filepath   = "myZarrfiles\singleDset";
data_shape = [10,10];              % shape of the Zarr array to be written
data       = 5*ones(10,10);        % Data to be written

zarrcreate (filepath, data_shape)  % Create the Zarr array with default attributes
zarrwrite(filepath, data)          % Write data to the Zarr array

Create a Zarr array and write data to it using zlib compression with non-default chunking

filepath = "myZarrfiles\singleZlibDset";

% Size of the data
data_shape = [10, 20];
% Chunk size
chunk_shape = [5, 5];
% Sample data to be written
data = single(5*ones(10, 20));

% Set the compression ID and compression level
compress.id = "zlib";
compress.level = 8;

% Create the Zarr array
zarrcreate(filepath, data_shape, ChunkSize=chunk_shape, DataType="single", ...
	Compression=compress)
	
% Write to the Zarr array
zarrwrite(filepath, data)

Read the metadata from a Zarr array

filepath = "group1\dset1";
info = zarrinfo(filepath);

Help

To view documentation of a function, type help <function_name>. For example,

>> help zarrcreate

or refer to the documentation.md and examples.md files. For more examples, see the examples directory in the repository.

License

The license is available in the License.txt file in this GitHub repository.

Community Support

MATLAB Central

Copyright 2025 The MathWorks, Inc.

About

This contribution provides an interface in MATLAB to read and write Zarr files. Zarr is a file format for scientific data (like HDF5 and netCDF) which is optimized for cloud storage and parallel computing environments.

Resources

License

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published