Replies: 1 comment
-
Hi @MosGeo. Sorry for the having missed this. Without digging deeper, one idea is that the sheer number of chunks in a single folder might be causing you problems on some operating systems. Could you possible "chunk" your Rather than:
you'd end up with a directory structure something like:
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi All,
This question is related to #1486 and #1479.
I am interested in using zarr for a problem that will require a lot of non-sequential retrieval of data (i.e., certain rows at a time). I've done three tests that give different timings below. You can see that the direct sequential timing is the fastest. Any guidance on how to improve the the indecies random retrieval?
The three scenarios tested:
I am interested in the first case in particular. Is zarrthe best option for this? How about the the storage type? My current perference is SQLiteStore as I am attaching some other tables to the same database file (for fast metadata retrieval and querying; this might be related to zarr-developers/zarr-specs#154).
Note: the chunking was chosen based on the data retrieval (i.e., I will always retrieve the whole row).
Now, lets create the sampling arrays
The final results are (not regerous testing but it holds overall):
Beta Was this translation helpful? Give feedback.
All reactions