-
Notifications
You must be signed in to change notification settings - Fork 96
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Reading local Zarr files into stars #663
Comments
I get > read_mdim("sic_daily_sample.zarr/")
stars object with 3 dimensions and 1 attribute
attribute(s), summary of first 1e+05 cells:
Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
siconc [1] NA NA NA NaN NA NA 1e+05
dimension(s):
from to refsys point
longitude 1 4320 WGS 84 NA
latitude 1 961 WGS 84 NA
time 1 1 POSIXct TRUE
values x/y
longitude [-180.0417,-179.9583),...,[179.875,179.9583) [x]
latitude [-80.04167,-79.95833),...,[-0.04166667,0.04166667) [y]
time 2021-01-09 UTC What is your |
Thanks Edzer, I tried the same code and got the same error message. My sessionInfo() gives
And my sf_extSoftVersion() prints
|
Please update |
That still printed the same error message as previously. I haven't yet downloaded the latest version of RStudio but I don't imagine that would cause this error? |
See also #566 (comment) |
Apologies, I'm not yet proficient with R. How do I install that patch? I tried using remotes::install_github("rspatial/sf") but I'm still seeing the same error code. |
No need for you to install that patch. |
Sorry I'm a bit lost as to what steps I can take from the other issue to fix my issue. |
I'm just cross linking them; I can reproduce the error on GitHub actions here: https://github.com/r-spatial/stars/actions/runs/7712573313/job/21020420577#step:6:297 |
It seems that this issue is specific to the Windows binary release. Note that you can use CopernicusMarine for subsetting Copernicus Marine data as well. However, it does not yet support ZARR data because of the issue reported here and #566 (comment) |
Thanks @pepijn-devries - I'll look at doing that to download as a netCDF if the Zarr format remains unusable for my setup. My main issue is that the full data I need is massive (~1.3TB as a netCDF but only ~250GB as Zarr), so Zarr would be preferable if it can work! But if not, I'll get a new hard drive and put my computer to the test. |
Windows and MacOS binary releases; we added blosc, at least to windows binary builds, but this suggests it's not working. |
Hi @edzer, Is there any news on the Windows build and blosc decompression of ZARR files? Thanks for your work on the package! By the way, I did some additional testing. The issue does not only occur on Windows, but also on a Linux Fedora (virtual) machine I have set up: library(stars)
#> Loading required package: abind
#> Loading required package: sf
#> Linking to GEOS 3.12.1, GDAL 3.7.3, PROJ 9.2.1; sf_use_s2() is TRUE
dsn <- 'ZARR:"/vsicurl/https://ncsa.osn.xsede.org/Pangeo/pangeo-forge/gpcp-feedstock/gpcp.zarr"'
bounds <- c(longitude = "lon_bounds", latitude = "lat_bounds")
r <- read_mdim(dsn, bounds = bounds)
#> Warning in CPL_read_mdim(file, array_name, options, offset, count, step, : GDAL
#> Error 1: Decompressor blosc not handled
#> Warning in CPL_read_mdim(file, array_name, options, offset, count, step, : GDAL
#> Error 1: Decompressor blosc not handled
#> Warning in CPL_read_mdim(file, array_name, options, offset, count, step, : GDAL
#> Error 1: Decompressor blosc not handled
#> Warning in CPL_read_mdim(file, array_name, options, offset, count, step, : GDAL
#> Error 1: Decompressor blosc not handled
#> Warning in CPL_read_mdim(file, array_name, options, offset, count, step, : GDAL
#> Error 1: Decompressor blosc not handled
#> Warning in CPL_read_mdim(file, array_name, options, offset, count, step, : GDAL
#> Error 1: Decompressor blosc not handled
#> Warning in CPL_read_mdim(file, array_name, options, offset, count, step, : GDAL
#> Error 1: Decompressor blosc not handled
#> Error in CPL_read_mdim(file, array_name, options, offset, count, step, : CHAR() can only be applied to a 'CHARSXP', not a 'NULL' Created on 2024-03-11 with reprex v2.1.0 With
|
Same here, using MacOS.
With
|
This was failing for me on Mac, both in R and on the command line, accessing ERA5 dataset on GCP
and stars::read_stars('ZARR:"/vsigs/gcp-public-data-arco-era5/co/single-level-reanalysis.zarr":/time') Failed with the On Mac, it appears that gdal is not built with blosc by default by Homebrew ( So I did:
Then install.packages(c("sf", "stars"), type = "source") and voila: stars::read_stars('ZARR:"/vsigs/gcp-public-data-arco-era5/co/single-level-reanalysis.zarr":/time')
#> stars object with 2 dimensions and 1 attribute
#> attribute(s):
#> Min. 1st Qu. Median Mean
#> time [(hours since 1979-01-01 00:00:00)] 0 93503.75 187007.5 187007.5
#> 3rd Qu. Max.
#> time [(hours since 1979-01-01 00:00:00)] 280511.2 374015
#> dimension(s):
#> from to offset delta x/y
#> x 1 374016 -0.5 1 [x]
#> y 1 1 -0.5 1 [y] So I think a PR to gdal to add |
Actually, maybe a PR to https://github.com/R-macos/recipes/ is a better choice (or in addition to homebrew) |
Yes, that is the place from where CRAN macos binaries originate from, which most people use. |
I'll see if I can send a PR to https://github.com/r-windows/ for the windows build of gdal... |
PR submitted here: R-macos/recipes#60. I had some trouble building gdal locally, any suggestions would be welcome! |
Homebrew PR submitted as well: Homebrew/homebrew-core#201008 |
@pepijn-devries : blosc support should be present in the windows build: https://github.com/r-spatial/sf/blob/main/src/Makevars.ucrt#L8 |
I think on Windows this bundle is used for GDAL: https://github.com/r-spatial/sf/blob/39e8f51372e19237d95cd406ae4683a253c3c5b2/tools/winlibs.R#L11 And if I look at the build script that creates that bundle: https://github.com/rwinlib/gdal3/blob/master/.github/workflows/bundle.sh It does not link to blosc. Isn't that the problem? |
I think rwinlib is no longer used for building CRAN windows binaries, and IIRC the R source tree (and build tools) is not on, or taken from, GitHub, at best it is a copy. |
Hi,
After looking at the vignette for reading Zarr files in stars, I am unsure how to read local Zarr directories into R. I have been trying to work with satellite imagery for the Southern Ocean downloaded from Copernicus' Marine Data Client.
Here is my attempt at coding this
`library(stars)
dsn <- 'ZARR:"sic_daily_samples.zarr/"'
read_mdim(dsn)`
Which gives the error message
Error in CPL_read_mdim(file, array_name, options, offset, count, step, : CHAR() can only be applied to a 'CHARSXP', not a 'NULL' In addition: Warning messages: 1: In CPL_read_mdim(file, array_name, options, offset, count, step, : GDAL Error 1: Decompressor blosc not handled 2: In CPL_read_mdim(file, array_name, options, offset, count, step, : GDAL Error 1: Decompressor blosc not handled 3: In CPL_read_mdim(file, array_name, options, offset, count, step, : GDAL Error 1: Decompressor blosc not handled 4: In CPL_read_mdim(file, array_name, options, offset, count, step, : GDAL Error 1: Decompressor blosc not handled
I've uploaded a subset of the data for ease but I can't figure out how to read it as a zipped or unzipped file, so any help with this would be appreciated!
Thanks,
Josh
sic_daily_samples.zarr.zip
The text was updated successfully, but these errors were encountered: