Switch from per-band mean/std normalization of Sentinel-2 values to surface reflectance conversion instead #94
Another detail: after chatting with @lillythomas, we'll also need to apply a bias correction for Sentinel-2 images that were taken after Jan 2022, due to changes in the Sentinel-2 processing baseline (see line 152 in ee74c91).
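For context, a sketch of what such a bias correction might look like. This assumes the correction is the commonly used one for Sentinel-2 processing baseline 04.00 (deployed around 25 Jan 2022), which added a fixed offset of 1000 to the digital numbers; the actual correction in the repo (line 152 in ee74c91) may differ, and the function/constant names here are hypothetical:

```python
from datetime import date

# Processing baseline 04.00 (deployed ~25 Jan 2022) added a fixed
# offset of 1000 to Sentinel-2 digital numbers (DNs).
BASELINE_CHANGE = date(2022, 1, 25)  # approximate cutover date
OFFSET = 1000


def harmonize_dn(dn: int, acquired: date) -> int:
    """Remove the post-2022 DN offset so old and new scenes are comparable."""
    if acquired >= BASELINE_CHANGE:
        return max(dn - OFFSET, 0)  # clip at 0, matching the pre-2022 valid range
    return dn
```

Applying this before any reflectance conversion keeps pre- and post-2022 scenes on the same radiometric scale.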
Good article on normalizing EO imagery: https://medium.com/sentinel-hub/how-to-normalize-satellite-images-for-deep-learning-d5b668c885af
No, we should not use BatchNorm for the Foundation Model layers, since it is doing mean/std normalization! In Super Resolution models such as ESRGAN (Wang et al., 2018) and EDSR (Lim et al., 2017), BatchNorm layers have been removed and replaced with residual skip connections; as Lim et al. (2017) argue, normalizing the features gets rid of the network's range flexibility.
The point, though, is that we shouldn't be doing any mean/std normalization on the inputs to the first layer of the model, so that the band ratios are preserved. If we want to apply normalization to subsequent layers, that's fine, and for more recent models (from 2020 onwards), it seems like LayerNorm is preferred over BatchNorm (e.g. see ConvNeXt and https://stats.stackexchange.com/questions/474440/why-do-transformers-use-layer-norm-instead-of-batch-norm).
I think for v1 and beyond we can use both. The model should see L1 data, L2 data, and different normalization patterns, so that it hopefully generalizes better. So I am closing this for now. @weiji14, feel free to keep this open or re-open later if we get back to working on this.
Using mean and standard deviation normalization is a common procedure in standard Computer Vision, and can be applied e.g. by using `torchvision`'s `Normalize` function. But this normalization can lead to incorrect band ratios when applied to optical remote sensing images. E.g. let's take the formula for the Normalized Difference Vegetation Index (NDVI):

$$\mathrm{NDVI} = \frac{NIR - Red}{NIR + Red}$$
If, say, we have an un-normalized Sentinel-2 pixel with Band 8 (NIR): 3327, and Band 4 (Red): 426, then the NDVI value would be:

$$\mathrm{NDVI} = \frac{3327 - 426}{3327 + 426} = \frac{2901}{3753} \approx 0.77$$
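As a quick sanity check, the arithmetic above can be reproduced in a few lines of Python:

```python
nir, red = 3327, 426  # Sentinel-2 Band 8 (NIR) and Band 4 (Red) digital numbers

ndvi = (nir - red) / (nir + red)
print(round(ndvi, 2))  # → 0.77
```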
However, if we apply a per-band mean/std normalization scheme, with band-specific means $\mu_{NIR}$, $\mu_{Red}$ and standard deviations $\sigma_{NIR}$, $\sigma_{Red}$, the value becomes:

$$\mathrm{NDVI}_{norm} = \frac{\frac{3327 - \mu_{NIR}}{\sigma_{NIR}} - \frac{426 - \mu_{Red}}{\sigma_{Red}}}{\frac{3327 - \mu_{NIR}}{\sigma_{NIR}} + \frac{426 - \mu_{Red}}{\sigma_{Red}}} \neq 0.77$$
Clearly this is wrong, since we've removed the NDVI signal! A model trained on these mean/std normalized pixel values would have a harder time capturing the semantics of band indices such as NDVI.
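To make the distortion concrete, here is a small sketch using made-up per-band statistics (the means and standard deviations below are illustrative only, not the real Sentinel-2 band statistics):

```python
# Hypothetical per-band statistics: (mean, std) for NIR (Band 8) and Red (Band 4).
# These are illustrative values, NOT real Sentinel-2 statistics.
stats = {"nir": (2000.0, 1000.0), "red": (1000.0, 600.0)}

nir, red = 3327, 426  # same pixel as above, true NDVI ≈ 0.77

nir_n = (nir - stats["nir"][0]) / stats["nir"][1]  # per-band normalized NIR
red_n = (red - stats["red"][0]) / stats["red"][1]  # per-band normalized Red

# Recomputing NDVI on the normalized values gives a very different number.
ndvi_normalized = (nir_n - red_n) / (nir_n + red_n)
print(round(ndvi_normalized, 2))  # ≈ 6.17, nowhere near 0.77
```

The exact wrong answer depends on the statistics chosen, but any per-band shift-and-scale will break the ratio.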
One possible solution: instead of applying a per-band normalization, we can convert the Sentinel-2 Digital Number (DN) values to surface reflectance by dividing by the dynamic range of the band, yielding a value between 0 and 1. Sentinel-2's MSI sensor is 12-bit, but the data is stored as 16-bit. Usually people use 10000, but this doesn't work for very bright white areas, so I'll use $2^{14} = 16384$ below:

$$\mathrm{NDVI} = \frac{3327/16384 - 426/16384}{3327/16384 + 426/16384} = \frac{3327 - 426}{3327 + 426} \approx 0.77$$

which matches the actual NDVI value, since the common scale factor cancels in the ratio.
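The same conversion in code, showing that dividing every band by a single scale factor leaves NDVI untouched:

```python
SCALE = 2 ** 14  # 16384: covers Sentinel-2's 12-bit sensor with headroom

nir_dn, red_dn = 3327, 426
nir_sr = nir_dn / SCALE  # ≈ 0.203 reflectance-like value in [0, 1]
red_sr = red_dn / SCALE  # ≈ 0.026

ndvi = (nir_sr - red_sr) / (nir_sr + red_sr)
print(round(ndvi, 2))  # → 0.77, identical to the raw-DN result: the scale cancels
```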
Notes:
Side note: using a single mean and standard deviation value for all Sentinel-2 bands won't preserve the band ratios either. E.g. if we use a mean value of 1351 and a standard deviation of 1071, and apply it to the NIR/Red bands:

$$\frac{\frac{3327 - 1351}{1071} - \frac{426 - 1351}{1071}}{\frac{3327 - 1351}{1071} + \frac{426 - 1351}{1071}} = \frac{2901}{1051} \approx 2.76$$

The 2.76 result is still not the correct NDVI value of 0.77.
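The single-stat case can be verified the same way; the subtraction shifts the numerator and denominator differently, so even a shared mean/std distorts the ratio:

```python
mean, std = 1351, 1071  # one mean/std applied to every band

nir, red = 3327, 426
nir_n = (nir - mean) / std  # ≈ 1.845
red_n = (red - mean) / std  # ≈ -0.864

ndvi_n = (nir_n - red_n) / (nir_n + red_n)
print(round(ndvi_n, 2))  # → 2.76, not the true NDVI of 0.77
```

Note that a pure rescaling (division only, no mean subtraction) is the only affine transform that leaves the ratio intact.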