scVI+MMD: Variable-Strength Batch Correction with scVI #2

Open · wants to merge 14 commits into master
Conversation

@watiss (Owner) commented on May 17, 2021

This change adds a term to the VAE objective function that measures the extent to which the posterior distributions for different batches are dissimilar; in effect, it measures the effectiveness of the batch effect correction. The term is added to the loss with a scaling factor, beta, which can be used to regulate the strength of the batch effect correction (note that, taken to its extreme, batch effect correction can over-mix cells by enforcing over-similarity of the latent distributions).

The newly added term is the aggregate Maximum Mean Discrepancy (MMD; https://www.jmlr.org/papers/volume13/gretton12a/gretton12a.pdf) over pairs of sample sets taken from the latent distributions, grouped by their batch of origin. We provide two modes of computation, ‘normal’ and ‘fast’: the former computes the exact MMD (quadratic runtime in the number of samples), while the latter computes a fast approximation of the MMD (linear runtime in the number of samples). Furthermore, in the spirit of fast approximations, we only compute the MMD for consecutive pairs of batches rather than for all pairwise combinations.

MMD parameters (mode and weight) can be set at model instantiation. The MMD loss is recorded in the trainer's history and is therefore available from the model history once training completes.
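For illustration, usage might look like the following sketch; `mmd_mode` and `mmd_weight` are hypothetical keyword names standing in for the mode and weight parameters described above (not confirmed against this PR's code), while the history keys match the ones exercised in the test snippet further below.

```python
import scvi

# Hypothetical usage sketch: mmd_mode and mmd_weight are assumed keyword
# names, not confirmed against this PR's actual API.
# `adata` is an AnnData object with a batch annotation column.
scvi.data.setup_anndata(adata, batch_key="batch")
model = scvi.model.SCVI(adata, mmd_mode="fast", mmd_weight=1.0)  # weight = beta
model.train()

# The MMD loss is recorded in the training history.
mmd_train = model.history["mmd_loss_train"]
mmd_val = model.history["mmd_loss_validation"]
```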

Details of the MMD computation

The formula used for the normal (exact) computation is formula 5 in the paper linked above, with a Gaussian kernel and gamma=1. X and Y in our case are two sets of samples, Z1 and Z2, taken from the latent space, where Z1 corresponds to cells originating from batch k and Z2 corresponds to cells originating from batch k’. We carry this out for all consecutive (k, k’) pairs and sum over them to obtain the aggregate MMD loss, L_mmd.
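To make this concrete, here is a minimal sketch of the exact (quadratic-time) estimate with a Gaussian kernel and gamma=1, written independently of this PR's code; the actual implementation may differ in details (e.g., whether the square root in the paper's formula is taken).

```python
import torch

def gaussian_kernel(x: torch.Tensor, y: torch.Tensor, gamma: float = 1.0) -> torch.Tensor:
    """Pairwise Gaussian kernel k(x_i, y_j) = exp(-gamma * ||x_i - y_j||^2)."""
    sq_dists = torch.cdist(x, y) ** 2  # (n_x, n_y) squared Euclidean distances
    return torch.exp(-gamma * sq_dists)

def mmd_exact(z1: torch.Tensor, z2: torch.Tensor, gamma: float = 1.0) -> torch.Tensor:
    """Biased empirical estimate of the squared MMD between z1 (batch k) and z2 (batch k').

    Quadratic in the number of samples, since all pairwise kernel values are computed.
    """
    k11 = gaussian_kernel(z1, z1, gamma).mean()
    k22 = gaussian_kernel(z2, z2, gamma).mean()
    k12 = gaussian_kernel(z1, z2, gamma).mean()
    return k11 + k22 - 2.0 * k12
```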

Finally, the existing SCVI loss (negative ELBO) is updated as follows:
L_scvi-mmd = L_scvi + beta * L_mmd

In fast mode, we proceed the same way as above, except that the MMD is computed with the linear-time estimator presented in Lemma 14 of the paper linked above.
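A minimal sketch of the linear-time estimator, again independent of the PR's actual code; it pairs up consecutive samples and averages the h-statistic from Lemma 14, which mirrors the even/odd pairing visible in the reviewed snippet further below.

```python
import torch

def mmd_linear(z1: torch.Tensor, z2: torch.Tensor, gamma: float = 1.0) -> torch.Tensor:
    """Linear-time estimate of the squared MMD (Lemma 14 in Gretton et al., 2012).

    Consecutive samples are paired up and the h-statistic
    h = k(x, x') + k(y, y') - k(x, y') - k(x', y) is averaged over the pairs.
    """
    n = min(z1.shape[0], z2.shape[0]) // 2 * 2  # largest even number of usable rows
    x, x_prime = z1[0:n:2], z1[1:n:2]
    y, y_prime = z2[0:n:2], z2[1:n:2]

    def k(a: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
        # Gaussian kernel evaluated row-wise on already-paired samples
        return torch.exp(-gamma * ((a - b) ** 2).sum(dim=1))

    h = k(x, x_prime) + k(y, y_prime) - k(x, y_prime) - k(x_prime, y)
    return h.mean()
```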

Results

The following Colab notebooks show runs of the current and updated SCVI models, along with training curves and runtimes: notebook1 for the current model, notebook2 for the updated model.

Changes to poetry.lock

This change also includes a minor update to the poetry.lock file that changes the pinned version of the llvmlite package. Currently, Poetry fails dependency resolution with a SolverProblemError because the declared numba and llvmlite versions are incompatible: the numba 0.51.2 release notes (https://pypi.org/project/numba/0.51.2/) state that it is only compatible with llvmlite 0.34.*, which does not match the llvmlite version currently declared in the lock file (0.35.0rc2).

@watiss changed the title from "SCVI+MMD: Variable-Strength Batch Correction with scVI" to "scVI+MMD: Variable-Strength Batch Correction with scVI" on May 17, 2021
assert len(model.history["mmd_loss_validation"]) == 1
assert not np.isnan(model.history["mmd_loss_train"].values[0][0])
assert not np.isnan(model.history["mmd_loss_validation"].values[0][0])
model.get_mmd_loss()
watiss (Owner, Author) commented:
From what I understand, this call does not appear to actually perform validation. I followed the same pattern as the rest of this test (see, for example, get_elbo a bit further above), but I am curious how these calls (get_elbo, get_marginal_ll, etc.) actually perform validation, if they do.

Comment on lines 397 to 400
z1_even = z1[: batch_size - 1 : 2, :]
z1_odd = z1[1:batch_size:2, :]
z2_even = z2[: batch_size - 1 : 2, :]
z2_odd = z2[1:batch_size:2, :]


No need to take every other row; since the order in this tensor is random anyway, you can just take the first half and the second half.
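For illustration, the suggested half-split could look like this sketch (assuming `z1`, `z2`, and `batch_size` as in the snippet above):

```python
# Sketch of the suggested alternative: row order is already random,
# so split each tensor into first/second halves instead of even/odd rows.
half = batch_size // 2
z1_a, z1_b = z1[:half, :], z1[half : 2 * half, :]
z2_a, z2_b = z2[:half, :], z2[half : 2 * half, :]
```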

watiss (Owner, Author) replied:
Thank you, I made that change. Although I'm curious: if the order is random, shouldn't any pairing scheme work equally well, whether it is every other row or pairs from the first and second halves?
