
Add SpinQuant pass #1557

Open
jambayk wants to merge 1 commit into main from jambayk/spinquant
Conversation

jambayk
Contributor

@jambayk commented Jan 17, 2025

Describe your changes

Add a new pass to do weight rotation using SpinQuant.

  • Similar to QuaRot, this pass also only performs offline weight rotation.
  • The concept is similar to QuaRot but the rotation weights are trained on a calibration dataset to improve activation quantization quality.
  • Only per_token and per_tensor activation quantization are supported. Groupwise quantization is not supported yet, since we don't expect it to be used in subsequent QDQ models.
  • Common training utils have been abstracted out of the LoRA passes.
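
As a rough illustration of what offline weight rotation means here (hypothetical shapes and tensor names, not the pass's actual API): an orthogonal matrix can be folded into a pair of adjacent weight matrices so the end-to-end computation is unchanged, while the rotated hidden activations become easier to quantize:

```python
import torch

def random_orthogonal(n: int) -> torch.Tensor:
    # QR decomposition of a random Gaussian matrix yields an orthogonal Q.
    q, _ = torch.linalg.qr(torch.randn(n, n, dtype=torch.float64))
    return q

# Hypothetical pair of adjacent linear layers (hidden size 8).
hidden = 8
w_down = torch.randn(hidden, hidden, dtype=torch.float64)  # produces hidden states
w_up = torch.randn(hidden, hidden, dtype=torch.float64)    # consumes hidden states

r = random_orthogonal(hidden)

# Offline rotation: fold R^T into the producer and R into the consumer,
# so the product W_up R R^T W_down equals the original W_up W_down.
w_down_rot = r.T @ w_down
w_up_rot = w_up @ r

x = torch.randn(hidden, dtype=torch.float64)
y_orig = w_up @ (w_down @ x)
y_rot = w_up_rot @ (w_down_rot @ x)
assert torch.allclose(y_orig, y_rot)  # end-to-end output unchanged
```

In SpinQuant, instead of being random, R is trained on a calibration dataset so the rotated activations are friendlier to quantization; the fusion itself works the same way.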

Checklist before requesting a review

  • Add unit tests for this change.
  • Make sure all tests can pass.
  • Update documents if necessary.
  • Lint and apply fixes to your code by running lintrunner -a
  • Is this a user-facing change? If yes, give a description of this change to be included in the release notes.
  • Is this PR including examples changes? If yes, please remember to update example documentation in a follow-up PR.

(Optional) Issue link

@jambayk marked this pull request as draft January 17, 2025 00:31
@jambayk force-pushed the jambayk/spinquant branch 2 times, most recently from 28222ad to 7d4169d on January 22, 2025 22:56
Base automatically changed from jambayk/quarot to main January 22, 2025 23:30
@jambayk marked this pull request as ready for review January 22, 2025 23:33

# optimizer
optimizer = SGDG(
    rotation_params, lr=training_args.learning_rate, weight_decay=training_args.weight_decay, stiefel=True
)
Contributor

Is stiefel always true here?

Contributor Author

Yes, it is required to be True to do the Cayley SGD for orthogonal matrices. Without it, the optimizer behaves the same as normal SGD. Original implementation here: https://github.com/facebookresearch/SpinQuant/blob/44dbc26056ee9e319dd8ce24bfbf7203785f5c77/optimize_rotation.py#L109
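
To make the distinction concrete, here is a minimal sketch of a Cayley-transform update step (illustrative only; the real SGDG optimizer in the linked repo also handles momentum and per-group options). The key property is that the update multiplies the parameter by an orthogonal factor, so an orthogonal rotation matrix stays orthogonal, which plain SGD does not guarantee:

```python
import torch

def cayley_step(r: torch.Tensor, grad: torch.Tensor, lr: float) -> torch.Tensor:
    # Project the Euclidean gradient onto a skew-symmetric matrix A,
    # the tangent direction on the orthogonal group.
    g = grad @ r.T
    a = g - g.T
    i = torch.eye(r.shape[0], dtype=r.dtype)
    # Cayley transform of a skew-symmetric matrix is orthogonal,
    # so q @ r remains orthogonal.
    q = torch.linalg.solve(i + (lr / 2) * a, i - (lr / 2) * a)
    return q @ r

r = torch.eye(4, dtype=torch.float64)
r = cayley_step(r, torch.randn(4, 4, dtype=torch.float64), lr=0.1)
assert torch.allclose(r @ r.T, torch.eye(4, dtype=torch.float64), atol=1e-8)
```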

Contributor

Curious why the stiefel default is False, and why the optimizer also implements the case when stiefel is False?

Contributor Author

I am not sure about the reason, but yes, it implements the False case too.

I got it from https://github.com/facebookresearch/SpinQuant/blob/main/train_utils/optimizer.py#L57, which was based on the torch SGD implementation https://github.com/pytorch/pytorch/blob/main/torch/optim/sgd.py#L26.
Maybe they wanted to have both modes available in the same optimizer for completeness.

Contributor

IMO, if we don't (and won't) support the other case, we should remove that dead code.

Contributor Author

Sounds good! I will remove the False case and simplify the optimizer.

Contributor Author

I tried doing this, but the options are more intertwined than I expected. There are if/else conditions involving stiefel=True/False plus related parameters, and I cannot verify the correctness of a modified optimizer, so I decided to keep it as is. Even the original SpinQuant implementation copied the optimizer directly from the source code of the paper that introduced the algorithm.
