-
Notifications
You must be signed in to change notification settings - Fork 419
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[ET-VK][ez] Fix linear weight int4 test due to change in ATen API #7739
Conversation
## Context Recently the ATen API for 4-bit quantized linear has changed, so our test must adapt to the change in API. Concretely, the changes in API were: * The `_for_cpu` suffix was added to the operator name * The `_convert_weight_to_int4pack_mm` operator now expects unpacked 4-bit weights instead of a packed scheme where 2 4-bit values are packed into a single 8-bit value. Differential Revision: [D68333687](https://our.internmc.facebook.com/intern/diff/D68333687/) [ghstack-poisoned]
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/7739
Note: Links to docs will display an error until the docs builds have been completed. ⏳ No Failures, 1 PendingAs of commit b9c38a2 with merge base 1a6b7a6 (): This comment was automatically generated by Dr. CI and updates every 15 minutes. |
## Context Recently the ATen API for 4-bit quantized linear has changed, so our test must adapt to the change in API. Concretely, the changes in API were: * The `_for_cpu` suffix was added to the operator name * The `_convert_weight_to_int4pack_mm` operator now expects unpacked 4-bit weights instead of a packed scheme where 2 4-bit values are packed into a single 8-bit value. Differential Revision: [D68333687](https://our.internmc.facebook.com/intern/diff/D68333687/) ghstack-source-id: 261917427 Pull Request resolved: #7739
This pull request was exported from Phabricator. Differential Revision: D68333687 |
…Ten API" ## Context Recently the ATen API for 4-bit quantized linear has changed, so our test must adapt to the change in API. Concretely, the changes in API were: * The `_for_cpu` suffix was added to the operator name * The `_convert_weight_to_int4pack_mm` operator now expects unpacked 4-bit weights instead of a packed scheme where 2 4-bit values are packed into a single 8-bit value. Differential Revision: [D68333687](https://our.internmc.facebook.com/intern/diff/D68333687/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D68333687 |
Pull Request resolved: #7739 ## Context Recently the ATen API for 4-bit quantized linear has changed, so our test must adapt to the change in API. Concretely, the changes in API were: * The `_for_cpu` suffix was added to the operator name * The `_convert_weight_to_int4pack_mm` operator now expects unpacked 4-bit weights instead of a packed scheme where 2 4-bit values are packed into a single 8-bit value. ghstack-source-id: 261959346 @exported-using-ghexport Differential Revision: [D68333687](https://our.internmc.facebook.com/intern/diff/D68333687/)
b230fcf
into
gh/SS-JIA/170/base
) Pull Request resolved: #7739 ## Context Recently the ATen API for 4-bit quantized linear has changed, so our test must adapt to the change in API. Concretely, the changes in API were: * The `_for_cpu` suffix was added to the operator name * The `_convert_weight_to_int4pack_mm` operator now expects unpacked 4-bit weights instead of a packed scheme where 2 4-bit values are packed into a single 8-bit value. ghstack-source-id: 261959346 @exported-using-ghexport Differential Revision: [D68333687](https://our.internmc.facebook.com/intern/diff/D68333687/) Co-authored-by: Stephen Jia <[email protected]>
Stack from ghstack (oldest at bottom):
Context
Recently the ATen API for 4-bit quantized linear has changed, so our test must adapt to the change in API.
Concretely, the changes in API were:
_for_cpu
suffix was added to the operator name_convert_weight_to_int4pack_mm
operator now expects unpacked 4-bit weights instead of a packed scheme where 2 4-bit values are packed into a single 8-bit value.Differential Revision: D68333687