[ET-VK][ez] Fix linear weight int4 test due to change in ATen API #7739

SS-JIA · 2025-01-17T20:57:12Z

Stack from ghstack (oldest at bottom):

-> [ET-VK][ez] Fix linear weight int4 test due to change in ATen API #7739

Context

Recently the ATen API for 4-bit quantized linear has changed, so our test must adapt to the change in API.

Concretely, the changes in API were:

The _for_cpu suffix was added to the operator name
The _convert_weight_to_int4pack_mm operator now expects unpacked 4-bit weights instead of a packed scheme where 2 4-bit values are packed into a single 8-bit value.

Differential Revision: D68333687

## Context Recently the ATen API for 4-bit quantized linear has changed, so our test must adapt to the change in API. Concretely, the changes in API were: * The `_for_cpu` suffix was added to the operator name * The `_convert_weight_to_int4pack_mm` operator now expects unpacked 4-bit weights instead of a packed scheme where 2 4-bit values are packed into a single 8-bit value. Differential Revision: [D68333687](https://our.internmc.facebook.com/intern/diff/D68333687/) [ghstack-poisoned]

pytorch-bot · 2025-01-17T20:57:15Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/7739

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

⏳ No Failures, 1 Pending

As of commit b9c38a2 with merge base 1a6b7a6 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

## Context Recently the ATen API for 4-bit quantized linear has changed, so our test must adapt to the change in API. Concretely, the changes in API were: * The `_for_cpu` suffix was added to the operator name * The `_convert_weight_to_int4pack_mm` operator now expects unpacked 4-bit weights instead of a packed scheme where 2 4-bit values are packed into a single 8-bit value. Differential Revision: [D68333687](https://our.internmc.facebook.com/intern/diff/D68333687/) ghstack-source-id: 261917427 Pull Request resolved: #7739

facebook-github-bot · 2025-01-17T20:57:29Z

This pull request was exported from Phabricator. Differential Revision: D68333687

…Ten API" ## Context Recently the ATen API for 4-bit quantized linear has changed, so our test must adapt to the change in API. Concretely, the changes in API were: * The `_for_cpu` suffix was added to the operator name * The `_convert_weight_to_int4pack_mm` operator now expects unpacked 4-bit weights instead of a packed scheme where 2 4-bit values are packed into a single 8-bit value. Differential Revision: [D68333687](https://our.internmc.facebook.com/intern/diff/D68333687/) [ghstack-poisoned]

facebook-github-bot · 2025-01-17T21:00:17Z

This pull request was exported from Phabricator. Differential Revision: D68333687

Pull Request resolved: #7739 ## Context Recently the ATen API for 4-bit quantized linear has changed, so our test must adapt to the change in API. Concretely, the changes in API were: * The `_for_cpu` suffix was added to the operator name * The `_convert_weight_to_int4pack_mm` operator now expects unpacked 4-bit weights instead of a packed scheme where 2 4-bit values are packed into a single 8-bit value. ghstack-source-id: 261959346 @exported-using-ghexport Differential Revision: [D68333687](https://our.internmc.facebook.com/intern/diff/D68333687/)

) Pull Request resolved: #7739 ## Context Recently the ATen API for 4-bit quantized linear has changed, so our test must adapt to the change in API. Concretely, the changes in API were: * The `_for_cpu` suffix was added to the operator name * The `_convert_weight_to_int4pack_mm` operator now expects unpacked 4-bit weights instead of a packed scheme where 2 4-bit values are packed into a single 8-bit value. ghstack-source-id: 261959346 @exported-using-ghexport Differential Revision: [D68333687](https://our.internmc.facebook.com/intern/diff/D68333687/) Co-authored-by: Stephen Jia <[email protected]>

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 17, 2025

facebook-github-bot added the fb-exported label Jan 17, 2025

SS-JIA added the topic: not user facing label Jan 17, 2025

jorgep31415 approved these changes Jan 17, 2025

View reviewed changes

facebook-github-bot merged commit b230fcf into gh/SS-JIA/170/base Jan 17, 2025
10 of 12 checks passed

facebook-github-bot deleted the gh/SS-JIA/170/head branch January 17, 2025 23:48

facebook-github-bot temporarily deployed to cherry-pick-bot January 17, 2025 23:48 — with GitHub Actions Inactive

pytorchbot mentioned this pull request Jan 17, 2025

[ET-VK][ez] Fix linear weight int4 test due to change in ATen API #7751

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ET-VK][ez] Fix linear weight int4 test due to change in ATen API #7739

[ET-VK][ez] Fix linear weight int4 test due to change in ATen API #7739

SS-JIA commented Jan 17, 2025 •

edited

Loading

pytorch-bot bot commented Jan 17, 2025 •

edited

Loading

facebook-github-bot commented Jan 17, 2025

facebook-github-bot commented Jan 17, 2025

[ET-VK][ez] Fix linear weight int4 test due to change in ATen API #7739

[ET-VK][ez] Fix linear weight int4 test due to change in ATen API #7739

Conversation

SS-JIA commented Jan 17, 2025 • edited Loading

Context

pytorch-bot bot commented Jan 17, 2025 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/7739

⏳ No Failures, 1 Pending

facebook-github-bot commented Jan 17, 2025

facebook-github-bot commented Jan 17, 2025

SS-JIA commented Jan 17, 2025 •

edited

Loading

pytorch-bot bot commented Jan 17, 2025 •

edited

Loading