Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Comparison of Quantization Error for Each Linear Operator ? #1424

Open
Lenan22 opened this issue Dec 17, 2024 · 1 comment
Open

Comparison of Quantization Error for Each Linear Operator ? #1424

Lenan22 opened this issue Dec 17, 2024 · 1 comment

Comments

@Lenan22
Copy link

Lenan22 commented Dec 17, 2024

I would like to compare the error before and after quantization for each linear operator, specifically by calculating the cosine similarity between the quantized computation results and the bfloat16/float16/float32 computation results. Do you have an existing solution for this? If so, could you share it with me?

@jerryzh168
Copy link
Contributor

we don't have an example right now in torchao, but we can try to provide an example, I can take a look a bit later if no one takes this up

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants