Comparison of Quantization Error for Each Linear Operator? #1424

Lenan22 · 2024-12-17T03:48:45Z

I would like to compare the error before and after quantization for each linear operator, specifically by calculating the cosine similarity between the quantized computation results and the bfloat16/float16/float32 computation results. Do you have an existing solution for this? If so, could you share it with me?

jerryzh168 · 2024-12-17T23:02:12Z

We don't have an example for this in torchao right now, but we can try to provide one. I can take a look a bit later if no one else takes this up.
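In the meantime, here is a minimal sketch of how such a per-layer comparison could be set up in plain PyTorch. It is not a torchao utility. It registers forward hooks on every `nn.Linear`, runs the same input through a reference model and a quantized copy, and reports the cosine similarity of each layer's output. The helper names (`fake_quantize_int8_`, `capture_linear_outputs`, `compare_linear_outputs`) are hypothetical, and the int8 weight round trip is only a stand-in for whatever quantization you actually apply. The sketch also assumes the quantized model keeps the same module names and still exposes its linear layers as `nn.Linear`.

```python
import copy

import torch
import torch.nn as nn
import torch.nn.functional as F


def fake_quantize_int8_(linear: nn.Linear) -> None:
    """Stand-in quantizer (assumption): symmetric per-output-channel int8
    round trip of the weight. Replace with your real quantization step."""
    w = linear.weight.data
    scale = w.abs().amax(dim=1, keepdim=True).clamp(min=1e-8) / 127.0
    linear.weight.data = torch.round(w / scale).clamp(-127, 127) * scale


def capture_linear_outputs(model: nn.Module, x: torch.Tensor) -> dict:
    """Run model(x) once and record the output of every nn.Linear by name."""
    outputs, handles = {}, []
    for name, module in model.named_modules():
        if isinstance(module, nn.Linear):
            # Bind `name` as a default argument so each hook keeps its own key.
            def hook(mod, inp, out, name=name):
                outputs[name] = out.detach().float()

            handles.append(module.register_forward_hook(hook))
    with torch.no_grad():
        model(x)
    for handle in handles:
        handle.remove()
    return outputs


def compare_linear_outputs(ref_model: nn.Module, quant_model: nn.Module,
                           x: torch.Tensor) -> dict:
    """Cosine similarity between reference and quantized outputs per layer."""
    ref = capture_linear_outputs(ref_model, x)
    quant = capture_linear_outputs(quant_model, x)
    return {
        name: F.cosine_similarity(
            ref[name].flatten(), quant[name].flatten(), dim=0
        ).item()
        for name in ref
    }


torch.manual_seed(0)
ref_model = nn.Sequential(
    nn.Linear(64, 128), nn.ReLU(), nn.Linear(128, 32)
).eval()

# Quantize a deep copy so the reference weights stay untouched.
quant_model = copy.deepcopy(ref_model)
for module in quant_model.modules():
    if isinstance(module, nn.Linear):
        fake_quantize_int8_(module)

similarities = compare_linear_outputs(ref_model, quant_model, torch.randn(16, 64))
for name, sim in similarities.items():
    print(f"{name}: cosine similarity = {sim:.6f}")
```

Note that each layer in the quantized model receives inputs that already carry the error of earlier layers, so the numbers reflect accumulated error at that point in the network. To isolate the error of a single operator, you could instead capture the reference model's inputs to that layer and feed them to both versions of the layer directly.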
