Support for low bit-width Quantization in BitPack #1

Open
hachons opened this issue Aug 22, 2023 · 0 comments

Comments

hachons commented Aug 22, 2023

Does BitPack currently support quantization for 2-bit and 4-bit models, and does that support extend to mixed precision? I'd also appreciate any accuracy metrics you can share, along with the main challenges in achieving accurate inference at such low bit-widths.
