version 0.2.1 iteration plan #110

Closed
7 tasks
iofu728 opened this issue Mar 19, 2024 · 1 comment
Assignees: iofu728
Labels: documentation, feature, iteration plan

Comments


iofu728 commented Mar 19, 2024

Estimated Release Date: 3/19
Release Manager: @suiguoxin
Schedule:

  • Design Review: 3/19
  • Coding: 3/10
  • Testing: 3/19

Features

Backlog

  • P1 exp: target compression ratio vs. real compression ratio on specific data
  • P1 word level compression (When I use chinese llama, the compressed prompt has garbled code #4)
    • > token level, < sentence level; list different mappings and design interface
  • P1 Support more / faster engines (Support for llama.cpp or exl2 #41), including llama_cpp, FasterTransformer, vLLM. ETA: TBD
    • survey which engines to support
  • P2 Documentation and examples
    • Supported models and experiment results (with compressor throughput) after a faster engine is supported
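
For the first backlog item, a minimal sketch of how the gap between target and real compression ratio might be measured. Everything here is an assumption for illustration: the ratio is taken as compressed tokens divided by original tokens, tokens are whitespace-separated words, and `truncate_compressor` is a hypothetical stand-in, not this project's compressor.

```python
from typing import Callable, List

Tokenizer = Callable[[str], List[str]]


def real_ratio(original: str, compressed: str,
               tokenize: Tokenizer = str.split) -> float:
    """Fraction of tokens kept: len(compressed) / len(original)."""
    n_orig = len(tokenize(original))
    if n_orig == 0:
        raise ValueError("original prompt has no tokens")
    return len(tokenize(compressed)) / n_orig


def ratio_gap(original: str, compressed: str, target: float,
              tokenize: Tokenizer = str.split) -> float:
    """Real ratio minus target ratio; negative means over-compressed."""
    return real_ratio(original, compressed, tokenize) - target


def truncate_compressor(prompt: str, target: float) -> str:
    """Hypothetical compressor: keep the first int(target * n) words."""
    words = prompt.split()
    keep = int(target * len(words))
    return " ".join(words[:keep])


if __name__ == "__main__":
    prompt = "a b c d e f g h i j"
    target = 0.25
    compressed = truncate_compressor(prompt, target)
    print(f"target={target:.2f} "
          f"real={real_ratio(prompt, compressed):.2f} "
          f"gap={ratio_gap(prompt, compressed, target):+.2f}")
```

Swapping in the model's own tokenizer for `str.split` would make the measured ratio comparable to the ratio the compressor was asked for.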
iofu728 pinned this issue Mar 19, 2024
iofu728 self-assigned this Mar 19, 2024
iofu728 added the iteration plan, feature, and documentation labels Mar 19, 2024

iofu728 commented Mar 20, 2024

v0.2.1 released.
move to v0.2.2.

iofu728 closed this as completed Mar 20, 2024