Release v0.7.0 #12365

Open
5 of 8 tasks
simon-mo opened this issue Jan 23, 2025 · 15 comments
Labels
release Related to new version release

Comments

@simon-mo added the misc and release labels and removed the misc label Jan 23, 2025
@njhill
Member

njhill commented Jan 23, 2025

#12298 should also now be ready. Update: merged.

@ywang96
Member

ywang96 commented Jan 23, 2025

I should have some numbers ready by EoD

@njhill
Member

njhill commented Jan 24, 2025

The perf dashboard shows a potential regression (see Slack thread), which looks most likely to be caused by #12253.

@wangxiyuan
Contributor

Please include #11324 as well; it is the last PR for the pluggable platform work.

@comaniac
Collaborator

> The perf dashboard shows a potential regression (see Slack thread), which looks most likely to be caused by #12253.

Revert: #12377

@youkaichao
Member

> The perf dashboard shows a potential regression (see Slack thread), which looks most likely to be caused by #12253.
>
> Revert: #12377

Fixed in #12380

@robertgshaw2-redhat
Collaborator

@ElizaWszola - can you post the Mixtral issue?

@DarkLight1337
Member

DarkLight1337 commented Jan 24, 2025

There are still a few consistent CI failures on main:

@ElizaWszola
Contributor

There are some issues in the moe_align_block_size_kernel causing illegal memory accesses in Mixtral models. Here is a PR that fixes this: #12413 (I still haven't figured out why the current code is broken, though).

@tlrmchlsmth
Collaborator

#12417 would be good to pick up as well

@DarkLight1337
Member

DarkLight1337 commented Jan 24, 2025

Anyone available to review #12251? The authors made the effort to open the PR before the model release so we should handle this ASAP.

@rainkert

rainkert commented Jan 25, 2025

> Anyone available to review #12251? The authors made the effort to open the PR before the model release so we should handle this ASAP.

Yes, this is very urgent. After we released the model, we have received many user inquiries about how to run this model within the vLLM framework. Please address this PR as soon as possible.

@robertgshaw2-redhat
Collaborator

> Anyone available to review #12251? The authors made the effort to open the PR before the model release so we should handle this ASAP.
>
> Yes, this is very urgent. After we released the model, we have received many user inquiries about how to run this model within the vLLM framework. Please address this PR as soon as possible.

When is the model release planned for?

@DarkLight1337
Member

> When is the model release planned for?

It was already released yesterday. The PR was originally opened 4 days ago.

@simon-mo
Collaborator Author

I plan to cut a release now for the V1 alpha.

Currently, the known issues are

The next release will happen whenever the model support PRs are merged, and/or when the DeepSeek support update lands, pending FlashInfer kernels.

@simon-mo simon-mo mentioned this issue Jan 27, 2025
4 tasks