What's new in 1.2.0 (2025-01-10)
These are the changes in inference v1.2.0.
New features
- FEAT: support HunyuanVideo by @qinxuye in #2721
- FEAT: support hunyuan-dit text2image by @qinxuye in #2727
- FEAT: support cline for vllm engine by @hwzhuhao in #2734
- FEAT: [UI] theme switch by @Minamiyama in #1335
- FEAT: support qwen2vl run on ascend npu by @Xu-pixel in #2741
- FEAT: [UI] Add language toggle for i18n support. by @yiboyasss in #2744
- FEAT: Support cogagent-9b by @amumu96 in #2740
- FEAT: Xavier: Share KV cache between VLLM replicas by @ChengjieLi28 in #2732
- FEAT: [UI] Add gguf_quantization, gguf_model_path, and cpu_offload for image models. by @yiboyasss in #2753
- FEAT: Support Marco-o1 by @Jun-Howie in #2749
Enhancements
- ENH: [UI] Update Button Style and Interaction Logic for Editing Cache in Model Card. by @yiboyasss in #2746
- ENH: Improve error message by @codingl2k1 in #2738
Bug fixes
- BUG: adapt mlx-vlm v0.1.7 by @qinxuye in #2724
- BUG: pin mlx<0.22.0 to prevent qwen2_vl failing in mlx-vlm by @qinxuye in #2752
Others
- FIX: [UI] Resolve bug preventing '/' input in model_path. by @yiboyasss in #2747
- FIX: [UI] Fix dark mode background bug. by @yiboyasss in #2748
- CHORE: Update new models in readme by @codingl2k1 in #2713
New Contributors
Full Changelog: v1.1.1...v1.2.0