v1.2.0

Latest

Latest

XprobeBot released this 10 Jan 09:34

df45f11

What's new in 1.2.0 (2025-01-10)

These are the changes in inference v1.2.0.

New features

FEAT: support HunyuanVideo by @qinxuye in #2721
FEAT: support hunyuan-dit text2image by @qinxuye in #2727
FEAT: support cline for vllm engine by @hwzhuhao in #2734
FEAT: [UI] theme switch by @Minamiyama in #1335
FEAT: support qwen2vl run on ascend npu by @Xu-pixel in #2741
FEAT: [UI] Add language toggle for i18n support. by @yiboyasss in #2744
FEAT: Support cogagent-9b by @amumu96 in #2740
FEAT: Xavier: Share KV cache between VLLM replicas by @ChengjieLi28 in #2732
FEAT: [UI] Add gguf_quantization, gguf_model_path, and cpu_offload for image models. by @yiboyasss in #2753
FEAT: Support Marco-o1 by @Jun-Howie in #2749

Enhancements

ENH: [UI] Update Button Style and Interaction Logic for Editing Cache in Model Card. by @yiboyasss in #2746
ENH: Improve error message by @codingl2k1 in #2738

Bug fixes

BUG: adapt mlx-vlm v0.1.7 by @qinxuye in #2724
BUG: pin mlx<0.22.0 to prevent qwen2_vl failing in mlx-vlm by @qinxuye in #2752

Others

FIX: [UI] Resolve bug preventing '/' input in model_path. by @yiboyasss in #2747
FIX: [UI] Fix dark mode background bug. by @yiboyasss in #2748
CHORE: Update new models in readme by @codingl2k1 in #2713

New Contributors

@Xu-pixel made their first contribution in #2741

Full Changelog: v1.1.1...v1.2.0

Contributors

qinxuye, Minamiyama, and 7 other contributors

Assets 2