Releases: lshqqytiger/ZLUDA
Releases · lshqqytiger/ZLUDA
v3.8.6
Change logs
- Update ZLUDA BLASLt including bindings for ROCm 6.x api changes.
- Enable hipBLASLt on Windows in nightly build.
You'll need to compile hipBLASLt yourself or get unofficial library for now. - Add cublasXt apis.
- Regenerate cublas bindings.
cublasHgemm*
apis are now exported. - Introduce nightly build.
Build with--nightly
for unstable features. Currently, only one unstable feature, hipBLASLt on Windows, is available. - Allow build for HIP SDK 5.5 and 6.1. (untested)
Any bug/issue report is welcome.
v3.8.5
Change logs
- Rewrite runtime compiler. (now available: PyTorch complex types, JIT, etc)
- New environment variable
ZLUDA_NVRTC_LIB
: our new ZLUDA runtime compiler depends on the original NVIDIA runtime compiler library, so you should specify the path of it unless it is namednvrtc_cuda.dll
.
v3.8.4
Change logs
- Rebuilt for HIP SDK 6.2. ZLUDA for HIP SDK 6.1
v3.8.4
Change logs
- Restore cublas argument. (injector)
- Implement some Dark API functions (#41), Thanks @SEt-t
- Implement
mul24
instruction. - Add environment variable
ZLUDA_COMGR_LOG_LEVEL
.
If you setZLUDA_COMGR_LOG_LEVEL=1
, it will give you minimal logs that demonstrate the kernel code compilation is under progress. - Bring back the minimal implementations of runtime API. (#45)
It is enough to run stable-diffusion-webui-forge with CUDA Stream enabled.
v3.8.3
Change logs
- Add
--version
argument forzluda.exe
. (zluda_injector) - Revive HIP SDK 5.7 support for Windows.
※ Please move to HIP SDK 6.1 if possible. 5.7 support can be discontinued at any time.
v3.8.2
Change logs
- Add support of Rust 1.80.
- Fix bug of
hiprt-sys
. Thanks @mobile-bungalow.
v3.8.1
This release is automatically generated by GitHub Actions.
v3.8
Change logs
- Add ROCm 6.1 support.
- Discontinue ROCm <6 support. v3.8 for ROCm 5.7
※ The future release will target ROCm 6 or later.
v3.7.2
v3.8-rocm5
Implement cuda runtime api. (cudart) (#17) * [WIP] Implement cudart. * wip * wip * Implement cudart. * wip * Ready to merge.