Skip to content

Releases: lshqqytiger/ZLUDA

v3.8.6

10 Jan 05:55
d60bddb
Compare
Choose a tag to compare

Change logs

  • Update ZLUDA BLASLt including bindings for ROCm 6.x api changes.
  • Enable hipBLASLt on Windows in nightly build.
    You'll need to compile hipBLASLt yourself or get unofficial library for now.
  • Add cublasXt apis.
  • Regenerate cublas bindings.
    cublasHgemm* apis are now exported.
  • Introduce nightly build.
    Build with --nightly for unstable features. Currently, only one unstable feature, hipBLASLt on Windows, is available.
  • Allow build for HIP SDK 5.5 and 6.1. (untested)
    Any bug/issue report is welcome.

v3.8.5

03 Jan 05:15
2930436
Compare
Choose a tag to compare

Change logs

  • Rewrite runtime compiler. (now available: PyTorch complex types, JIT, etc)
  • New environment variable ZLUDA_NVRTC_LIB: our new ZLUDA runtime compiler depends on the original NVIDIA runtime compiler library, so you should specify the path of it unless it is named nvrtc_cuda.dll.

v3.8.4

18 Dec 12:45
1b6e012
Compare
Choose a tag to compare

Change logs

v3.8.4

11 Sep 05:51
c0804ca
Compare
Choose a tag to compare

Change logs

  • Restore cublas argument. (injector)
  • Implement some Dark API functions (#41), Thanks @SEt-t
  • Implement mul24 instruction.
  • Add environment variable ZLUDA_COMGR_LOG_LEVEL.
    If you set ZLUDA_COMGR_LOG_LEVEL=1, it will give you minimal logs that demonstrate the kernel code compilation is under progress.
  • Bring back the minimal implementations of runtime API. (#45)
    It is enough to run stable-diffusion-webui-forge with CUDA Stream enabled.

v3.8.3

26 Aug 10:50
1c238a9
Compare
Choose a tag to compare

Change logs

  • Add --version argument for zluda.exe. (zluda_injector)
  • Revive HIP SDK 5.7 support for Windows.

※ Please move to HIP SDK 6.1 if possible. 5.7 support can be discontinued at any time.

v3.8.2

20 Aug 06:14
2f2e38a
Compare
Choose a tag to compare

Change logs

v3.8.1

14 Jul 07:37
86cdab3
Compare
Choose a tag to compare

This release is automatically generated by GitHub Actions.

v3.8

13 Jul 04:47
d7714d8
Compare
Choose a tag to compare

Change logs

※ The future release will target ROCm 6 or later.

v3.7.2

17 May 03:44
4f12e8c
Compare
Choose a tag to compare

Change logs

v3.8-rocm5

17 May 04:15
11cc584
Compare
Choose a tag to compare
Implement cuda runtime api. (cudart) (#17)

* [WIP] Implement cudart.

* wip

* wip

* Implement cudart.

* wip

* Ready to merge.