Skip to content

Releases: kvcache-ai/Mooncake

v0.2.0

20 Jan 12:09
ab8021d
Compare
Choose a tag to compare

The first official release of Mooncake!

See our README for details.

What's Changed

  • docs: update README.md by @eltociear in #9
  • fix the p2pstore go module name, make it could be imported in other projects. by @doujiang24 in #7
  • docs: add new vllm-integration guide. by @ShangmingCai in #11
  • fix: remove atty and fix dependency path by @chestnut-Q in #19
  • [DOC] Add contributing guidelines by @stmatengss in #22
  • [Bugfix] cpp std::string by @wxsms in #18
  • [Doc][Integration] Update commands, params description, and benchmark results for vllm integration v2. by @ShangmingCai in #25
  • [Doc] Fix typos in contributing.md by @ShangmingCai in #26
  • [TransferEngine][feature] support multiple metadata servers by @alogfans in #21
  • [CI] add build test on ubuntu by @Ann-1024 in #28
  • [Doc] Add metadata server backend explanation. by @ShangmingCai in #30
  • [Bugfix] build error when USE_CUDA by @wxsms in #32
  • [Bugfix] fix typos and fix rdma device selection when retry_count != 0 by @fengquyoumo in #33
  • [Doc] Update the integration state of Mooncake Transfer Engine with vLLM. by @ShangmingCai in #40
  • [Doc] Update Mooncake Icon by @stmatengss in #41
  • [Doc] Update README to use v0.2 guide and benchmark results. by @ShangmingCai in #42
  • [TransferEngine] Add topology discovery by @wx-csy in #46
  • [TransferEngine][feature] add the http metadata server. by @doujiang24 in #37
  • [TransferEngine] Refactor code to hide transport logics from user APIs by @alogfans in #51
  • [Transfer Engine] fix metadata server connection string builder in vllm intergation by @alogfans in #56
  • Update metadata_server config info for vllm integration doc. by @ShangmingCai in #57
  • Bump golang.org/x/crypto to 0.31.0 & golang.org/x/net to v0.33.0 by @doujiang24 in #58
  • [TransferEngine] fix: selectDevice() should return zero at all case by @alogfans in #60
  • [p2pstore] fix memory leaking in cgo. by @doujiang24 in #61
  • [TransferEngine] test: cmake enable testing. by @doujiang24 in #65
  • [Doc] Re-enable cuda graph to improve inference performance. by @ShangmingCai in #67
  • [TransferEngine] fix: improve error message by @doujiang24 in #63
  • [TransferEngine] adjust transfer_engine_bench: 1.Introduce the gflag buffer_size for enhanced configurability. 2. Utilize uint64_t for block_size to prevent overflow. by @fengquyoumo in #72
  • [TransferEngine] Add tests for Topology by @liusy58 in #74
  • [TransferEngine] Correct the count of slice_quque_ in rdma/worker_pool, remove the repeated check for overlap by @fengquyoumo in #79
  • fix invalid go release url by @gujingit in #83
  • [TransferEngine] change: auto discover topology & install transport. by @doujiang24 in #73

New Contributors

Full Changelog: https://github.com/kvcache-ai/Mooncake/commits/v0.2.0