Releases · kvcache-ai/Mooncake

The first official release of Mooncake!

See our README for details.

What's Changed

docs: update README.md by @eltociear in #9
fix the p2pstore go module name, make it could be imported in other projects. by @doujiang24 in #7
docs: add new vllm-integration guide. by @ShangmingCai in #11
fix: remove atty and fix dependency path by @chestnut-Q in #19
[DOC] Add contributing guidelines by @stmatengss in #22
[Bugfix] cpp std::string by @wxsms in #18
[Doc][Integration] Update commands, params description, and benchmark results for vllm integration v2. by @ShangmingCai in #25
[Doc] Fix typos in contributing.md by @ShangmingCai in #26
[TransferEngine][feature] support multiple metadata servers by @alogfans in #21
[CI] add build test on ubuntu by @Ann-1024 in #28
[Doc] Add metadata server backend explanation. by @ShangmingCai in #30
[Bugfix] build error when USE_CUDA by @wxsms in #32
[Bugfix] fix typos and fix rdma device selection when retry_count != 0 by @fengquyoumo in #33
[Doc] Update the integration state of Mooncake Transfer Engine with vLLM. by @ShangmingCai in #40
[Doc] Update Mooncake Icon by @stmatengss in #41
[Doc] Update README to use v0.2 guide and benchmark results. by @ShangmingCai in #42
[TransferEngine] Add topology discovery by @wx-csy in #46
[TransferEngine][feature] add the http metadata server. by @doujiang24 in #37
[TransferEngine] Refactor code to hide transport logics from user APIs by @alogfans in #51
[Transfer Engine] fix metadata server connection string builder in vllm intergation by @alogfans in #56
Update metadata_server config info for vllm integration doc. by @ShangmingCai in #57
Bump golang.org/x/crypto to 0.31.0 & golang.org/x/net to v0.33.0 by @doujiang24 in #58
[TransferEngine] fix: selectDevice() should return zero at all case by @alogfans in #60
[p2pstore] fix memory leaking in cgo. by @doujiang24 in #61
[TransferEngine] test: cmake enable testing. by @doujiang24 in #65
[Doc] Re-enable cuda graph to improve inference performance. by @ShangmingCai in #67
[TransferEngine] fix: improve error message by @doujiang24 in #63
[TransferEngine] adjust transfer_engine_bench: 1.Introduce the gflag buffer_size for enhanced configurability. 2. Utilize uint64_t for block_size to prevent overflow. by @fengquyoumo in #72
[TransferEngine] Add tests for Topology by @liusy58 in #74
[TransferEngine] Correct the count of slice_quque_ in rdma/worker_pool, remove the repeated check for overlap by @fengquyoumo in #79
fix invalid go release url by @gujingit in #83
[TransferEngine] change: auto discover topology & install transport. by @doujiang24 in #73