Releases: kvcache-ai/Mooncake
Releases · kvcache-ai/Mooncake
v0.2.0
The first official release of Mooncake!
See our README for details.
What's Changed
- docs: update README.md by @eltociear in #9
- fix the p2pstore go module name, make it could be imported in other projects. by @doujiang24 in #7
- docs: add new vllm-integration guide. by @ShangmingCai in #11
- fix: remove atty and fix dependency path by @chestnut-Q in #19
- [DOC] Add contributing guidelines by @stmatengss in #22
- [Bugfix] cpp std::string by @wxsms in #18
- [Doc][Integration] Update commands, params description, and benchmark results for vllm integration v2. by @ShangmingCai in #25
- [Doc] Fix typos in contributing.md by @ShangmingCai in #26
- [TransferEngine][feature] support multiple metadata servers by @alogfans in #21
- [CI] add build test on ubuntu by @Ann-1024 in #28
- [Doc] Add metadata server backend explanation. by @ShangmingCai in #30
- [Bugfix] build error when USE_CUDA by @wxsms in #32
- [Bugfix] fix typos and fix rdma device selection when retry_count != 0 by @fengquyoumo in #33
- [Doc] Update the integration state of Mooncake Transfer Engine with vLLM. by @ShangmingCai in #40
- [Doc] Update Mooncake Icon by @stmatengss in #41
- [Doc] Update README to use v0.2 guide and benchmark results. by @ShangmingCai in #42
- [TransferEngine] Add topology discovery by @wx-csy in #46
- [TransferEngine][feature] add the http metadata server. by @doujiang24 in #37
- [TransferEngine] Refactor code to hide transport logics from user APIs by @alogfans in #51
- [Transfer Engine] fix metadata server connection string builder in vllm intergation by @alogfans in #56
- Update metadata_server config info for vllm integration doc. by @ShangmingCai in #57
- Bump golang.org/x/crypto to 0.31.0 & golang.org/x/net to v0.33.0 by @doujiang24 in #58
- [TransferEngine] fix: selectDevice() should return zero at all case by @alogfans in #60
- [p2pstore] fix memory leaking in cgo. by @doujiang24 in #61
- [TransferEngine] test: cmake enable testing. by @doujiang24 in #65
- [Doc] Re-enable cuda graph to improve inference performance. by @ShangmingCai in #67
- [TransferEngine] fix: improve error message by @doujiang24 in #63
- [TransferEngine] adjust transfer_engine_bench: 1.Introduce the gflag buffer_size for enhanced configurability. 2. Utilize uint64_t for block_size to prevent overflow. by @fengquyoumo in #72
- [TransferEngine] Add tests for
Topology
by @liusy58 in #74 - [TransferEngine] Correct the count of slice_quque_ in rdma/worker_pool, remove the repeated check for overlap by @fengquyoumo in #79
- fix invalid go release url by @gujingit in #83
- [TransferEngine] change: auto discover topology & install transport. by @doujiang24 in #73
New Contributors
- @eltociear made their first contribution in #9
- @doujiang24 made their first contribution in #7
- @ShangmingCai made their first contribution in #11
- @chestnut-Q made their first contribution in #19
- @stmatengss made their first contribution in #22
- @wxsms made their first contribution in #18
- @alogfans made their first contribution in #21
- @Ann-1024 made their first contribution in #28
- @fengquyoumo made their first contribution in #33
- @wx-csy made their first contribution in #46
- @liusy58 made their first contribution in #74
- @gujingit made their first contribution in #83
Full Changelog: https://github.com/kvcache-ai/Mooncake/commits/v0.2.0