gpud-v0.3.8
GPUd release notes (2025-01-08T13:27:48Z)
Welcome to this new release!
What's Changed
- fix(pkg/process): gracefully handle read operations on an aborted process, Read to return an error if the process was not started by @gyuho in #276
- fix(package-controller): invoke process start before process read by @cardyok in #277
- fix(os): fetch system manufacturer once for linux by @gyuho in #274
- fix(disk/lsblk): support older lsblk without JSON mode, using --pairs by @gyuho in #278
- feat(nvml): include xid events JSON, dmesg xid/sxid to include device UUID field, fix flaky tests, clean up lsblk logs by @gyuho in #279
- feat(fuse): track connections with /metrics (for waiting/congested FUSE connections, per fuse device), lower hw-slowdown event level from warning to info by @gyuho in #268
- fix(systemd): set shorter context timeout for dbus calls by @gyuho in #280
- fix(pkg/disk): skip usage table output render if unmounted by @gyuho in #283
- fix(dmesg): use "journalctl" as fallback when older dmesg (<2.37) does not support the "--since" flag by @sunhailin-Leo in #282
- feat(cpu/dmesg): add regex to catch hung tasks, soft lockup by @gyuho in #285
- nit(nvidia/xid-sxid-state): make purge tests less flaky by @gyuho in #286
- feat(go module): upgrade dependencies fsnotify, grpc, k8s*, prom by @gyuho in #289
- feat(nvidia/peermem): explicitly skip "invalid context" errors by @gyuho in #288
- feat(cpu,memory): return hung task, soft lockup, oom from dmesg via /events, fix log item error type to "*string" by @gyuho in #287
- feat(state): separate read-only sqlite instance for better concurrency by @gyuho in #281
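
For readers curious about the `--pairs` fallback mentioned in #278: older `lsblk` builds without JSON mode can still print each device as a line of `KEY="value"` fields. The sketch below shows one way such a line could be parsed in Go. It is illustrative only and not the code from the PR; `parsePairsLine` and `pairRe` are hypothetical names, and the sample line is made up.

```go
package main

import (
	"fmt"
	"regexp"
)

// pairRe matches a single KEY="value" field as printed by `lsblk --pairs`.
// Keys may contain ':' or '-' (for example MAJ:MIN or LOG-SEC).
// Hypothetical helper, not taken from the GPUd source.
var pairRe = regexp.MustCompile(`([A-Za-z0-9:_-]+)="([^"]*)"`)

// parsePairsLine converts one line of `lsblk --pairs` output into a map
// from column name to value. Empty values are kept as empty strings.
func parsePairsLine(line string) map[string]string {
	fields := make(map[string]string)
	for _, m := range pairRe.FindAllStringSubmatch(line, -1) {
		fields[m[1]] = m[2]
	}
	return fields
}

func main() {
	// Made-up sample line in the KEY="value" format.
	line := `NAME="sda1" TYPE="part" SIZE="512M" MOUNTPOINT="/boot"`
	dev := parsePairsLine(line)
	fmt.Println(dev["NAME"], dev["TYPE"], dev["MOUNTPOINT"])
}
```

A regular expression is used instead of splitting on spaces because values such as mount points can themselves contain spaces.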
New Contributors
- @sunhailin-Leo made their first contribution in #282
Full Changelog: v0.3.7...v0.3.8