-
-
Notifications
You must be signed in to change notification settings - Fork 719
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Disk read/write dashboard plots #5122
Comments
After a quick look, it appears we don't measure disk I/O statistics in the system monitor. We do capture network I/O using distributed/distributed/system_monitor.py Line 74 in 117d277
From the |
@ncclementi if it's easy for you to roll this into the current network bandwidth plots that would be welcome. |
I believe the current network plot PR is ready to merge (as soon as CI finishes) but I can include this on a separate PR. I have a question:
Something like (code not tested) ...
result = {"cpu": cpu, "memory": memory, "time": now, "count": self.count}
if self._collect_disk_io_counters:
try:
io_disk = psutil.disk_io_counters()
except Exception:
pass
else:
last = self._last_disk_io_counters
duration = now - self.last_time
read_bytes = io_disk.read_bytes/((duration or 0.5))
write_bytes = io_disk.write_bytes / (duration or 0.5)
self.last_time = now
self._last_disk_io_counters = io_disk
self.read_bytes_disk.append(read_bytes)
self.write_bytes_disk.append(write_bytes)
result["read_bytes_disk"] = read_bytes
result["write_bytes_disk"] = write_bytes Side note: I notice we have this in metrics but it's not used, is this something that should be removed? distributed/distributed/metrics.py Line 33 in 611414c
|
Yes, we want to add something very much like what you have above to the system monitor. I don't recall the reason for the disk_io_counters. One could use git blame here, or just ignore it for now. |
Some time ago @pentschev added logging for spilling rapidsai/dask-cuda#442 . The PR logging for each event and can query for total time spilling . Might be of interest to folks on this issue |
I believe this was included in #5129 (thanks @ncclementi!) |
Following on #5090 it would be useful to have similar plots for Disk usage. I suspect that it would be the exact same chart (read/write per worker and timeseries), but we'll need to capture disk I/O statistics. I'm not sure that we do this today. It might be worth checking out the system_monitor.py file to see and if not maybe add measurements coming from
psutil
(or somewhere else if there are better ways of measuring this).The text was updated successfully, but these errors were encountered: