-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
What information is missing? #341
Comments
At this point we should consider what info is required for either a normal user or an admin and in a second pass decide which only admins should see, |
Some statistics could be useful, which then might avoid logfile analysis. The things we currently look for are:
Note that core hours could become board hours here if useful (which can then be multiplied up by an estimated average cores-per-board if desired). It would be even better if these statistics can then be broken down (depending on the user model) e.g.:
All of this is desirable, but if it is hard to achieve, we can always do it by post-analysis instead of course. |
Answering only the should not the insn't. Info on each job. Summaries for all jobs History of jobs Machine info Also if applicable the number of jobs in the queue due to machine full and then stats on wait times ect. |
Steve also requested the ability to report on down boards / chips / cores over time. Not sure how easy that would be to keep completely in spalloc though. |
Collecting statistics with a database present should be a lot easier. |
Re job internal status, that would have to be something told to us and which we would just report onwards. Not much we can do otherwise; spalloc really doesn't see what is going on inside. |
I'd be tempted to make the long-term aggregate reporting stuff be things that is just done by running scripts against the DB, instead of being part of the application itself. |
If we dont have the data then lets not complicate the system at this point. |
Happy enough for that to be done, at least initially, especially as this is likely to be faster than scanning files. Longer term, having a web page with nice graphs is an option that can be implemented later. |
If there is a away to allow all query scripts but block data changing ones that is fine. Allowing none precanned scripts/ queries that change the data is dangerous as once accident could destroy the whole database |
When we had webpages with graphs ect these can use prepared scripts which run on the same API. |
Direct access to the DB is always an admin-only thing, as I won't put a general query interface in the service. (For one thing, the connection management API is not set up for producing read-only connections, and for another there will be fields that should remain shrouded from general users.) If you want to run a general query, the way to do it will be to log onto the spalloc machine and either run against the live DB or take a copy of it. Making a copy of the DB could be an (admin-only) operation. |
I've added the ability to look up board information from a machine by the IP address of the board. That was the missing whereis operation from the existing spalloc. 😁 |
Ah yes, that inspires further things:
|
What information should be produced by the spalloc reimplementation, but isn't?
The text was updated successfully, but these errors were encountered: