🐕 Batch: Revisiting Session Management #1511

Open
3 tasks
DhanshreeA opened this issue Jan 15, 2025 · 0 comments
Labels
medium: Useful issue or future roadmap, and needs attention

Summary

Session management in the Ersilia CLI lets users run multiple models simultaneously across different terminal "sessions", which increases productivity. However, Ersilia currently links each session to a terminal process, and this creates several complications, especially in the following situations:

  1. Session directories (typically found in ~/eos/sessions) are not cleaned up if they contain files without write permissions for the current user: deletion fails, and the session directory is left behind.
  2. If a terminal is accidentally closed, the user's system crashes, or the Docker engine crashes (when using Dockerized models), the session directories tend to remain on the system.
  3. It is not possible to run more than one session with the same model simultaneously (see 🐕 Batch: Running the same model in parallel #1223).

For the first case, we have identified log files and temporary directories that model containers create in storage mounted from the host system. These are owned by the container's root user and have 700 permissions, so a regular user cannot delete them without sudo. They are therefore never removed, which causes the entire session directory to remain on disk.
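To make the first case easier to diagnose, here is a minimal sketch of a cleanup helper that deletes what it can and reports what it cannot. It is not Ersilia's current implementation; the function name `remove_tree_reporting_failures` is hypothetical, and the only detail taken from this issue is that root-owned files with 700 permissions block deletion.

```python
import os


def remove_tree_reporting_failures(session_dir):
    """Delete session_dir bottom-up and return the paths that could not be removed.

    Hypothetical helper, not part of Ersilia. Instead of stopping at the first
    PermissionError, it keeps going and collects every path that survived,
    so root-owned leftovers from containers can be listed for the user.
    """
    session_dir = os.fspath(session_dir)
    failures = set()

    def note_walk_error(err):
        # Called when a directory cannot even be listed (e.g. mode 700, owned by root).
        failures.add(err.filename)

    for root, dirnames, filenames in os.walk(
        session_dir, topdown=False, onerror=note_walk_error
    ):
        for name in filenames:
            path = os.path.join(root, name)
            try:
                os.remove(path)
            except OSError:
                failures.add(path)
        for name in dirnames:
            path = os.path.join(root, name)
            try:
                # Symlinks to directories show up in dirnames but need os.remove.
                if os.path.islink(path):
                    os.remove(path)
                else:
                    os.rmdir(path)
            except OSError:
                failures.add(path)

    try:
        os.rmdir(session_dir)
    except OSError:
        failures.add(session_dir)

    return sorted(failures)
```

An empty return value means the session directory is gone. A non-empty list names the files that would need elevated privileges, or a fix on the container side, before the directory can be removed.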

Objective(s)

  • Clean up temporary log files from volume-mounted Docker containers.
  • Figure out how to remove orphaned session folders present on the system.
  • Support running the same model in parallel sessions on the same system.
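One possible approach to the second objective is sketched below. Since a session is linked to a terminal process, a session folder could be treated as orphaned once that process no longer exists. The sketch assumes, purely for illustration, that each folder is named `session_<pid>`; that naming scheme, the `SESSION_NAME` pattern, and both function names are hypothetical and not confirmed by this issue. The liveness check relies on `os.kill(pid, 0)` and is meant for POSIX systems.

```python
import os
import re
from pathlib import Path

# Hypothetical naming scheme: the folder name carries the owning process ID.
SESSION_NAME = re.compile(r"^session_(\d+)$")


def pid_is_alive(pid):
    """Return True if a process with this PID exists (POSIX only)."""
    try:
        os.kill(pid, 0)  # signal 0 checks for existence without sending a signal
    except ProcessLookupError:
        return False
    except PermissionError:
        # The process exists but belongs to another user.
        return True
    return True


def find_orphaned_sessions(sessions_root, is_alive=pid_is_alive):
    """List session folders whose owning process is no longer running."""
    root = Path(sessions_root).expanduser()
    if not root.is_dir():
        return []
    orphans = []
    for entry in sorted(root.iterdir()):
        match = SESSION_NAME.match(entry.name)
        if entry.is_dir() and match and not is_alive(int(match.group(1))):
            orphans.append(entry)
    return orphans
```

Calling `find_orphaned_sessions("~/eos/sessions")` would return the candidate folders without deleting anything, which keeps the detection step separate from the riskier removal step. PIDs can be reused by the operating system, so a real implementation would likely need a stronger identity check than the PID alone.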

Documentation

No response

DhanshreeA added the "medium" label (Useful issue or future roadmap, and needs attention) on Jan 15, 2025
Projects
Status: On Hold