Hello, we are currently using the Pios as a shared resource in our group for long-term experiments. This works nicely, however this usually leads to there always being something running on the clusters, making it very difficult to regularly restart the cluster and to allow for the leader database backup to happen. As the experiments are by different people it is also not easily possible for one person to manually stop and start the whole cluster.
Would it be possible to implement a routine that can either back-up the database during running experiments, or to automate a cluster-wide restart with subsequent continuation of all the experiments? Maybe also not fully automated but in a semi-automated / supervised fashion so that it does not go unnoticed if something goes wrong during reboot.
Thanks already!
Kai