Run a shell in the allocated node with salloc #208
By default, salloc will open a new shell in the current node instead
of in the allocated node. This often causes users to leave the extra
shell running once the allocation ends. Repeating this process several
times causes chains of shells.
By running the shell in the remote node, once the allocation ends the
shell finishes as well.
Fixes: #174
See: https://slurm.schedmd.com/faq.html#prompt
Example:
CC @varcila
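For readers following along, the behavior described above corresponds to the Slurm option discussed later in this thread, LaunchParameters=use_interactive_step. A minimal sketch of how it might be set from a NixOS configuration, assuming the cluster uses the stock services.slurm module and its extraConfig option; the actual file and surrounding options changed by this PR are not shown here:

```nix
# Hypothetical NixOS module fragment. Only the LaunchParameters value
# comes from this thread; the placement in extraConfig is an assumption.
{
  services.slurm.extraConfig = ''
    # Make a bare salloc start its shell on the first allocated node
    # instead of on the node where salloc was invoked.
    # LaunchParameters takes a comma-separated list, so any value that is
    # already configured would have to share this single line.
    LaunchParameters=use_interactive_step
  '';
}
```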
@@ -88,7 +88,7 @@ in {
# LaunchParameters=ulimit_pam_adopt will set RLIMIT_RSS in processes
# adopted by the external step, similar to tasks running in regular steps

I would remove or update the comment.
5b041f2339 to 9c622bb6b7

I am having trouble logging into fox, I guess related to LaunchParameters=use_interactive_step:

This is from my machine:

I can log in from inside apex, but not when using it as a proxy. This is not ideal, but I can live with it if there is no other choice :)

Now I can only get a shell when I run salloc without the --no-shell option; I cannot ssh into fox from other shells.

Can confirm that it's broken:
Full output
--no-shell. I get the same result (send packet type 50 (SSH_MSG_USERAUTH_REQUEST) into closed connection) regardless of having allocated the node or not.

This is caused by PAM because it doesn't find the pam_slurm_adopt module. It happens because I forgot to add the slurm package to the override after merging bscpkgs, so it comes with PAM disabled. Should be fixed now:
Note: Using callPackage to do overrides only is not a good idea, as in this case the slurm module uses override to change some options, which would fail if we wrap the original package with another callPackage layer. Using a raw import seems to be a good compromise, so we don't pollute the overlay.nix file.

I can confirm that now I can ssh from other terminals from apex, and from my machine, with and without the --no-shell option.

84b7e316a5 to a7018250ca
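
The note above about callPackage versus a raw import can be illustrated with a small sketch. The file name slurm.nix, its contents, and the overlay layout are assumptions made for illustration and are not taken from this PR. The point is that callPackage attaches a new override function that only accepts the arguments of the wrapped file, while a plain import leaves the package's original override in place.

```nix
# slurm.nix (hypothetical): takes the original package and returns a
# modified one. The empty attribute set stands in for site-specific changes.
{ slurm }:
slurm.overrideAttrs (old: { })
```

```nix
# overlay.nix (sketch, assuming the hypothetical slurm.nix above)
final: prev: {
  # Raw import: the result keeps the .override of prev.slurm, so a module
  # that calls pkgs.slurm.override with the package's own arguments
  # should continue to work.
  slurm = import ./slurm.nix { slurm = prev.slurm; };

  # By contrast, `final.callPackage ./slurm.nix { }` would replace
  # .override with one that only knows the `slurm` argument of slurm.nix,
  # so overriding the package's original options would fail.
}
```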