Login and connection issues #146
Hey,

When you log in to SIRIUS, it stores a token. I assume that your compute nodes share the same user home directory. By default, the token is stored in the SIRIUS config directory in the user home directory.

You can solve this by using a separate "config directory" on each node (or, more precisely, for each running SIRIUS instance). This can be achieved via a command line parameter.

If you want to automate the login per instance without the risk of leaking your credentials in console logs, you can log in via environment variables. In that case, you provide the names of the environment variables where the credentials are stored instead of the actual credentials.

Regarding the login problem, I assume that your IP or account was temporarily banned due to too many failed token requests. If the problem persists, please send me an email with the affected username (email address).
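The per-instance setup suggested in this reply could be prepared along the following lines. This is only a sketch under stated assumptions: the `<hostname>-<pid>` naming scheme, the helper names `instance_config_dir` and `missing_env_vars`, and the variable names `SIRIUS_USER` and `SIRIUS_PASSWORD` are all hypothetical placeholders. The sketch does not call SIRIUS itself; the resulting directory and variable names would still have to be passed to SIRIUS using the parameters described in its documentation.

```python
import os
import socket
import tempfile
from pathlib import Path


def instance_config_dir(base, hostname, pid):
    """Create and return a config directory unique to one process on one node."""
    # Hypothetical naming scheme: <base>/<hostname>-<pid>
    path = Path(base) / f"{hostname}-{pid}"
    path.mkdir(parents=True, exist_ok=True)
    return path


def missing_env_vars(names, environ=os.environ):
    """Return the names that are unset or empty in the given environment."""
    return [name for name in names if not environ.get(name)]


if __name__ == "__main__":
    # Use a scratch location here; on a cluster a node-local path may be preferable.
    base = Path(tempfile.gettempdir()) / "sirius-config"
    config_dir = instance_config_dir(base, socket.gethostname(), os.getpid())
    print(f"per-instance config directory: {config_dir}")

    # Placeholder variable names; only the names (not the values) are ever printed.
    missing = missing_env_vars(["SIRIUS_USER", "SIRIUS_PASSWORD"])
    if missing:
        print(f"credential variables not set: {', '.join(missing)}")
```

Including the process ID as well as the hostname keeps two instances on the same node from sharing a directory, and checking only that the variables exist avoids writing the credentials to any log.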
Hi, we are seeing inconsistent login issues: the SIRIUS session stops recognizing the login after about 40 small-mass jobs, and the remaining jobs fail. We are also trying to submit within the same session across different nodes. We have specified the number of cores in the SIRIUS command and added a 'sleep' line to see whether giving the server a short break prevents possible collisions. Each job runs on 36 cores (the total number of cores on a single node).
Currently the user I'm working with cannot log in at all, despite having logged in successfully earlier today. The login fails repeatedly in both the GUI and the command line.
We were using version 5.8.5 through a conda environment and also the 5.8.6-snapshot binary.
Is the server down or having other issues? Since the login worked earlier today and some jobs ran successfully, we're not sure how to troubleshoot further.