Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

As a user of the integration experiment infrastructure I would like to launch apps that do not utilize the same number of cores across all nodes #2890

Open
dannosliwcd opened this issue Mar 6, 2023 · 0 comments · May be fixed by #2926
Labels
3.1 feature Description of a new feature as a story

Comments

@dannosliwcd
Copy link
Contributor

Is your feature request related to a problem? Please describe.
Some applications have restrictions on the total number of MPI ranks over which they can decompose their computations. For example, several of the NPB benchmarks require power-of-two or square-number rank counts. Depending on the node count and the experiment platform's count of cores per node, some nodes may contain fewer ranks than other nodes.

Right now, the appconf class only supports defining a count of ranks per node, which the experiment launcher multiplies against node count to determine the total rank count. This results in an incorrect total rank count. Although the application is able to execute in that case, GEOPM does not count epochs on nodes that do not check in to an epoch with every assigned rank.

Describe the solution you'd like
I would like for appconfs to be able to declare the total rank count needed for an application. Many apps are fine with the current behavior, so it would also be nice if it is possible for appconfs to omit such a declaration (defaulting to the current nodes*ranks_per_node behavior) so that we don't need to modify existing configs.

@dannosliwcd dannosliwcd added the feature Description of a new feature as a story label Mar 6, 2023
dannosliwcd added a commit to dannosliwcd/geopm that referenced this issue Apr 13, 2023
dannosliwcd added a commit to dannosliwcd/geopm that referenced this issue May 15, 2023
dannosliwcd added a commit to dannosliwcd/geopm that referenced this issue Jul 21, 2023
dannosliwcd added a commit to dannosliwcd/geopm that referenced this issue Jul 21, 2023
dannosliwcd added a commit to dannosliwcd/geopm that referenced this issue Jul 31, 2023
dannosliwcd added a commit to dannosliwcd/geopm that referenced this issue Aug 1, 2023
dannosliwcd added a commit to dannosliwcd/geopm that referenced this issue Apr 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
3.1 feature Description of a new feature as a story
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants