Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

(3.9.0-3.9.1) Default ThreadsPerCore Slurm setting causes reduced CPU utilization #6238

Open
nihitsaxena4 opened this issue May 8, 2024 · 0 comments

Comments

@nihitsaxena4
Copy link

Bug description

ParallelCluster does not explicitly set the ThreadsPerCore for compute node configuration causing Slurm to use the default value of 1. Slurm v23.11 introduced a change that requires the ThreadsPerCore setting to match the threads per physical core of the underlying instance. For compute resources that support hardware multi-threading and it has not been disabled, this will result in CPU under utilization at around 50% (Slurm will never allocate to the secondary virtual cores).

Affected versions (OSes, schedulers)

  • ParallelCluster 3.9.0, 3.9.1
  • Slurm 23.11.4
  • All operating systems supported by ParallelCluster

Mitigation

You can find a detailed explanation and the mitigation of the problem here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants