FSX Timeout when changing permission after mounting #6235
Hi @samagids, sorry for the delay. I hope you were able to fix the issue in the meantime. I looked at the attached logs, but I cannot see any error related to mounting the FSx volume.
In this step the Then I see you have an I removed the attached files from the GitHub issue because the logs contained details about your subnets, VPCs, policies, AD settings, etc. If you have an active AWS support contract, please open a case with the AWS Premium Support team, using the documentation below, to report the issue. Enrico
I saw you're passing an existing FileSystemId for FSx. An important thing to check is the security groups, to ensure the nodes are able to mount the file system. As stated in the documentation:
BTW I'd suggest to use Anyway from the |
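One quick way to sanity-check the security group advice above is to test TCP reachability from a cluster node to the file system. The sketch below is only an illustration: it assumes the file system is FSx for Lustre (whose service listens on TCP port 988), and the DNS name in the usage note is a placeholder, not a value from this issue.

```python
import socket


def tcp_port_open(host, port, timeout_s=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout_s."""
    try:
        # create_connection resolves the host and attempts a TCP handshake.
        with socket.create_connection((host, port), timeout=timeout_s):
            return True
    except OSError:
        # Covers refused connections, timeouts, and DNS resolution failures.
        return False
```

For example, `tcp_port_open("fs-EXAMPLE.fsx.us-east-1.amazonaws.com", 988)` returning False from the head node would point at a security group or routing problem rather than at the permission change. A True result only shows that the port is reachable; it does not prove the mount itself will succeed.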
During the build, the cluster head node mounts the /scratch FSx volume but hangs while changing the permissions from the original mode to 0777 and the ownership to root. It times out after 600 seconds, causing a cluster create failure. I increased HeadNodeBootstrapTimeout, but that did not help. It looks like there is a hard-coded FSx mount timeout of 600 seconds. Our volume is: 6.8T 2.9T 3.9T 44% /scratch
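To narrow down whether the permission change itself is what exceeds the 600 seconds, the step can be timed by hand on a node where /scratch is mounted. The sketch below is not the code the cluster runs during bootstrap; `run_with_deadline` is a hypothetical helper, and the exact chmod/chown commands used during the build are an assumption.

```python
import subprocess
import time


def run_with_deadline(cmd, timeout_s):
    """Run cmd and return (finished, elapsed_seconds).

    finished is False if the command was still running after timeout_s,
    in which case subprocess.run kills it before raising TimeoutExpired.
    """
    start = time.monotonic()
    try:
        subprocess.run(cmd, timeout=timeout_s)
        finished = True
    except subprocess.TimeoutExpired:
        finished = False
    return finished, time.monotonic() - start
```

Running, for instance, `run_with_deadline(["chmod", "0777", "/scratch"], 600)` and `run_with_deadline(["chown", "root", "/scratch"], 600)` as root shows how long each call takes outside the bootstrap. If either one alone gets close to the deadline, the file system is responding slowly to metadata operations, and raising HeadNodeBootstrapTimeout would not be expected to help.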