-
Notifications
You must be signed in to change notification settings - Fork 10
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Support recipes and scheduler in Hyperpod CLI (#41)
* add recipes feature for distributed training * improve unit test coverage for recipes feature * add support recipes along with command line args * add recipes * Crescendo helm chart for role and rolebinding (#17) * update the helm chart to create team level roles and bindings * revert unrelated changes * Rename quotaAllocationTarget to computeQuotaTarget * remove kueue related resources from helm chart * Remove parameters of kueue from chart * flip the team role creation to false * Revise readme to add instructions to create the role and binding * add changelog for distributed training * change to public submodules * QuotaAllocation support for Hyperpod CLI (#12) * QuotaAllocation support for Hyperpod CLI --------- Co-authored-by: Amazon GitHub Automation <54958958+amazon-auto@users.noreply.github.com> Co-authored-by: Song Jiang <jiangsongbz@gmail.com> Co-authored-by: Baiyang Li <baiyanl@amazon.com> Co-authored-by: baiyli <105086653+baiyli@users.noreply.github.com> * Remove custom_launcher folder * sync with mainline --------- Co-authored-by: cansun <80425164+can-sun@users.noreply.github.com> Co-authored-by: Amazon GitHub Automation <54958958+amazon-auto@users.noreply.github.com> Co-authored-by: Song Jiang <jiangsongbz@gmail.com> Co-authored-by: Baiyang Li <baiyanl@amazon.com> Co-authored-by: baiyli <105086653+baiyli@users.noreply.github.com> Co-authored-by: Can Sun <sucan@amazon.com>
- Loading branch information
1 parent
87f5660
commit bb25aed
Showing
69 changed files
with
3,475 additions
and
2,392 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -17,3 +17,7 @@ __pycache__/ | |
|
||
/doc/_apidoc/ | ||
/build | ||
|
||
# Ignore all contents of result and results directories | ||
/result/ | ||
/results/ |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,4 +1,3 @@ | ||
[submodule "src/hyperpod_cli/custom_launcher/launcher/nemo/nemo_framework_launcher"] | ||
path = src/hyperpod_cli/custom_launcher/launcher/nemo/nemo_framework_launcher | ||
url = https://github.com/NVIDIA/NeMo-Framework-Launcher.git | ||
branch = 3d41c31 | ||
[submodule "src/hyperpod_cli/sagemaker_hyperpod_recipes"] | ||
path = src/hyperpod_cli/sagemaker_hyperpod_recipes | ||
url = https://github.com/aws/sagemaker-hyperpod-recipes.git |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file was deleted.
Oops, something went wrong.
10 changes: 0 additions & 10 deletions
10
helm_chart/HyperPodHelmChart/charts/kueue/templates/priority-class.yaml
This file was deleted.
Oops, something went wrong.
17 changes: 0 additions & 17 deletions
17
helm_chart/HyperPodHelmChart/charts/kueue/templates/queue.yaml
This file was deleted.
Oops, something went wrong.
This file was deleted.
Oops, something went wrong.
5 changes: 5 additions & 0 deletions
5
helm_chart/HyperPodHelmChart/charts/team-role-and-bindings/Chart.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
apiVersion: v2 | ||
name: team-role-and-bindings | ||
version: 0.1.0 | ||
appVersion: 1.0 | ||
description: This chart installs the namespaced and cluster roles and bindings for team members |
Oops, something went wrong.