Refresh capacity reservations in standalone EC2 launch cmd #4271

Merged: 7 commits, Sep 20, 2024

Conversation

arjkesh
Contributor

@arjkesh arjkesh commented Sep 20, 2024

GitHub Issue #, if available:

Note:

  • If merging this PR should also close the associated Issue, please also add that Issue # to the Linked Issues section on the right.

  • All PRs are checked weekly for staleness. This PR will be closed if not updated in 30 days.

Description

  • Refresh capacity reservations in the standalone EC2 launch command, and wait a random interval before each retry
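
For readers who want a picture of the pattern described above, here is a minimal sketch, not the code from this PR. It assumes two hypothetical callables supplied by the caller: `fetch_reservations`, which re-queries the currently available capacity reservations, and `try_launch`, which returns a launched instance or `None` when a reservation has no capacity. The function name, parameter names, and default wait bounds are all illustrative assumptions.

```python
import random
import time
from typing import Callable, Iterable, Optional, TypeVar

R = TypeVar("R")  # a capacity reservation (shape is an assumption)
T = TypeVar("T")  # whatever a successful launch returns


def launch_with_refreshed_reservations(
    fetch_reservations: Callable[[], Iterable[R]],
    try_launch: Callable[[R], Optional[T]],
    max_attempts: int = 5,
    min_wait: float = 1.0,
    max_wait: float = 10.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Retry a launch, refreshing reservations before every attempt.

    All names here are hypothetical; only the overall pattern
    (refresh, try, wait a random interval, retry) comes from the PR text.
    """
    for attempt in range(1, max_attempts + 1):
        # Re-query on every attempt so a stale reservation list is never reused.
        for reservation in fetch_reservations():
            result = try_launch(reservation)
            if result is not None:
                return result
        if attempt < max_attempts:
            # A random wait spreads out concurrent jobs competing for capacity.
            sleep(random.uniform(min_wait, max_wait))
    raise RuntimeError(f"no capacity after {max_attempts} attempts")
```

Passing `sleep` as a parameter keeps the sketch testable without real delays. The random interval is there so that parallel test jobs do not all retry at the same moment.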

Tests run

  • Ensure test launches on p5 (07df12a)

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license. I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

…ig.toml']

pytorch/training/buildspec-2-4-ec2.yml:
'Overrode tags on buildspec file'

dlc_developer_config.toml:
{   'build': {   'build_frameworks': ['pytorch'],
                 'build_inference': False,
                 'build_training': True},
    'buildspec_override': {   'dlc-pr-pytorch-training': 'pytorch/training/buildspec-2-4-ec2.yml'},
    'dev': {   'deep_canary_mode': False,
               'graviton_mode': False,
               'neuronx_mode': False},
    'test': {   'ec2_tests': True,
                'ecs_tests': False,
                'eks_tests': False,
                'sagemaker_local_tests': False,
                'sagemaker_remote_tests': False,
                'sanity_tests': False}}
@aws-deep-learning-containers-ci aws-deep-learning-containers-ci bot added the build, ec2, pytorch, Size:S, and test labels Sep 20, 2024
@aws-deep-learning-containers-ci aws-deep-learning-containers-ci bot added the Size:XS label Sep 20, 2024
@arjkesh arjkesh marked this pull request as ready for review September 20, 2024 22:31
@arjkesh arjkesh requested a review from a team as a code owner September 20, 2024 22:31
@arjkesh arjkesh enabled auto-merge (squash) September 20, 2024 22:31
@arjkesh arjkesh merged commit 2ca5091 into aws:master Sep 20, 2024
28 checks passed
Labels
  • build: Reflects file change in build folder
  • ec2: Reflects file change in dlc_tests/ec2 folder
  • pytorch: Reflects file change in pytorch folder
  • Size:S: Determines the size of the PR
  • Size:XS: Determines the size of the PR
  • test: Reflects file change in test folder
2 participants