Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CustomSageMakerEndpoint: Add autoscaling, value for min/max instances #424

Open
2 tasks
airmonitor opened this issue May 3, 2024 · 0 comments
Open
2 tasks
Labels
enhancement New feature or request needs-triage This issue or PR still needs to be triaged.

Comments

@airmonitor
Copy link

Describe the feature

The current solution doesn't allow to specify the minimum and maximum instance number.
It deploys one instance only, without any scaling policy.

Use Case

Due to heavy instance load, the autoscaling group should perform scaling out to meet new demand.

Proposed Solution

two additional parameters max_instances, min_instances

Other Information

No response

Acknowledgements

  • I may be able to implement this feature request
  • This feature might incur a breaking change
@airmonitor airmonitor added the needs-triage This issue or PR still needs to be triaged. label May 3, 2024
@krokoko krokoko added the enhancement New feature or request label May 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request needs-triage This issue or PR still needs to be triaged.
Projects
None yet
Development

No branches or pull requests

2 participants