Skip to content

Resources and autoscaling fields in BentoDeployment CRD meaning. #3822

Answered by parano
dzhelonkin asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @dzhelonkin - the spec.resources means the resource allocated for the API servers, essentially the pod spec for running users' service API code. Whereas spec.runners[].resources are pod specs for Runners used in the service. E.g. if you have a model that uses GPU, you should put the GPU resource under spec.runners[].resources. The same applies to the autoscaling field. Note that when deploying BentoDeployment CR with Yatai, runners and API server will be distributed across different pods allowing it to scale and utilize resources more efficiently.

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@dzhelonkin
Comment options

Answer selected by dzhelonkin
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants