Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

New workflows may require changes to tf-serving and consumer settings. #356

Open
willgraf opened this issue Jun 9, 2020 · 0 comments
Open
Labels
wontfix This will not be worked on

Comments

@willgraf
Copy link
Contributor

willgraf commented Jun 9, 2020

Describe the bug
The current cluster configuration has been well tested on a NVIDIA V100 GPU and on a typical segmentation workflow. However, depending on the model and the hardware used in future clusters, there are a few settings that may need to be tweaked.

tf-serving

  • MAX_BATCH_SIZE: The maximum number of batches that tf-serving will process in a given duty cycle. If the job is using very large input tensors, this batch size may need to be decreased
  • MAX_ENQUEUED_BATCHES: The number of batches that will sit in the work queue waiting to be processed. If the requests have a very large payload tf-serving to be evicted due to memory issues, and this parameter should be decreased.

redis-consumer

  • TF_MAX_BATCH_SIZE: The number of batches to send to the model server. This value MUST be less than or equal to MAX_BATCH_SIZE above and may need to be altered for future workflows.
  • GRPC_TIMEOUT: The length of time to wait for a gRPC inference request. If a model's inference time is quite slow, this may need to be adjusted to prevent timeouts.

Additional context
For more notes on the interplay between these settings and the hardware itself, please review this related issue.

@willgraf willgraf added the bug Something isn't working label Jun 9, 2020
@willgraf willgraf added the wontfix This will not be worked on label Oct 6, 2020
@willgraf willgraf removed the bug Something isn't working label Dec 9, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
wontfix This will not be worked on
Projects
None yet
Development

No branches or pull requests

1 participant