Soft delete server on downscale #154

aniketmaurya · 2022-10-07T04:12:06Z

In the case of downscaling the servers, we don't give any grace time to process the backlogs it might have.
This results in the loss of some client requests.

A better way would be to soft delete the server first, wait for a minute, and finally, stop the LightningWork.

Soft delete: just remove the server from the LoadBalancer.servers list but don't stop the ModelServing work.

Here is how Kubernetes do it.

IMO, the best way to do this would be overriding the on_exit method.

The text was updated successfully, but these errors were encountered:

aniketmaurya added the enhancement New feature or request label Oct 7, 2022

aniketmaurya self-assigned this Oct 13, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Soft delete server on downscale #154

Soft delete server on downscale #154

aniketmaurya commented Oct 7, 2022 •

edited

Soft delete server on downscale #154

Soft delete server on downscale #154

Comments

aniketmaurya commented Oct 7, 2022 • edited

aniketmaurya commented Oct 7, 2022 •

edited