Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AWS Batch #41

Open
quazzuk opened this issue Jun 2, 2019 · 3 comments
Open

AWS Batch #41

quazzuk opened this issue Jun 2, 2019 · 3 comments

Comments

@quazzuk
Copy link

quazzuk commented Jun 2, 2019

Hi

Great project!

How about adding support for training using AWS Batch? Basically, I'm looking for a setup where I can develop and test locally then deploy using AWS Batch on spot instances. Do you think this functionality would be a good fit for Spotty or am I best starting something from scratch?

Thanks
Andrew

@apls777
Copy link
Collaborator

apls777 commented Jun 2, 2019

Hi Andrew,

Thanks!

I think it's a great idea to add AWS Batch support. The idea behind Spotty is to have an abstraction over any "provider". It can be instances from different cloud providers, services like AWS Batch or AWS ParallelCluster, or just any machine accessible through SSH.

My next goal is to add support for Google Cloud, so right now I don't have time to work on this functionality, but if you want to make a contribution, it would be great :). If you're interested, we could discuss implementation details further.

Best regards,
Oleg

@tekumara
Copy link

Interesting idea! What would AWS Batch give you over the current AWS capability?

@apls777
Copy link
Collaborator

apls777 commented Jul 27, 2020

@tekumara An AWS Batch job you can just run and forget about it. The instance will be terminated automatically once the job is done. The current functionality assumes that you start and stop an instance manually. So, when you just want to train your model, it might be more convenient to do it with a single command, and not be afraid that you will forget to stop the instance after. Also, I think AWS Batch can run jobs on Spot instances in a way that it waits for them if they're not available at the moment.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants