Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Misc]: Building docker container requires insane amount of memory #350

Open
mrseeker opened this issue Mar 21, 2024 · 7 comments
Open

[Misc]: Building docker container requires insane amount of memory #350

mrseeker opened this issue Mar 21, 2024 · 7 comments

Comments

@mrseeker
Copy link

Anything you want to discuss about Aphrodite.

I am trying to build a custom version of Aphrodite, however during the build of the Aphrodite engine with docker I need an insane amount of Memory and CPU. Is there a way to reduce this?

I already tried setting "MAX_JOBS=1" but that did not help.

@AlpinDale
Copy link
Member

Where did you set the MAX_JOBS variable? It should be set in the Dockerfile right before the build command towards the end.

@mrseeker
Copy link
Author

I tried to set it at line 30 in the Dockerfile, but it still receives "Killed" by the OOM Killer.

@AlpinDale
Copy link
Member

Perhaps it would be best to pull the aphrodite package from pypi instead of building it in the docker. pip install aphrodite-engine==0.5.1 should do it.

@mrseeker
Copy link
Author

I don't think the Aphrodite package supports custom-made AWS endpoints...

@puppetm4st3r
Copy link

with MAX_JOBS 2 compile ok with 64gb of ram

@AlpinDale
Copy link
Member

Ah right this reminds me, @mrseeker , we build for all GPU architectures which may take more time and use more memory. You can try getting rid of the export for torch cuda arch list, that'll probably help.

@mrseeker
Copy link
Author

Found out that if I changed the arch list to just include the arch that I need, then it's scaling down to almost 90Gb when compiling...

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants