Serverless Endpoint Can't Run Due to Insufficient Space #4665

Open
JamesBowerXanda opened this issue May 8, 2024 · 0 comments
Labels
bug component: hosting Relates to the SageMaker Hosting Platform

Comments

JamesBowerXanda commented May 8, 2024

Describe the bug
I am trying to run a serverless endpoint, but endpoint creation always fails while it is installing dependencies. I understand that serverless endpoints do not have much space, but I provisioned the full 6 GB and it hasn't even gotten as far as downloading the model.
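To narrow down where the space runs out, it can help to log free disk space at container startup. The following is a minimal sketch, assuming it is called early in a custom inference script; the function name and the list of paths are hypothetical choices for illustration, not something SageMaker requires.

```python
import shutil


def free_space_report(paths=("/", "/tmp", "/opt/ml")):
    """Return {path: (total_mb, used_mb, free_mb)} for each path that exists.

    The default paths are illustrative guesses at locations worth checking;
    adjust them to wherever pip and the model artifacts actually write.
    """
    mb = 1024 * 1024
    report = {}
    for path in paths:
        try:
            usage = shutil.disk_usage(path)
        except OSError:
            # Path is missing or not accessible in this container; skip it.
            continue
        report[path] = (usage.total // mb, usage.used // mb, usage.free // mb)
    return report


if __name__ == "__main__":
    for path, (total, used, free) in free_space_report().items():
        print(f"{path}: total={total} MB used={used} MB free={free} MB")
```

Printing this before and after the dependency install would show whether the failure is caused by the filesystem filling up rather than by the configured memory size.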

To reproduce
Create a SageMaker serverless endpoint with the following configuration:

IMAGE:

763104351884.dkr.ecr.eu-west-2.amazonaws.com/pytorch-inference:2.2.0-cpu-py310-ubuntu20.04-sagemaker

REQUIREMENTS:

torchaudio==2.2.2
sox==1.5.0
huggingface_hub>=0.8.0
hyperpyyaml>=0.0.1
joblib>=0.14.1
numpy>=1.17.0
packaging
pandas>=1.0.1
pre-commit>=2.3.0
pygtrie>=2.1,<3.0
scipy>=1.4.1,<1.13.0
sentencepiece>=0.1.91
SoundFile; sys_platform == 'win32'
torch>=1.9.0,<=2.2.2
tqdm>=4.42.0
transformers>=4.30.0
speechbrain==1.0.0

Alternatively, you could reduce this to the following, but the others will be installed as dependencies anyway:

torchaudio==2.2.2
sox==1.5.0
speechbrain==1.0.0

MEMORY:

6GB
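The endpoint in this report was created through the AWS Console, but the same configuration can be expressed with boto3 for anyone who wants to reproduce it from code. This is a sketch under assumptions: the model, endpoint config, and endpoint names are hypothetical placeholders, the SageMaker model (built from the image and requirements above) is assumed to exist already, and `MaxConcurrency=1` is an arbitrary choice that the issue does not specify.

```python
VALID_MEMORY_SIZES_MB = (1024, 2048, 3072, 4096, 5120, 6144)


def build_serverless_endpoint_config(model_name, config_name,
                                     memory_mb=6144, max_concurrency=1):
    """Build the request for SageMaker's create_endpoint_config call.

    memory_mb=6144 corresponds to the 6 GB setting used in this issue.
    """
    if memory_mb not in VALID_MEMORY_SIZES_MB:
        raise ValueError(f"memory_mb must be one of {VALID_MEMORY_SIZES_MB}")
    return {
        "EndpointConfigName": config_name,
        "ProductionVariants": [
            {
                "VariantName": "AllTraffic",
                "ModelName": model_name,
                "ServerlessConfig": {
                    "MemorySizeInMB": memory_mb,
                    "MaxConcurrency": max_concurrency,
                },
            }
        ],
    }


def create_serverless_endpoint(request, endpoint_name, region="eu-west-2"):
    """Create the endpoint config and the endpoint (needs AWS credentials)."""
    import boto3  # imported here so building the request works without boto3

    client = boto3.client("sagemaker", region_name=region)
    client.create_endpoint_config(**request)
    client.create_endpoint(
        EndpointName=endpoint_name,
        EndpointConfigName=request["EndpointConfigName"],
    )


if __name__ == "__main__":
    # Hypothetical names; replace with the real model and endpoint names.
    request = build_serverless_endpoint_config(
        "speechbrain-model", "speechbrain-serverless-config"
    )
    print(request)
```

Keeping the request builder separate from the AWS call makes it possible to inspect or log the exact configuration without touching an account.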

Expected behavior
Serverless endpoint is created

Screenshots or logs
[image: screenshot attached to the original issue, not reproduced here]

System information
A description of your system. Please provide:

  • SageMaker Python SDK version: Used AWS Console
  • Framework name (eg. PyTorch) or algorithm (eg. KMeans): PyTorch, SpeechBrain (speechbrain/spkrec-ecapa-voxceleb)
  • Framework version: SpeechBrain (1.0.0)
  • Python version: 3.10
  • CPU or GPU: CPU
  • Custom Docker image (Y/N): N

Additional context
On my local machine, a virtual environment with the packages listed in the requirements.txt file takes up 842 MB.
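A figure like this can be reproduced and broken down per package with a short script. The sketch below uses only the standard library; the function names are hypothetical, and it measures whichever environment it runs in. The same requirements can resolve to different wheels on another platform, so a local measurement may not match what gets installed inside the Linux container.

```python
import os
import sysconfig


def directory_size_mb(root):
    """Sum the sizes of all regular files under root, in MB."""
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if not os.path.islink(path):  # do not count symlinks
                total += os.path.getsize(path)
    return total / (1024 * 1024)


def largest_entries(root, top=10):
    """Return the `top` largest direct children of root as (name, size_mb)."""
    sizes = []
    for entry in os.scandir(root):
        if entry.is_dir(follow_symlinks=False):
            size = directory_size_mb(entry.path)
        elif entry.is_file(follow_symlinks=False):
            size = entry.stat().st_size / (1024 * 1024)
        else:
            continue
        sizes.append((entry.name, size))
    return sorted(sizes, key=lambda item: item[1], reverse=True)[:top]


if __name__ == "__main__":
    # site-packages of the interpreter running this script
    site_packages = sysconfig.get_paths()["purelib"]
    print(f"{site_packages}: {directory_size_mb(site_packages):.0f} MB total")
    for name, size in largest_entries(site_packages):
        print(f"  {size:8.1f} MB  {name}")
```

Running it inside the virtual environment lists the packages that account for most of the footprint, which is a starting point for comparing against what the container would need to install.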

@benieric benieric added the component: hosting Relates to the SageMaker Hosting Platform label May 10, 2024