Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PyTorch dockerfile failed to build #6544

Closed
sjiang17 opened this issue Apr 12, 2018 · 3 comments
Closed

PyTorch dockerfile failed to build #6544

sjiang17 opened this issue Apr 12, 2018 · 3 comments

Comments

@sjiang17
Copy link

sjiang17 commented Apr 12, 2018

  • PyTorch or Caffe2: PyTorch
  • OS: Mac 10.13.4
  • PyTorch version:
  • How you installed PyTorch (conda, pip, source):
  • Python version:
  • CUDA/cuDNN version: N/A
  • GPU models and configuration: N/A
  • GCC version (if compiling from source):
  • CMake version:
  • Build command you used (if compiling from source):
  • Versions of any other relevant libraries:

I have tried to build a docker image with
git clone https://github.com/pytorch/pytorch.git
and
docker build -t pytorch-gpu -f docker/pytorch/Dockerfile .

The build failed with the following msg:

......
[ 18%] Building NVCC (Device) object src/ATen/CMakeFiles/ATen.dir//THC/ATen_generated_THCTensorConv.cu.o
[ 18%] Building NVCC (Device) object src/ATen/CMakeFiles/ATen.dir/
/THC/ATen_generated_THCTensorRandom.cu.o
[ 19%] Building NVCC (Device) object src/ATen/CMakeFiles/ATen.dir//THC/ATen_generated_THCTensorScatterGather.cu.o
[ 19%] Building NVCC (Device) object src/ATen/CMakeFiles/ATen.dir/
/THC/ATen_generated_THCTensorTopK.cu.o
[ 19%] Building NVCC (Device) object src/ATen/CMakeFiles/ATen.dir//THC/ATen_generated_THCTensorSort.cu.o
[ 19%] Building NVCC (Device) object src/ATen/CMakeFiles/ATen.dir/
/THC/ATen_generated_THCTensorTypeUtils.cu.o
[ 19%] Building NVCC (Device) object src/ATen/CMakeFiles/ATen.dir//THC/ATen_generated_THCSortUtils.cu.o
[ 20%] Building NVCC (Device) object src/ATen/CMakeFiles/ATen.dir/
/THC/ATen_generated_THCTensorMode.cu.o
The command '/bin/sh -c TORCH_CUDA_ARCH_LIST="3.5 5.2 6.0 6.1 7.0+PTX" TORCH_NVCC_FLAGS="-Xfatbin -compress-all" CMAKE_PREFIX_PATH="$(dirname $(which conda))/../" pip install -v .' returned a non-zero code: 137

Has anyone encountered the same problem & is there any solution?
Thank you!

@apaszke
Copy link
Contributor

apaszke commented Apr 12, 2018

137 means that the process received a SIGKILL signal, which happens if you you e.g. run out of memory. How much RAM do you have in your machine?

@apaszke
Copy link
Contributor

apaszke commented Apr 12, 2018

Also, you can try some of the workarounds proposed here (increase the threshold for max mem use in Docker): moby/moby#21083

@sjiang17
Copy link
Author

Thank you

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants