This repository was archived by the owner on Jan 22, 2024. It is now read-only.

Debian 10 (Buster) error response from daemon: Unknown runtime specified nvidia. / OCI runtime create failed. #1056

@svdHero

1. Issue or feature description

System information

  • OS: Debian GNU/Linux 10 (buster) x86_64
  • Kernel: 4.19.0-5-amd64
  • CPU: Intel i7-6700 (8) @ 3.400GHz
  • GPU 1: Intel HD Graphics 530
  • GPU 2: NVIDIA GeForce RTX 2070
  • Docker: 19.03.1, build 74b1e89

Problem description
I followed the Quickstart documentation to set up Docker with GPU support on Debian Buster. However, when I run the verification container, I get only the following error message:

svdhero@ml-box-pmt:~$ docker run --runtime=nvidia --rm nvidia/cuda nvidia-smi
docker: Error response from daemon: Unknown runtime specified nvidia.

I also tried the same command without the --runtime flag:

svdhero@ml-box-pmt:~$ docker run --rm nvidia/cuda nvidia-smi
docker: Error response from daemon: OCI runtime create failed: container_linux.go:345: starting container process caused "exec: \"nvidia-smi\": executable file not found in $PATH": unknown.

This fails as well, with a different error.
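The first error means the daemon has no runtime registered under the name nvidia. One way to confirm this is to list the runtimes the daemon reports. The sketch below assumes the Runtimes field of docker info is printed as JSON via --format; has_runtime is a hypothetical helper, not part of Docker or the NVIDIA tooling.

```shell
#!/bin/sh
# has_runtime NAME: succeed if NAME appears as a key in the JSON read from stdin.
# Hypothetical helper for illustration only.
has_runtime() {
    grep -q "\"$1\":"
}

# Usage on a host with Docker installed:
#   docker info --format '{{json .Runtimes}}' | has_runtime nvidia \
#       && echo "nvidia runtime registered" \
#       || echo "nvidia runtime NOT registered"
```

If only runc is listed, --runtime=nvidia cannot work until a runtime with that name is registered with the daemon. Docker 19.03 also provides the --gpus flag (for example --gpus all) as a separate way to request GPUs.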

Previously, I installed my NVIDIA drivers successfully via

sudo apt install nvidia-driver

as one can see here:

svdhero@ml-box-pmt:~$ nvidia-smi 
Fri Aug 23 13:01:51 2019       
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 418.74       Driver Version: 418.74       CUDA Version: N/A      |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  GeForce RTX 2070    On   | 00000000:01:00.0 Off |                  N/A |
|  0%   39C    P8     3W / 175W |      0MiB /  7952MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
                                                                               
+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+

I also installed docker successfully, as one can see here:

svdhero@ml-box-pmt:~$ docker --version
Docker version 19.03.1, build 74b1e89

svdhero@ml-box-pmt:~$ docker run --rm hello-world

Hello from Docker!
This message shows that your installation appears to be working correctly.

To generate this message, Docker took the following steps:
 1. The Docker client contacted the Docker daemon.
 2. The Docker daemon pulled the "hello-world" image from the Docker Hub.
    (amd64)
 3. The Docker daemon created a new container from that image which runs the
    executable that produces the output you are currently reading.
 4. The Docker daemon streamed that output to the Docker client, which sent it
    to your terminal.

To try something more ambitious, you can run an Ubuntu container with:
 $ docker run -it ubuntu bash

Share images, automate workflows, and more with a free Docker ID:
 https://hub.docker.com/

For more examples and ideas, visit:
 https://docs.docker.com/get-started/

Finally, I installed nvidia-container-toolkit via:

distribution=$(. /etc/os-release;echo $ID$VERSION_ID)
curl -s -L https://nvidia.github.io/nvidia-docker/gpgkey | sudo apt-key add -
curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.list | sudo tee /etc/apt/sources.list.d/nvidia-docker.list

sudo apt-get update && sudo apt-get install -y nvidia-container-toolkit
sudo systemctl restart docker

which seemed to have been successful:

svdhero@ml-box-pmt:~$ sudo apt search nvidia-container-toolkit
Sorting... Done
Full Text Search... Done
nvidia-container-toolkit/buster,now 1.0.3-1 amd64 [installed]
  NVIDIA container runtime hook
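For reference, the distribution variable in the install commands above is built from the ID and VERSION_ID fields of /etc/os-release, and it selects which repository list is downloaded. The sketch below reproduces that expansion for an arbitrary file; distribution_of is a hypothetical helper introduced only for illustration.

```shell
#!/bin/sh
# distribution_of FILE: print ID followed by VERSION_ID as defined in FILE.
# The file is sourced in a subshell so its variables do not leak into the caller.
# Hypothetical helper, mirroring the one-liner used in the install commands.
distribution_of() {
    ( . "$1" && printf '%s%s\n' "$ID" "$VERSION_ID" )
}

# Usage:
#   distribution_of /etc/os-release
# On Debian 10 (buster) this is expected to print: debian10
```

Checking this value first makes it easy to see which repository URL the curl command will request.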

However, as stated at the beginning, I get the "Unknown runtime specified nvidia" error.
This is a brand-new Debian install with no legacy packages installed.

2. Steps to reproduce the issue

  1. Install the latest Docker version on Debian 10 Buster.
  2. Install the latest nvidia-container-toolkit.
  3. Run the command docker run --runtime=nvidia --rm nvidia/cuda nvidia-smi
  4. Get an "Unknown runtime specified nvidia" error.

3. Information to attach (optional if deemed irrelevant)

See attached text file nvidia_system_information.txt containing:

  • Some nvidia-container information: nvidia-container-cli -k -d /dev/tty info
  • Kernel version from uname -a
  • Any relevant kernel output lines from dmesg
  • Driver information from nvidia-smi -a
  • Docker version from docker version
  • NVIDIA packages version from dpkg -l '*nvidia*'
  • NVIDIA container library version from nvidia-container-cli -V
  • NVIDIA container library logs (see troubleshooting)
  • Docker command, image and tag used
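The non-interactive items in the list above can be gathered into one file with a small script. This is a minimal sketch: run_section and collect_nvidia_info are hypothetical helpers, the default output name simply matches the attachment mentioned above, and dmesg and the nvidia-container-cli debug command are left out because they may need root or a terminal.

```shell
#!/bin/sh
# run_section FILE TITLE CMD...: append a titled section containing CMD's
# output (stdout and stderr) to FILE. Hypothetical helper for this sketch.
run_section() {
    out=$1; title=$2; shift 2
    {
        printf '=== %s ===\n' "$title"
        if command -v "$1" >/dev/null 2>&1; then
            "$@" 2>&1 || printf '(command exited with status %s)\n' "$?"
        else
            printf '(%s not found)\n' "$1"
        fi
        printf '\n'
    } >>"$out"
}

# collect_nvidia_info [FILE]: write the report, replacing any existing FILE.
collect_nvidia_info() {
    out=${1:-nvidia_system_information.txt}
    : >"$out"
    run_section "$out" "Kernel version" uname -a
    run_section "$out" "Driver information" nvidia-smi -a
    run_section "$out" "Docker version" docker version
    run_section "$out" "NVIDIA packages" dpkg -l '*nvidia*'
    run_section "$out" "NVIDIA container library" nvidia-container-cli -V
}

# Usage:
#   collect_nvidia_info            # writes nvidia_system_information.txt
#   collect_nvidia_info report.txt
```

A missing tool is recorded in the report rather than aborting the script, so the file still shows which components are absent.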
