This repository has been archived by the owner on Jan 22, 2024. It is now read-only.

Example of nvidia-docker2 with docker-compose #568

Closed
silent-vim opened this issue Dec 12, 2017 · 10 comments

@silent-vim

1. Issue or feature description

Hi, I am getting started with nvidia-docker. I have installed the dependencies and can see the output of nvidia-smi when run through nvidia-docker. Now I am trying to figure out how to use this from a docker-compose file. I went through the documentation but could not find any example of specifying the runtime. It would be great if you could provide some pointers on this.

Most of the tutorials I found also only work with nvidia-docker v1. Example: http://collabnix.com/deploying-application-in-the-gpu-accelerated-data-center-using-docker/

2. Steps to reproduce the issue

nvidia-docker run --runtime=nvidia --rm nvidia/cuda nvidia-smi
Tue Dec 12 20:59:58 2017
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 384.90                 Driver Version: 384.90                    |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  Tesla M60           Off  | 0000030C:00:00.0 Off |                  Off |
| N/A   43C    P0    40W / 150W |      0MiB /  8123MiB |      1%      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+

3. Information to attach (optional if deemed irrelevant)

$ nvidia-docker version
NVIDIA Docker: 2.0.1
Client:
 Version:      17.09.1-ce
 API version:  1.32
 Go version:   go1.8.3
 Git commit:   19e2cf6
 Built:        Thu Dec  7 22:24:23 2017
 OS/Arch:      linux/amd64

Server:
 Version:      17.09.1-ce
 API version:  1.32 (minimum version 1.12)
 Go version:   go1.8.3
 Git commit:   19e2cf6
 Built:        Thu Dec  7 22:23:00 2017
 OS/Arch:      linux/amd64
 Experimental: false
@flx42
Member

flx42 commented Dec 12, 2017

It's a work in progress for docker-compose:
docker/compose#5405

In the meantime, you can set our runtime as the default runtime, and it will work.
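With "nvidia" as the default runtime, a completely ordinary compose file is enough; a minimal sketch (the service name is arbitrary, and the image and command are just the ones from your nvidia-smi test above):

version: '2'
services:
  cuda:
    image: nvidia/cuda
    command: nvidia-smi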

@silent-vim
Author

Thanks. I looked at the container that gets created: it has nvcc (I can run the command and get the version details), but nvidia-smi results in "command not found".

$ nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2016 NVIDIA Corporation
Built on Tue_Jan_10_13:22:03_CST_2017
Cuda compilation tools, release 8.0, V8.0.61
airflow@448a3bdde1d7:~$ which nvcc
/usr/local/cuda/bin/nvcc

Also, in /etc/docker/daemon.json I can see the following.

{
    "runtimes": {
        "nvidia": {
            "path": "/usr/bin/nvidia-container-runtime",
            "runtimeArgs": []
        }
    }
}

Is there anything I need to do specifically to make nvidia the default runtime?

@flx42
Member

flx42 commented Dec 12, 2017

Add "default-runtime": "nvidia",

@silent-vim
Author

Awesome @flx42 that worked! Thanks to you and @3XX0 for quick help 👍

airflow@42539f02d3e2:~$ nvidia-smi
Tue Dec 12 23:10:16 2017
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 384.90                 Driver Version: 384.90                    |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  Tesla M60           Off  | 0000030C:00:00.0 Off |                  Off |
| N/A   46C    P0    41W / 150W |      0MiB /  8123MiB |      1%      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+

@ngreenwald89

Where exactly do you place the "default-runtime": "nvidia"? I put it inside the "nvidia" object and still get the same error:

{ "runtimes": { "nvidia": { "path": "nvidia-container-runtime", "default-runtime": "nvidia", "runtimeArgs": [] } } }

@flx42
Member

flx42 commented Sep 7, 2018

@ngreenwald89 there is an example here: https://github.com/NVIDIA/k8s-device-plugin#preparing-your-gpu-nodes
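In short, "default-runtime" goes at the top level of /etc/docker/daemon.json, next to "runtimes", not inside the "nvidia" object; roughly:

{
    "default-runtime": "nvidia",
    "runtimes": {
        "nvidia": {
            "path": "/usr/bin/nvidia-container-runtime",
            "runtimeArgs": []
        }
    }
}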

@carlwain74

carlwain74 commented Jun 21, 2019

@ngreenwald89 For me, the path needed the /usr/bin/ prefix; otherwise it never worked:

root@machine:PerfTest# cat /etc/docker/daemon.json
{
    "default-runtime": "nvidia",
    "runtimes": {
        "nvidia": {
            "path": "/usr/bin/nvidia-container-runtime",
            "runtimeArgs": []
        }
    }
}
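Also remember that dockerd has to be restarted after editing daemon.json before the new default runtime takes effect, e.g. (assuming systemd):

sudo systemctl restart docker
docker run --rm nvidia/cuda nvidia-smi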

@HWiese1980

Is this still a work in progress? More than one and a half years later?

@qhaas

qhaas commented Jul 31, 2019

From my reading of the nvidia-docker documentation, using docker run --runtime nvidia and setting default-runtime to 'nvidia' in '/etc/docker/daemon.json' appear to be functionality of the now-deprecated nvidia-docker2 package. Hopefully, we will get an alternative way of using docker-compose that doesn't require deprecated features:

Note that with the release of Docker 19.03, usage of nvidia-docker2 packages are deprecated since NVIDIA GPUs are now natively supported as devices in the Docker runtime

UPDATE:
Here is what I did to get it working, based on advice in this issue: docker/compose#6691

  1. Uninstalled the deprecated nvidia-docker2 packages (just to be sure, I removed all packages from the NVIDIA container related repositories, then removed the NVIDIA container repos themselves)
  2. Deployed nvidia-container-runtime via its repository and used the systemd override approach (roughly sketched below)
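The override approach boils down to a small systemd drop-in that registers the runtime with dockerd; roughly (a sketch only, the exact dockerd flags and paths depend on your install and distro):

/etc/systemd/system/docker.service.d/override.conf:

[Service]
ExecStart=
ExecStart=/usr/bin/dockerd --host=fd:// --add-runtime=nvidia=/usr/bin/nvidia-container-runtime

followed by a systemctl daemon-reload and a systemctl restart docker.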

The downside is that you have to have root to make these changes, but then again, if someone adds you to the 'docker' group, they likely trust you.

The upside is that we are no longer limited to the docker-compose v2.3 file format.
