
TypeError: Fail to find the dnn implementation. #10634

Closed
rosefun opened this issue Jul 10, 2018 · 24 comments

@rosefun

rosefun commented Jul 10, 2018

Platform: Windows10
Tensorflow Version: 1.7.0(GPU)
Cuda compilation tools, release 9.0, V9.0.176
CUDNN: 7.1.2
Graphic processor: Nvidia Geforce GTX 1050

My code:

from keras.layers import CuDNNLSTM, Bidirectional

lstmsize = 6
lstm0 = CuDNNLSTM(lstmsize, return_sequences=True)

Error:

UnknownError (see above for traceback): Fail to find the dnn implementation.
[[Node: cu_dnngru_1/CudnnRNN = CudnnRNN[T=DT_FLOAT, direction="unidirectional", dropout=0, input_mode="linear_input", is_training=true, rnn_mode="gru", seed=87654321, seed2=0, _device="/job:localhost/replica:0/task:0/device:GPU:0"](cu_dnngru_1/transpose, cu_dnngru_1/ExpandDims_1, cu_dnngru_1/Const_1, cu_dnngru_1/concat)]]
[[Node: loss/mul/_73 = _Recvclient_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device_incarnation=1, tensor_name="edge_618_loss/mul", tensor_type=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:CPU:0"]]

Hoping for help!

@ChristofHenkel

ChristofHenkel commented Jul 14, 2018

I have the same problem under Linux Ubuntu 16.04

Maybe this helps:

https://devtalk.nvidia.com/default/topic/1030610/cuda-setup-and-installation/fail-to-find-the-dnn-implementation-/

@duhaime

duhaime commented Oct 20, 2018

Same problem on Ubuntu 18.04.1 LTS running CUDA V9.0.176 and cuDNN 7.2.1. Ditto on RHEL 7.4 with CUDA V9.0.176 and cuDNN v7 for CUDA 9.0.

@ASH1998

ASH1998 commented Nov 9, 2018

For CUDA 9.0 and cuDNN 7.1.1:
CuDNNLSTM and CuDNNGRU ran successfully, then after some days gave the same error.
Fixed by reinstalling CUDA and cuDNN.

There has to be some better solution. This way is too tiresome and lengthy!

@kyleabeauchamp

kyleabeauchamp commented Feb 26, 2019

I'm also seeing this error on Ubuntu 18.04, RTX 2070, CUDA 10, Keras, and tf-nightly-gpu. I cross-posted on NVIDIA but haven't seen much help there: https://devtalk.nvidia.com/default/topic/1046589/cuda-setup-and-installation/issues-with-tensorflow-on-cuda10-and-rtx2080/

@infinitylogesh

I had the same issue when I updated TensorFlow to 1.12. The error was resolved after updating my cuDNN version from 7 to 7.5. I followed the steps in the URL below to update cuDNN (note: the steps in the link are for installing cuDNN, but the same procedure applies to an update).

https://jhui.github.io/2017/09/07/AWS-P2-CUDA-CuDNN-TensorFlow/

@kyleabeauchamp

I ended up fixing this issue with the allow_growth = True comment on tensorflow/tensorflow#24496
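For anyone landing here, a minimal sketch of that fix, assuming the TF 1.x session API with the Keras backend (adjust to however you create your session):

import tensorflow as tf
from keras import backend as K

# Ask TensorFlow to allocate GPU memory on demand instead of reserving it all
# up front; this is the allow_growth setting referenced above.
config = tf.ConfigProto()
config.gpu_options.allow_growth = True
K.set_session(tf.Session(config=config))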

@shiningliang

shiningliang commented Mar 11, 2019

Platform: Ubuntu 18.04
Tensorflow Version: 1.13.1(GPU)
CUDA: V10.0.130
CUDNN: 7.4.2
GPU: RTX 2080Ti

I got the same error. I had built the graph; the error occurred when initializing variables. When I used tf-nightly-gpu 1.13 I didn't get this error.
I have also set allow_growth = True, but it didn't work.

@oinksterthepig

I got this error while running cuDNN LSTMs. They worked for a while, then they quit working. I did "conda update tensorflow-gpu" and that fixed it. The problem must be somewhere in TensorFlow?

@00krishna

I got this error last night while working on the TensorFlow tutorial "https://www.tensorflow.org/alpha/tutorials/load_data/text". I was using tensorflow-gpu 2.0alpha on an Ubuntu 18.04 x64 machine with Python 3.6. I updated my cuDNN from 7.4 to 7.5.1 and tried to upgrade TensorFlow too, but that did not change anything. I was able to compile the cuDNN samples MNIST network, which is the usual test for a successful install. Just wanted to let you know about the continuing issue.


@cageyoko

I got this error while running cuDNN LSTMs. They worked for a while, then they quit working. I did "conda update tensorflow-gpu" and that fixed it. The problem must be somewhere in TensorFlow?

I also used 'conda update tensorflow-gpu' and it fixed it. Thanks!

@gerlaic

gerlaic commented May 20, 2019

I got this error last night while working on the TensorFlow tutorial "https://www.tensorflow.org/alpha/tutorials/load_data/text". I was using tensorflow-gpu 2.0alpha on an Ubuntu 18.04 x64 machine with Python 3.6. I updated my cuDNN from 7.4 to 7.5.1 and tried to upgrade TensorFlow too, but that did not change anything. I was able to compile the cuDNN samples MNIST network, which is the usual test for a successful install. Just wanted to let you know about the continuing issue.

Reference: tensorflow/tensorflow#20067 (comment)

Have you made sure your GPU is available? If you have any other session running on the same GPU on Windows, you will want to halt and close it.

Try the following snippet to check whether you have a GPU available. This error will occur when there is no available device:

from tensorflow.python.client import device_lib

def get_available_gpus():
    # List the devices TensorFlow can see and keep only the GPUs
    local_device_protos = device_lib.list_local_devices()
    return [x.name for x in local_device_protos if x.device_type == 'GPU']
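For example (hypothetical call; an empty list means TensorFlow cannot see a GPU):

print(get_available_gpus())  # e.g. ['/device:GPU:0']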

@VertexC

VertexC commented Jun 10, 2019

I fixed this issue by upgrading cuDNN from 7.0 to 7.5. I am using CUDA 10.1 and tf-gpu 1.14 on Ubuntu 16.04.

@morningsky

I ended up fixing this issue with the allow_growth = True comment on tensorflow/tensorflow#24496

Thanks! I solved this problem your way.

@FrozenWolf-Cyber

In TensorFlow 2.0 I got the same error while running an RNN LSTM model. The reason was the low version of my cuDNN. The TensorFlow GPU requirements page recommends cuDNN SDK >= 7.4.1; you can refer to https://www.tensorflow.org/install/gpu for more details.
Asked in the TensorFlow Reddit forum: https://www.reddit.com/r/tensorflow/comments/dxnnq2/i_am_getting_an_error_while_running_the_rnn_lstm/?utm_source=share&utm_medium=web2x
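If it helps, here is a quick sketch of checking which CUDA/cuDNN versions your TensorFlow binary expects (assuming TF 2.3+, where tf.sysconfig.get_build_info() is available):

import tensorflow as tf

# Print the CUDA and cuDNN versions this TensorFlow build was compiled against,
# so you can compare them with what is installed on the machine.
info = tf.sysconfig.get_build_info()
print(info.get('cuda_version'), info.get('cudnn_version'))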

@tricyzhou

Maybe you can solve it with "tf.config.experimental.set_memory_growth()"!

@Shekhrozx

Shekhrozx commented Apr 3, 2020

Try this. It works

gpus = tf.config.experimental.list_physical_devices('GPU')
for gpu in gpus:
    tf.config.experimental.set_memory_growth(gpu, True)
tf.config.experimental.set_virtual_device_configuration(
    gpus[0], [tf.config.experimental.VirtualDeviceConfiguration(memory_limit=1024)])

@anisayari

I got the same error after trying to train a model again... and I solved it with the same solution as @Shekhrozx.

@sergio12S

I solved this problem this way:

physical_devices = tf.config.list_physical_devices('GPU')
tf.config.experimental.set_memory_growth(physical_devices[0], enable=True)

@arnabanerji

The recommended format directly from the TF docs in 2.0+ is:

physical_devices = tf.config.list_physical_devices('GPU')
try:
  tf.config.experimental.set_memory_growth(physical_devices[0], True)
except:
  # Invalid device or cannot modify virtual devices once initialized.
  pass

@ChenMalobani

Try this. It works

gpus = tf.config.experimental.list_physical_devices('GPU')
for gpu in gpus:
        tf.config.experimental.set_memory_growth(gpu, True)
tf.config.experimental.set_virtual_device_configuration(gpus[0], [tf.config.experimental.VirtualDeviceConfiguration(memory_limit=1024)])

What should I do? I get:

RuntimeError: Physical devices cannot be modified after being initialized

@Shekhrozx

Try this. It works

gpus = tf.config.experimental.list_physical_devices('GPU')
for gpu in gpus:
        tf.config.experimental.set_memory_growth(gpu, True)
tf.config.experimental.set_virtual_device_configuration(gpus[0], [tf.config.experimental.VirtualDeviceConfiguration(memory_limit=1024)])

what to do?

RuntimeError: Physical devices cannot be modified after being initialized

It seems that you are initializing your GPU two or more times. Please check your code and initialize your GPU only once.
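As a rough sketch of the ordering that avoids this RuntimeError (assuming TF 2.x): the memory-growth call has to run before TensorFlow initializes the GPU, i.e. before any tensor, layer, or model is created:

import tensorflow as tf

# Configure memory growth immediately after import, before anything touches
# the GPU; calling this later raises
# "Physical devices cannot be modified after being initialized".
gpus = tf.config.experimental.list_physical_devices('GPU')
for gpu in gpus:
    tf.config.experimental.set_memory_growth(gpu, True)

# ...only now build and train the model...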

@Harsh188

I got a similar issue with TF 2.4.1. The problem was fixed after I upgraded to TF 2.5.0 with cuDNN 8.1.0 and CUDA 11.2.
