Running the model on TPUs? #55

vessenes · 2019-10-25T23:00:26Z

Hi,

I have the 256 and 512 models working on GCP with a Tesla V100. Text generates, but slowly, and I'd like to speed up generation. I thought running CTRL on TPUs might help, but I have no idea how to do that.

Do you have an incantation or pointer that would let me point CTRL at a TPU?

dimitri320 · 2019-10-26T07:37:06Z

Second this!

keskarnitish · 2019-10-28T17:27:02Z

I haven't quite figured out how to get TPUs to be faster than GPUs for inference. I'll probably look into this soon. It gets especially complicated with top-k/nucleus sampling and other add-ons. It seems others have seen the same behavior.
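
For readers unfamiliar with the sampling add-ons mentioned above, here is a minimal pure-Python sketch of top-k and nucleus (top-p) filtering. It is not CTRL's implementation; the function names `softmax`, `filter_top_k_top_p`, and `sample` are hypothetical and chosen only for illustration. It may help show one likely source of the difficulty: the filtering step sorts the vocabulary and keeps a number of tokens that depends on the data at each step, whereas TPU compilation generally favors fixed shapes.

```python
import math
import random


def softmax(logits):
    """Convert raw scores to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def filter_top_k_top_p(logits, top_k=0, top_p=1.0):
    """Return {token_id: prob} for the tokens that survive filtering.

    top_k=0 disables the top-k cut; top_p=1.0 disables the nucleus cut.
    Hypothetical helper, not taken from the CTRL code base.
    """
    probs = softmax(logits)
    # Rank token ids from most to least likely.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    if top_k > 0:
        ranked = ranked[:top_k]
    kept, cumulative = [], 0.0
    for i in ranked:
        kept.append(i)
        cumulative += probs[i]
        # Nucleus cut: stop once the kept tokens cover top_p of the mass.
        if cumulative >= top_p:
            break
    # Renormalize so the surviving probabilities sum to 1.
    mass = sum(probs[i] for i in kept)
    return {i: probs[i] / mass for i in kept}


def sample(logits, top_k=0, top_p=1.0, rng=random):
    """Draw one token id from the filtered distribution."""
    dist = filter_top_k_top_p(logits, top_k, top_p)
    ids = list(dist)
    return rng.choices(ids, weights=[dist[i] for i in ids], k=1)[0]
```

Calling `sample(logits, top_k=40)` or `sample(logits, top_p=0.9)` picks the next token id from the truncated distribution. Note that the size of the dictionary returned by `filter_top_k_top_p` changes from call to call under nucleus sampling, which is the data-dependent part.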
