Training loss doesn't decrease #2

Open · 5 comments

Uirseita commented Apr 6, 2019

Hi, I tried to run your notebooks. For notebook 2, my training loss is still around 0.19 after 200 epochs, so the resulting model is still pretty bad. The only thing I changed was setting "use_multiprocessing" to False, because it kept raising an error when it was True.

Do you know how to fix this issue?
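For reference, the change in question looks roughly like the sketch below. This is a minimal example assuming the Keras 2.x fit_generator API from that era; the toy model and generator are placeholders, not the notebook's actual code.

```python
# Minimal sketch of disabling multiprocessing in fit_generator.
# The toy model and generator below are placeholders, not the notebook's code.
import numpy as np
from keras.models import Sequential
from keras.layers import Conv2D
from keras.utils import Sequence


class ToyBatches(Sequence):
    """Hypothetical generator yielding random (input, target) image batches."""

    def __init__(self, n_batches=8, batch_size=4):
        self.n_batches = n_batches
        self.batch_size = batch_size

    def __len__(self):
        return self.n_batches

    def __getitem__(self, idx):
        x = np.random.rand(self.batch_size, 32, 32, 3).astype("float32")
        y = np.random.rand(self.batch_size, 32, 32, 3).astype("float32")
        return x, y


model = Sequential([Conv2D(3, 3, padding="same", input_shape=(32, 32, 3))])
model.compile(optimizer="adam", loss="mae")

# workers=1 with use_multiprocessing=False keeps data loading in the main
# process, which avoids the pickling/fork errors that multiprocessing can
# raise on some platforms (e.g. Windows or certain notebook environments).
model.fit_generator(
    ToyBatches(),
    epochs=2,
    workers=1,
    use_multiprocessing=False,
)
```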

@liuxuankai

Me too, the loss doesn't decrease!

@Golbstein
Owner

Use notebook 2.1
The reason it doesn't work in notebooks 1 and 2 is that they train a very small model compared to what we actually want to train.
They're just examples of how to fine-tune a pre-trained model on your internal dataset.
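For context, the fine-tuning pattern referred to here is roughly the standard Keras one. A hedged sketch follows; the weights file name, the number of frozen layers, and the learning rate are placeholders for illustration, not the repo's actual values.

```python
# Generic Keras fine-tuning sketch; 'pretrained_weights.h5' and the frozen-layer
# split are hypothetical placeholders, not files or settings from this repo.
from keras.models import load_model
from keras.optimizers import Adam

model = load_model("pretrained_weights.h5")  # hypothetical path to saved model

# Freeze the earlier layers and re-train only the last couple on the new data.
for layer in model.layers[:-2]:
    layer.trainable = False

# Re-compile after changing trainable flags so the change takes effect,
# typically with a smaller learning rate for fine-tuning.
model.compile(optimizer=Adam(lr=1e-4), loss="mae")

# model.fit_generator(internal_dataset_generator, epochs=...)  # your own data
```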

@Golbstein
Owner

I've made a few changes now. Check it out.

@Uirseita
Author

> I've made a few changes now. Check it out.

Thanks for the response. I was able to run notebook 2.1 on my own PC with a GTX 1080 Ti, but it always crashes with an out-of-memory error on my university's compute cluster, which has a Tesla V100 with 32 GB. Do you know what might cause this, by any chance?

@Golbstein
Owner

How can I know if you don't provide any details, not even the error message?
Usually it happens because the GPU memory is already occupied by another process or a session that was never released. Try rebooting the server.
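For anyone hitting the same thing, the usual first checks look like the sketch below. This is a hedged example assuming the TF 1.x / Keras 2.x stack that was current in 2019, not a guaranteed fix for this particular cluster.

```python
# First check what is already using the card, e.g. from a shell:
#   nvidia-smi
# Then, as a common workaround, let TensorFlow allocate GPU memory on demand
# instead of grabbing the whole card at start-up (TF 1.x / Keras 2.x style).
import tensorflow as tf
from keras import backend as K

config = tf.ConfigProto()
config.gpu_options.allow_growth = True
K.set_session(tf.Session(config=config))
```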
