UNet2DS tensorflow non-deterministic training #1

alexklibisz · 2017-07-17T16:31:47Z

Just making a note for future reference that training the UNet2DS model on the GPU with Tensorflow backend results in non-deterministic gradient updates, which results in non-deterministic final results. The final submission are typically within 2% of each other in terms of mean F1 score, but still this adds a confounding factor when trying to compare changes to the architecture or training strategy.

There is a lot of material online about TF's non-determinism. Most of it points to the fact that the underlying CuDNN implementation uses non-deterministic reductions for convolutions (i.e. floating point operations are not necessarily associative). The best, most recent insight I could find was in this pull-request, with comments indicating there is supposedly a forthcoming fix to address this issue.

alexklibisz · 2017-08-16T19:46:54Z

This also seems to make a non-trivial difference when training UNet1D. It seems most of the new libraries now are using CuDNN, so I'm not sure there's a way around this without some fix in CuDNN.

saeedalahmari3 · 2018-12-12T13:55:35Z

I have the same issue now with U-Net for segmentation making dice coef different (+3) every run with the same seed. Were you able to find a solution for this?

alexklibisz · 2018-12-12T20:18:01Z

No, and based on the issues linked to the PR in my original post, it looks like it hasn't been resolved yet.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

UNet2DS tensorflow non-deterministic training #1

UNet2DS tensorflow non-deterministic training #1

alexklibisz commented Jul 17, 2017

alexklibisz commented Aug 16, 2017

saeedalahmari3 commented Dec 12, 2018

alexklibisz commented Dec 12, 2018

UNet2DS tensorflow non-deterministic training #1

UNet2DS tensorflow non-deterministic training #1

Comments

alexklibisz commented Jul 17, 2017

alexklibisz commented Aug 16, 2017

saeedalahmari3 commented Dec 12, 2018

alexklibisz commented Dec 12, 2018