multi layer RNN #79

WoodySG2018 · 2019-07-03T09:24:44Z

coded multi layer RNN. Using 2 layers as default, can change "num_layers" to adjust number of layers when calling DecoderWithAttention. Please check.

kmario23 · 2019-07-23T11:06:43Z

Does it achieve improved results when compared to a single layer one? If yes, by what margin?

WoodySG2018 · 2019-07-24T03:45:46Z

Does it achieve improved results when compared to a single layer one? If yes, by what margin?

Hi kmario23, the training is on-going, by now I saw clear different loss decreasing between 2-layer and 4-layer RNN (I can't trace back single layer RNN performance), but it's too early to tell relation of performance vs #layer on my working dataset.

Theoretically to capture highly hierarchical structure by just one layer is not optimal. So hopefully by integrating multilayer here we can enable users to test the model's power in another dimension on their own datasets.

nilinykh · 2020-01-06T18:44:33Z

Is there any update on this issue? Will it be merged? A very nice functionality, I would say.

Just a quick question related to how initial states for multi-layer LSTM are constructed: I am wondering why we need to initialize hidden/cell states of the LSTM by passing image representation through FC layer? Is it somehow the standard way of doing it?

kmario23 · 2020-01-07T15:48:06Z

Just a quick question related to how initial states for multi-layer LSTM are constructed: I am wondering why we need to initialize hidden/cell states of the LSTM by passing image representation through FC layer? Is it somehow the standard way of doing it?

That's how we pass the learned representation of the image by the CNN (encoder) to the RNN/LSTM (decoder). Usually the decoder dimension varies in size (i.e. a hyperparameter). So, to match the encoder dimension to the decoder dimension, we usually have to project the learned representation of image i.e. the encoder output, which is of dimension 2048 in this case, to the decoder dimension. Else it's not possible to feed the encoder output directly into the decoder.

WoodySG2018 and others added 2 commits July 3, 2019 17:23

multi layer RNN

71107e2

coded multi layer RNN. Using 2 layers as default, can change "num_layers" to adjust number of layers when calling DecoderWithAttention. Please check.

revised caption.py and eval.py to adapt multilayer RNN

bbe6dcb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

multi layer RNN #79

multi layer RNN #79

WoodySG2018 commented Jul 3, 2019

kmario23 commented Jul 23, 2019

WoodySG2018 commented Jul 24, 2019

nilinykh commented Jan 6, 2020

kmario23 commented Jan 7, 2020

multi layer RNN #79

Are you sure you want to change the base?

multi layer RNN #79

Conversation

WoodySG2018 commented Jul 3, 2019

kmario23 commented Jul 23, 2019

WoodySG2018 commented Jul 24, 2019

nilinykh commented Jan 6, 2020

kmario23 commented Jan 7, 2020