This repository has been archived by the owner on Aug 18, 2021. It is now read-only.

Question from character level RNN classifier, why not use the hidden state across epochs? #139

Open
labJunky opened this issue Nov 27, 2019 · 1 comment

Comments

@labJunky

labJunky commented Nov 27, 2019

In the RNN classification example, which uses the characters of a name to predict the name's language, the train function re-zeros the hidden state (and the gradients) every epoch. I was wondering why this is done, instead of carrying over the final hidden state from the epoch before?

@ZhouXing19

In this example, one epoch is a run-through of a single word. Starting a new epoch means training the network on a new word, so the hidden state must be re-initialized before the new word's first letter: the hidden states of different words are independent, and carrying the old state over would let one name's characters influence the prediction for an unrelated name.
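To make this concrete, here is a minimal pure-Python sketch (toy weights and sizes, not the tutorial's actual PyTorch code) of a character-by-character recurrence where each word gets a fresh zero hidden state, so the result for one word cannot depend on the word processed before it:

```python
import math

HIDDEN_SIZE = 4  # hypothetical size, chosen only for illustration

def init_hidden():
    # Fresh zero hidden state for each new word.
    return [0.0] * HIDDEN_SIZE

def rnn_step(char_vec, hidden, w=0.5, u=0.3):
    # Toy Elman-style update, elementwise: h_t = tanh(w*x_t + u*h_{t-1}).
    # The real tutorial uses learned weight matrices instead of scalars.
    return [math.tanh(w * x + u * h) for x, h in zip(char_vec, hidden)]

def run_word(char_vectors):
    # One "epoch" in the question's sense: a full pass over one word.
    hidden = init_hidden()  # states of different words are independent
    for vec in char_vectors:
        hidden = rnn_step(vec, hidden)
    return hidden  # the final hidden state would feed the classifier

# Each call starts from zeros, so processing word_a first has no
# effect on the result for word_b.
word_a = [[1.0] * HIDDEN_SIZE, [0.5] * HIDDEN_SIZE]
word_b = [[0.2] * HIDDEN_SIZE]
result_b = run_word(word_b)
```

If the final hidden state were carried across words instead, `run_word(word_b)` would return different values depending on which word happened to be trained on just before it, which is exactly the cross-word dependence the re-zeroing avoids.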
