
Maybe I found a point that should be changed #12

Open
alphanlp opened this issue Dec 4, 2018 · 2 comments

alphanlp commented Dec 4, 2018

self.target_layer = TimeDistributed(Dense(o_tokens.num(), use_bias=False))
change to:
self.target_layer = TimeDistributed(Dense(o_tokens.num(), activation='softmax', use_bias=False))
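
For reference, a minimal sketch of the proposed change in isolation (the vocabulary size and imports are illustrative stand-ins for the repo's o_tokens.num() and Keras setup; whether this change is correct depends on how the loss is defined, as the discussion below shows):

from tensorflow.keras.layers import Dense, TimeDistributed

vocab_size = 8000  # illustrative stand-in for o_tokens.num()

# Proposed change: the output layer emits per-token probabilities
# instead of raw logits.
target_layer = TimeDistributed(
    Dense(vocab_size, activation='softmax', use_bias=False))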

alphanlp commented Dec 5, 2018

It's very interesting: when I use softmax as proposed in the paper, the loss does not go down.

lsdefine (Owner) commented Dec 5, 2018

The TF loss already contains a softmax. With your change, you apply softmax twice.
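
A minimal sketch of the mismatch, assuming the loss is built on tf.nn.sparse_softmax_cross_entropy_with_logits (which applies softmax internally); the names and vocabulary size are illustrative:

import tensorflow as tf
from tensorflow.keras.layers import Dense, TimeDistributed

vocab_size = 8000  # illustrative stand-in for o_tokens.num()

# As in the repo: the output layer emits raw logits (no activation).
target_layer = TimeDistributed(Dense(vocab_size, use_bias=False))

def sparse_loss(y_true, y_pred):
    # sparse_softmax_cross_entropy_with_logits normalizes y_pred with a
    # softmax internally, so y_pred must be raw logits, not probabilities.
    return tf.nn.sparse_softmax_cross_entropy_with_logits(
        labels=tf.cast(y_true, tf.int32), logits=y_pred)

# If the layer used activation='softmax', the loss would also have to change,
# e.g. to tf.keras.losses.SparseCategoricalCrossentropy(from_logits=False).
# Keeping a with_logits loss while adding the activation applies softmax
# twice: the second softmax sees inputs confined to [0, 1], so its output is
# nearly uniform, gradients shrink, and the loss stops decreasing.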
