
Not able to train HAN because of the following error. #32

Open

kk54709 opened this issue Oct 14, 2018 · 8 comments

kk54709 commented Oct 14, 2018

# Imports added for completeness; MAX_SENT_LENGTH, MAX_SENTS, embedding_layer
# and AttLayer are defined elsewhere in the script.
from keras.layers import Input, Dense, GRU, Bidirectional, TimeDistributed
from keras.models import Model

# Sentence encoder: word indices -> sentence vector
sentence_input = Input(shape=(MAX_SENT_LENGTH,), dtype='int32')
embedded_sequences = embedding_layer(sentence_input)
l_lstm = Bidirectional(GRU(100, return_sequences=True))(embedded_sequences)
l_att = AttLayer(100)(l_lstm)
sentEncoder = Model(sentence_input, l_att)

# Document encoder: apply the sentence encoder to each sentence, then attend over sentences
review_input = Input(shape=(MAX_SENTS, MAX_SENT_LENGTH), dtype='int32')
review_encoder = TimeDistributed(sentEncoder)(review_input)
l_lstm_sent = Bidirectional(GRU(100, return_sequences=True))(review_encoder)
l_att_sent = AttLayer(100)(l_lstm_sent)
preds = Dense(2, activation='softmax')(l_att_sent)
model = Model(review_input, preds)

model.compile(loss='categorical_crossentropy',
              optimizer='rmsprop',
              metrics=['acc'])

print("model fitting - Hierarchical attention network")

Error
ValueError: Dimensions must be equal, but are 15 and 100 for 'att_layer_10/mul' (op: 'Mul') with input shapes: [?,15], [?,15,100].
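
For reference, the shapes in the error can be reproduced outside Keras: the failing 'Mul' is an element-wise multiply between a 2-D tensor (presumably the attention weights, shape [?, 15]) and a 3-D mask of shape [?, 15, 100]. A minimal NumPy sketch (not the repo's AttLayer code; the shapes are taken straight from the error message) showing the same incompatibility:

import numpy as np

attention_weights = np.ones((2, 15))       # stands in for the [?, 15] operand
propagated_mask = np.ones((2, 15, 100))    # stands in for the [?, 15, 100] operand

try:
    attention_weights * propagated_mask    # same kind of element-wise multiply as 'att_layer_10/mul'
except ValueError as e:
    print(e)  # operands could not be broadcast together with shapes (2,15) (2,15,100)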

@cjopengler

I got the same error

@cjopengler

I found that changing the code in AttLayer from

def compute_mask(self, inputs, mask=None):
    return mask

to

def compute_mask(self, inputs, mask=None):
    return None

fixes it. In other words, the layer should not propagate the mask.
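
A minimal sketch of where that change lives (assuming Keras 2.x with the TensorFlow backend; this is a simplified attention layer, not the repo's exact AttLayer). The layer can still use the incoming mask to ignore padded timesteps, but by returning None from compute_mask it stops handing a mask to the next layer:

from keras import backend as K
from keras.layers import Layer

class SimpleAttention(Layer):
    def __init__(self, attention_dim, **kwargs):
        super(SimpleAttention, self).__init__(**kwargs)
        self.attention_dim = attention_dim
        self.supports_masking = True  # accept a mask from the previous layer

    def build(self, input_shape):
        # input_shape: (batch, timesteps, features)
        self.W = self.add_weight(name='W',
                                 shape=(input_shape[-1], self.attention_dim),
                                 initializer='glorot_uniform')
        self.b = self.add_weight(name='b', shape=(self.attention_dim,),
                                 initializer='zeros')
        self.u = self.add_weight(name='u', shape=(self.attention_dim, 1),
                                 initializer='glorot_uniform')
        super(SimpleAttention, self).build(input_shape)

    def call(self, x, mask=None):
        uit = K.tanh(K.bias_add(K.dot(x, self.W), self.b))    # (batch, timesteps, attention_dim)
        ait = K.squeeze(K.dot(uit, self.u), axis=-1)          # (batch, timesteps)
        weights = K.exp(ait)
        if mask is not None:
            weights *= K.cast(mask, K.floatx())               # zero out padded timesteps
        weights /= K.sum(weights, axis=1, keepdims=True) + K.epsilon()
        return K.sum(x * K.expand_dims(weights), axis=1)      # (batch, features)

    def compute_mask(self, inputs, mask=None):
        # The change suggested above: return None instead of `return mask`.
        return None

    def compute_output_shape(self, input_shape):
        return (input_shape[0], input_shape[-1])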

@kk54709

kk54709 commented Oct 15, 2018

Solved the issue.
Thanks @cjopengler .

@cjopengler

Solved the issue.
Thanks @cjopengler .

But have you noticed that the first AttLayer, l_att = AttLayer(100)(l_lstm), raises no error, while the second one, l_att_sent = AttLayer(100)(l_lstm_sent), fails when computing the mask?

@kk54709

kk54709 commented Oct 15, 2018

Well, I'm still experimenting with it, but earlier I was facing the same issue and now it is working fine.

@MingleiLI

I had the same problem, and changing the code as @cjopengler suggests works.

@mingkin

mingkin commented Feb 13, 2019

Solved the issue.
Thanks @cjopengler

@980202006

It is because of the TimeDistributed wrapper. The mask it propagates (previous_mask) is

<tf.Tensor 'time_distributed_14/Reshape_2:0' shape=(?, 15, 100) dtype=bool>

i.e. a 3-D boolean tensor rather than the 2-D (batch, timesteps) mask the downstream layers expect. Possible ways to resolve this are described at https://blog.csdn.net/songbinxu/article/details/80242211
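
An alternative workaround (a sketch, not code from this repo or from that blog post): keep AttLayer unchanged and instead drop the mask right after TimeDistributed, so the mis-shaped (?, 15, 100) mask never reaches the sentence-level layers.

from keras import backend as K
from keras.layers import Layer

class DropMask(Layer):
    """Identity layer that discards any incoming Keras mask."""
    def __init__(self, **kwargs):
        super(DropMask, self).__init__(**kwargs)
        self.supports_masking = True  # accept a mask so Keras does not complain

    def call(self, x, mask=None):
        return x  # pass the data through unchanged

    def compute_mask(self, inputs, mask=None):
        return None  # stop the mask from propagating further

# Hypothetical usage in the model from this issue:
# review_encoder = TimeDistributed(sentEncoder)(review_input)
# review_encoder = DropMask()(review_encoder)
# l_lstm_sent = Bidirectional(GRU(100, return_sequences=True))(review_encoder)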
