
transpose? #9

Open
hungpthanh opened this issue Oct 3, 2017 · 3 comments

hungpthanh commented Oct 3, 2017

Why do you need the transpose here:
_s, state_word, _ = word_attn_model(mini_batch[i,:,:].transpose(0,1), state_word)

and here, in pad_batch:
torch.from_numpy(main_matrix).transpose(0,1)

Thanks :)

@Sandeep42 (Contributor)

I think the transpose was used because PyTorch expects batch_size in the second dimension; it's been a while since I wrote this. But I did check all the dimensions from start to end when I developed it. :)
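
For context, here is a minimal sketch of why the transpose matters (hypothetical sizes, not the repo's exact code): torch.nn.GRU defaults to batch_first=False, so it expects input shaped (seq_len, batch, input_size), i.e. batch_size in the second dimension.

    import torch
    import torch.nn as nn

    batch_size, max_tokens, vocab, embed_dim, hidden_dim = 4, 10, 100, 50, 32

    # A mini-batch laid out as (batch, seq) is transposed to (seq, batch)
    # so that the downstream GRU sees batch_size in the second dimension.
    tokens = torch.randint(0, vocab, (batch_size, max_tokens))
    tokens = tokens.transpose(0, 1)      # -> (max_tokens, batch_size)

    embedding = nn.Embedding(vocab, embed_dim)
    embedded = embedding(tokens)         # -> (max_tokens, batch_size, embed_dim)

    gru = nn.GRU(embed_dim, hidden_dim)  # batch_first=False by default
    output, h_n = gru(embedded)
    print(output.shape)                  # torch.Size([10, 4, 32])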

@hungpthanh (Author)

Thank you so much 👍


gabrer commented Apr 13, 2018

@Sandeep42 @hungthanhpham94
I wonder whether there is an error due to what PyTorch is expecting.

In the function train_data(), it's written:

    for i in xrange(max_sents):
        _s, state_word, _ = word_attn_model(mini_batch[i,:,:].transpose(0,1), state_word)

This way, after the .transpose(0,1), the resulting mini_batch tensor has size (max_tokens, batch_size).

However, the first function to be called is self.lookup(embed), which expects a (batch_size, list_of_indices) input.

If this is correct, all the following code would need to be fixed up.
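
One thing worth checking here (a quick sketch with made-up sizes, not the repo's code): nn.Embedding accepts an index tensor of any shape and returns that shape with embedding_dim appended, so a (max_tokens, batch_size) input to the lookup still produces the (seq, batch, embed) layout that a batch_first=False RNN expects.

    import torch
    import torch.nn as nn

    max_tokens, batch_size, vocab, embed_dim = 7, 3, 100, 16

    lookup = nn.Embedding(vocab, embed_dim)
    indices = torch.randint(0, vocab, (max_tokens, batch_size))

    # The lookup works on any index shape; embedding_dim is appended last.
    out = lookup(indices)
    print(out.shape)  # torch.Size([7, 3, 16])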
