
Difference between user and sequence representation #155

Open
JoaoLages opened this issue Apr 16, 2019 · 6 comments

@JoaoLages

Can anyone explain to me why the user and sequence representations are calculated this way? It seems like the last state of the LSTM is the sequence representation and the rest is the user representation. I'm not following how this works.
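For context, this is roughly the pattern I'm referring to. It's just a sketch of how I read it, with approximate names like `SequenceNet` and `user_representation`, not the actual library code:

```python
import torch.nn as nn
import torch.nn.functional as F


class SequenceNet(nn.Module):
    """Rough sketch of the pattern in question, not the actual library code."""

    def __init__(self, num_items, embedding_dim=32):
        super().__init__()
        self.item_embeddings = nn.Embedding(num_items, embedding_dim, padding_idx=0)
        self.lstm = nn.LSTM(embedding_dim, embedding_dim, batch_first=True)

    def user_representation(self, item_sequences):
        # item_sequences: (batch, seq_len) tensor of item ids
        embedded = self.item_embeddings(item_sequences)   # (batch, seq_len, dim)
        # Left-pad with one zero vector so the state at position t has only
        # seen the items strictly before t.
        embedded = F.pad(embedded, (0, 0, 1, 0))          # (batch, seq_len + 1, dim)
        states, _ = self.lstm(embedded)                   # (batch, seq_len + 1, dim)
        # All states except the last: one representation per position in the
        # sequence. The last state: the representation of the whole sequence,
        # used to predict the next, not-yet-seen item.
        return states[:, :-1, :], states[:, -1, :]
```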

@JoaoLages (Author)

OK, I'm starting to understand. I had missed the padding you apply to the input. You mask the input so that the task is essentially 'predict the last input token'. That makes more sense.
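To check that I'm following the alignment, here's how I picture it, as a toy example with made-up item ids:

```python
# Toy illustration of the shift introduced by the padding (made-up item ids).
sequence = [5, 2, 9, 7]              # items the user interacted with, in order
lstm_inputs = ["PAD"] + sequence     # what the network actually sees

# The state at position t is computed from lstm_inputs[: t + 1] and is scored
# against the item at position t of the original sequence:
for t, target in enumerate(sequence):
    print(f"state {t}: has seen {lstm_inputs[: t + 1]}, is trained to predict {target}")

# The final state has seen the whole sequence and has no target inside it;
# it can only be used to predict the next, not-yet-observed item.
```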

However, I'm still not following why you mask the input at the end and use the representation of the last input.

@JoaoLages (Author) commented Apr 17, 2019

When predicting, it would make sense to me to use the whole input, unpadded, and then the whole output afterwards, not only the portion related to the last input token. By that I mean using the full `user_representations` variable and not adding any padding.

@JoaoLages (Author)

Ah OK, I think I finally understand. Only `user_representations` contains vectors that are trained to be as close as possible to the input vectors, after they pass through the embedding layer. This means that only the last vector of `user_representations` actually contains the prediction for the next input item, which makes sense.
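If that's right, prediction would look something like this, reusing the made-up names from the sketch above (a real implementation would presumably also involve item biases and so on):

```python
# Sketch of what prediction would look like under that interpretation,
# using the hypothetical SequenceNet from the earlier sketch.
def predict_next(net, item_sequence):
    # item_sequence: (1, seq_len) tensor of item ids for a single user
    _, sequence_repr = net.user_representation(item_sequence)   # (1, dim)
    # Score every candidate item by the dot product between the sequence
    # representation and that item's embedding.
    scores = sequence_repr @ net.item_embeddings.weight.t()     # (1, num_items)
    return scores
```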

I will leave this issue open to see if somebody can confirm this.

@maciejkula (Owner)

This sounds about right: the representation at step i encodes all user behaviours up to i.

@JoaoLages (Author) commented Apr 29, 2019

It'd be cool if we could add a custom hidden layer with that behaviour, to introduce more non-linearities and transformations into the model.
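Something like this is what I have in mind, just as a sketch of the idea rather than a proposal for an actual API:

```python
import torch.nn as nn


class NonlinearHead(nn.Module):
    """Sketch of the idea: extra non-linear transformations applied to each
    per-step LSTM state before it is scored against the item embeddings."""

    def __init__(self, hidden_dim, embedding_dim):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(hidden_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, embedding_dim),
        )

    def forward(self, lstm_states):
        # lstm_states: (batch, seq_len, hidden_dim) -> (batch, seq_len, embedding_dim)
        return self.mlp(lstm_states)
```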

@JoaoLages
Copy link
Author

What's the big advantage over training only one time step at a time? By that I mean, each step i would have its own backprop pass.
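For reference, this is how I picture the current "all steps at once" approach, reusing the made-up names from the sketch above (a real loss would also sample negative items): every step contributes a loss term, and a single backward pass covers the whole sequence rather than one forward/backward pass per step.

```python
import torch.nn.functional as F


def sequence_loss(net, item_sequences):
    # item_sequences: (batch, seq_len) tensor of item ids; as discussed above,
    # the left-padded inputs make the state at step t line up with the item
    # observed at step t.
    step_representations, _ = net.user_representation(item_sequences)  # (batch, seq_len, dim)
    target_embeddings = net.item_embeddings(item_sequences)            # (batch, seq_len, dim)
    # One score (and one loss term) per step; a single backward pass then
    # updates the weights using every step of every sequence in the batch.
    positive_scores = (step_representations * target_embeddings).sum(-1)
    return F.softplus(-positive_scores).mean()
```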
