Bug with getting the embedding from a pre-trained model #147

Open

enferas opened this issue Jun 10, 2018 · 2 comments

Comments


enferas commented Jun 10, 2018

Hello,

I was trying to use a pre-trained model for the embedding, but there are two bugs.

The first one is in sample.py, when initializing the parameters. I think we shouldn't re-initialize the embedding.
for param in seq2seq.parameters(): param.data.uniform_(-0.08, 0.08)
I recommend changing it to:
for param in [p for p in seq2seq.parameters() if p.requires_grad]: param.data.uniform_(-0.08, 0.08)
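For illustration, here is a minimal, self-contained sketch (the toy model, names, and sizes below are placeholders, not code from this repo) of how a frozen pre-trained embedding ends up with requires_grad=False, so the filtered init leaves it untouched:

```python
import torch
import torch.nn as nn

# Placeholder pre-trained vectors and a toy model; names and sizes are illustrative only.
pretrained = torch.randn(1000, 300)

class TinySeq2Seq(nn.Module):
    def __init__(self, weights):
        super().__init__()
        # freeze=True sets requires_grad=False on the embedding weight
        self.embedding = nn.Embedding.from_pretrained(weights, freeze=True)
        self.rnn = nn.GRU(weights.size(1), 128)

seq2seq = TinySeq2Seq(pretrained)

# Initialize only the trainable parameters; the frozen embedding keeps its vectors.
for param in (p for p in seq2seq.parameters() if p.requires_grad):
    param.data.uniform_(-0.08, 0.08)

assert torch.equal(seq2seq.embedding.weight, pretrained)  # pre-trained vectors untouched
```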

The second bug is with the optimizer in supervised_trainer.py: it will throw an error when it tries to optimize the frozen embedding.
optimizer = Optimizer(optim.Adam(model.parameters()), max_grad_norm=5)
I recommend changing it to:
optimizer = Optimizer(optim.Adam(filter(lambda p: p.requires_grad, model.parameters())), max_grad_norm=5)
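Continuing the toy sketch above (seq2seq stands in for the real model, and the Optimizer import is, if I recall correctly, the wrapper this repo exposes), the fixed setup would look roughly like:

```python
import torch.optim as optim
from seq2seq.optim import Optimizer  # the Optimizer wrapper used by this library

# Give Adam only the trainable parameters; the frozen embedding is skipped, which
# avoids the error older PyTorch versions raise for parameters that don't require
# gradients, and keeps Adam from allocating state for weights that never update.
trainable = filter(lambda p: p.requires_grad, seq2seq.parameters())
optimizer = Optimizer(optim.Adam(trainable), max_grad_norm=5)
```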

I also have a question: do you think it is important to have the pre-trained embedding in the decoder as well, or is it enough to have it only in the encoder?


enferas commented Jun 22, 2018

The same error occurs with the resume option. It should be:
self.optimizer.optimizer = resume_optim.__class__(filter(lambda p: p.requires_grad, model.parameters()), **defaults)
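For context, a rough, self-contained sketch of the pattern behind this fix, using toy stand-ins rather than the trainer's actual variables: rebuild the checkpointed optimizer's class over trainable parameters only, reusing its hyperparameters.

```python
import torch.nn as nn
import torch.optim as optim

# Toy stand-ins for the restored model and optimizer; hypothetical names,
# not the trainer's real variables.
model = nn.Linear(4, 2)
model.weight.requires_grad = False                 # pretend this is the frozen embedding
resume_optim = optim.Adam([model.bias], lr=1e-3)   # optimizer restored from a checkpoint

# Reuse the optimizer's class and hyperparameters, but rebuild it over trainable
# parameters only, dropping the stale parameter list.
defaults = dict(resume_optim.param_groups[0])
defaults.pop('params', None)
new_optim = resume_optim.__class__(
    filter(lambda p: p.requires_grad, model.parameters()),
    **defaults)
```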

@pskrunner14

@enferas thanks for pointing this out.
