Add tracking of global loss in each epoch #23

Open

wants to merge 6 commits into master

Conversation

ducovrossem

Something to look at while training the model :)

global_loss += 0.5 * entry_weight * (prediction - c_log(count)) **2
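
(For reference, this is the weighted least-squares term of the GloVe objective: entry_weight is the weighting f(X_ij), prediction is the dot product of the two word vectors plus both biases, and count is the raw co-occurrence X_ij. A rough pure-Python equivalent, using illustrative names rather than the actual Cython variables:)

import math

def glove_loss_term(entry_weight, prediction, count):
    # 0.5 * f(X_ij) * (w_i . w_j + b_i + b_j - log X_ij) ** 2
    return 0.5 * entry_weight * (prediction - math.log(count)) ** 2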

@maciejkula
Owner

Makes sense to track the loss. A couple of comments:

  1. Why not just use a primitive double that's initialized in Cython, and then maybe returned from the fit_vectors function? We could then avoid the array + instance attribute approach.
  2. We should only print the loss if verbose == True.
  3. If you are using more than one thread, the loss will be approximate (as the threads will be writing over each other).

fit_vectors returns global loss
Global loss printing over epochs only enabled if verbose is True
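
(Roughly the shape those two commits describe, assuming an epoch loop along the lines of Glove.fit; fit_vectors is passed in as a plain callable here just to keep the sketch self-contained:)

def train(fit_vectors, epochs, verbose=False):
    global_loss = 0.0
    for epoch in range(epochs):
        # fit_vectors now returns the loss accumulated over the whole epoch
        global_loss = fit_vectors()
        if verbose:
            print('Epoch %d: global loss %.5f' % (epoch, global_loss))
    return global_loss
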
@ducovrossem
Author

  1. Changed it around. I had thought about your proposed structure but went with the previous method only because it seemed like less of a rewrite. Why avoid the array + instance attribute approach?
  2. Added.
  3. One could keep a separate global_loss per thread and add them up at the end (sketched below)... I am not sure how much of a detail this is.
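
(A minimal sketch of that per-thread idea in plain Python; the real change would live in the Cython training loop, but the structure is the same: each worker accumulates its own partial loss and the partials are summed at the end.)

from concurrent.futures import ThreadPoolExecutor

def fit_epoch(data_chunks, process_chunk, no_threads):
    # process_chunk trains on one slice of the shuffled co-occurrence entries
    # and returns the loss it accumulated; each worker keeps its own partial
    # sum, so no thread overwrites another's running total.
    with ThreadPoolExecutor(max_workers=no_threads) as pool:
        partial_losses = pool.map(process_chunk, data_chunks)
    return sum(partial_losses)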

@maciejkula
Owner

It just seems strange to have a one-element array when the same purpose can be served by just having a single number.

Regarding the multithreading issue: I don't think this is a huge problem; I just wanted to point out that the loss numbers can be non-deterministic.


-   fit_vectors(self.word_vectors,
+   self.global_loss = fit_vectors(self.word_vectors,
    self.vectors_sum_gradients,
@maciejkula
Owner


Could you just fix the indentation of these lines? self.vectors_sum_gradients should be indented to the same level as self.word_vectors.
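
(That is, with the continuation line aligned under the first argument; the remaining arguments are elided here just as in the diff above:)

self.global_loss = fit_vectors(self.word_vectors,
                               self.vectors_sum_gradients,
                               ...)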

@maciejkula
Owner

I was going to go in sneakily after this is merged and do that :)

@ducovrossem as Radim pointed out, it will be more efficient to only calculate the (prediction - c_log(count)) portion once and then re-use it in the expressions for loss and global_loss.

As for the 0.5 factor: because the expression for the gradient of the loss does not have a 2 * entry_weight scaling factor, it implies that the original loss was 0.5 * (x - y) ** 2. As far as I know this is quite common, because it lets us drop the constant 2 in the derivation.
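
(In code, the suggestion amounts to something like the following pure-Python sketch, with the same illustrative names as before:)

import math

def update_terms(entry_weight, prediction, count):
    residual = prediction - math.log(count)   # computed once, reused twice
    # d/d(prediction) of 0.5 * entry_weight * residual ** 2 is
    # entry_weight * residual: the 2 from differentiating the square
    # cancels the 0.5, so no factor of 2 appears in the gradient.
    gradient_scale = entry_weight * residual
    loss_term = 0.5 * entry_weight * residual ** 2
    return gradient_scale, loss_term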

@piskvorky

Actually, why not factor out log(count) completely, out of the main loop, into the cooccurrence matrix? In other words, do cooccurrence_matrix = log(cooccurrence_matrix).

Or are the actual original counts/weights needed anywhere else, apart from log(count)?

Re. the 0.5 factor: makes sense, thanks!
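
(A sketch of that idea, assuming the counts are stored in a scipy sparse matrix; taking the log of the stored values once up front would replace the per-entry c_log(count) call:)

import numpy as np

def log_cooccurrences(cooccurrence_matrix):
    # Take the element-wise log of the stored non-zero counts once, up front,
    # instead of calling log on every entry inside the training loop.
    logged = cooccurrence_matrix.copy()
    logged.data = np.log(logged.data)
    return logged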

@maciejkula
Owner

The raw value is used for the weighting (and I also quite like the fact that the co-occurrence matrix is marginally model agnostic and could conceivably support a different application).

In general there are a fair number of things that can still be factored out to happen just once (look at the bias updates, for instance). I'll probably do a pass soon and get those out of the way.
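
(The weighting mentioned above is the usual GloVe f(X_ij), which needs the raw count rather than its log; a sketch, with x_max and alpha as the standard GloVe hyperparameters:)

def glove_weight(count, x_max=100.0, alpha=0.75):
    # f(X_ij) = (X_ij / x_max) ** alpha, capped at 1 for very frequent pairs
    return min(1.0, (count / x_max) ** alpha)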

@IronFarm

IronFarm commented Jan 3, 2018

I know this pull request has gotten stale, but is there any interest in getting it merged? I've managed to merge it into master locally and would be willing to fork and open a new pull request where we can discuss it. Three years have passed, but it still looks like a valuable addition!

@gokceneraslan

It'd be really great to add this feature!
