Fix for the graphsage feedforward algorithm #1792

aj111000 · 2020-08-21T23:12:35Z

I went into depth about the issue in #1789, so go there if you want to see more of the gory details, but I'll try to give a summary here.

Essentially, in the original graphsage paper, they first concatenate the previous layer's output with the aggregated feature vector and then applies a fully connected layer to the whole thing. However, in the current stellargraph implementation, weights are first applied to each vector and then the results are concatenated.

There are two main issues with this implementation. First, the network has about half as many parameters as it should, which leads to a lack of expressiveness. Secondly, and much more importantly, it means the first half of the output for each layer is ONLY based on the feature vector, and the second half is ONLY based on the aggregated vector, meaning that the network lacks the ability to gain any information about their interplay (at least within one layer).

I fixed this in a very simple way. First of all, I changed the output sizes of all the weights from outputdim/2 (plus some remainder terms) to just being outputdim. Next, I replaced the concatenation operation with a sum operation, which is the equivalent of saying that if A=[X,Y] and v = [x;y], then Av=[X,Y][x;y]=Xx+Yy, which is completely equivalent

I made no other changes, so it should still work with all other uses of the graphsage model

As a part of that, I had to change some of the tests that were related to how graphsage models were initialized (i.e., the ones that test the weight sizes for an initialized graphsage model)

codeclimate · 2020-08-21T23:13:10Z

Code Climate has analyzed commit e6effd0 and detected 0 issues on this pull request.

View more on Code Climate.

huonw · 2020-09-23T00:47:29Z

Thanks for the pull request and thanks for waiting. This looks good, although I'm following up with the authors of this implementation to see if there's some historical reason for this approach.

aj111000 · 2020-09-23T04:12:17Z

Alright, sounds good! Thanks a lot

aj111000 added 4 commits August 20, 2020 20:34

Update graphsage.py

635e147

Fixed weight dimension issue

8fb60de

Fixed whitespace

73c0ade

Fixed tests to match the changes that I'm making to graphsage

205d5d4

Replaced sum() with tf.add_n()

e6effd0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix for the graphsage feedforward algorithm #1792

Fix for the graphsage feedforward algorithm #1792

aj111000 commented Aug 21, 2020 •

edited

codeclimate bot commented Aug 21, 2020 •

edited

huonw commented Sep 23, 2020

aj111000 commented Sep 23, 2020

Fix for the graphsage feedforward algorithm #1792

Are you sure you want to change the base?

Fix for the graphsage feedforward algorithm #1792

Conversation

aj111000 commented Aug 21, 2020 • edited

codeclimate bot commented Aug 21, 2020 • edited

huonw commented Sep 23, 2020

aj111000 commented Sep 23, 2020

aj111000 commented Aug 21, 2020 •

edited

codeclimate bot commented Aug 21, 2020 •

edited