CapsGNN (Loss=nan)? #16

Open
diesel248 opened this issue May 19, 2020 · 13 comments

@diesel248

I was trying to run this code but got the error shown in the screenshot below.

[screenshot: training log ending with Loss = nan]

@lishu0716

In layers.py, add a line b_ij = b_ij + u_vj1 before line 143 b_max = torch.max(b_ij, dim = 2, keepdim = True)
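For reference, a minimal sketch of where the added line sits. The shapes are hypothetical, and the stabilized softmax at the end is only my reading of what the b_max line is used for, not the repository's exact code:

```python
import torch

# Hypothetical shapes: (batch, input_capsules, output_capsules, 1).
b_ij = torch.zeros(1, 8, 4, 1)     # routing logits
u_vj1 = torch.randn(1, 8, 4, 1)    # agreement between predictions and outputs

b_ij = b_ij + u_vj1                                # the added line
b_max = torch.max(b_ij, dim=2, keepdim=True)       # line 143 in layers.py
c_ij = torch.softmax(b_ij - b_max.values, dim=2)   # numerically stable routing weights
```

Without the added line, b_ij never accumulates the agreement term, so the routing coefficients never change between iterations.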

@diesel248

In layers.py, add a line b_ij = b_ij + u_vj1 before line 143 b_max = torch.max(b_ij, dim = 2, keepdim = True)

Hi,

Thank you for your help. Actually, I checked the author's commit history, and this line has already been added. It works on the small dataset (30 train, 30 test), but I still get the same problem on the larger dataset (1000 train, 1000 test) after several iterations, and the predictions are all 0.

[screenshot: loss becomes nan on the larger dataset and the predictions are all 0]

@imSeaton

In layers.py, add a line b_ij = b_ij + u_vj1 before line 143 b_max = torch.max(b_ij, dim = 2, keepdim = True)

Hi,

Thank you for your help. Actually, I checked the author's commit history, and this line has already been added. It works on the small dataset (30 train, 30 test), but I still get the same problem on the larger dataset (1000 train, 1000 test) after several iterations, and the predictions are all 0.


Hi! I have met the same problem as you! Have you found a solution?

@diesel248

In layers.py, add a line b_ij = b_ij + u_vj1 before line 143 b_max = torch.max(b_ij, dim = 2, keepdim = True)

Hi,
Thank you for your help. Actually, I checked the author's commit history, and this line has already been added. It works on the small dataset (30 train, 30 test), but I still get the same problem on the larger dataset (1000 train, 1000 test) after several iterations, and the predictions are all 0.

Hi! I have met the same problem as you! Have you found a solution?

Not yet. Still waiting for someone's help.

@holoword

In layers.py, add a line b_ij = b_ij + u_vj1 before line 143 b_max = torch.max(b_ij, dim = 2, keepdim = True)

same problem, need help!

@dtzfast

dtzfast commented Jul 4, 2020

For graph-level classification, how can we add a batch size?

@jack6756

wow

@Wanghongyu97

In layers.py, add a line b_ij = b_ij + u_vj1 before line 143 b_max = torch.max(b_ij, dim = 2, keepdim = True)

Same problem.
At epoch 20, the accuracy is only 0.33 and the loss is around 2.5.

@imSeaton

In layers.py, add a line b_ij = b_ij + u_vj1 before line 143 b_max = torch.max(b_ij, dim = 2, keepdim = True)

Same problem.
At epoch 20, the accuracy is only 0.33 and the loss is around 2.5.

Bro, let's dig through his code together. I think there may be a few problems in it: 1. Before the attention module, the tensor view operation scrambles the data layout, and the view at hidden_representation does the same. 2. The attention module is a bit different from the one in the paper. 3. In the squash operation, |mag| is used as a divisor with no small epsilon added to guard against division by zero. 4. In a standard capsule network, the capsules should not receive gradients during the first two iterations of dynamic routing; they should be isolated with detach(), which is not done here.
Bro, want to add each other on QQ and discuss?
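To make points 3 and 4 concrete, here is a small self-contained sketch (not the repository's code; shapes and names are my own) of a squash with an epsilon in the divisor and a routing loop that detaches the predictions in all but the last iteration:

```python
import torch
import torch.nn.functional as F

def squash(s, dim=-1, eps=1e-8):
    # Point 3: add a small epsilon to the divisor so a zero-norm capsule
    # cannot produce nan.
    sq_norm = (s ** 2).sum(dim=dim, keepdim=True)
    return (sq_norm / (1.0 + sq_norm)) * s / torch.sqrt(sq_norm + eps)

def dynamic_routing(u_hat, num_iterations=3):
    # u_hat: (num_input_caps, num_output_caps, out_dim) prediction vectors.
    b_ij = torch.zeros(u_hat.size(0), u_hat.size(1), 1, device=u_hat.device)
    v_j = None
    for it in range(num_iterations):
        c_ij = F.softmax(b_ij, dim=1)  # routing weights over output capsules
        if it == num_iterations - 1:
            # Only the final iteration lets gradients flow through u_hat.
            v_j = squash((c_ij * u_hat).sum(dim=0))
        else:
            # Point 4: detach u_hat in the earlier iterations so the routing
            # updates themselves are not backpropagated through.
            v_j = squash((c_ij * u_hat.detach()).sum(dim=0))
            b_ij = b_ij + (u_hat.detach() * v_j).sum(dim=-1, keepdim=True)
    return v_j
```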

@imSeaton

For graph-level classification, how can we add a batch size?

If the graph classification algorithm uses the DGL framework, it can pack several graphs into mini-batches to accelerate training. However, in my opinion, the author only uses the concept of a batch to compute the average loss over a batch; the CapsGNN code above does not actually process the graphs of a batch together.
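As an illustration of the DGL-style batching mentioned above (a minimal sketch with toy graphs and a made-up node feature name 'feat'; it is not taken from this repository):

```python
import dgl
import torch

# Two toy graphs with a made-up 16-dimensional node feature 'feat'.
g1 = dgl.graph(([0, 1], [1, 2]), num_nodes=3)
g2 = dgl.graph(([0, 1, 2], [1, 2, 3]), num_nodes=4)
for g in (g1, g2):
    g.ndata['feat'] = torch.randn(g.num_nodes(), 16)

# dgl.batch merges the graphs into one big disconnected graph, so a whole
# mini-batch goes through the GNN in a single forward pass.
batched = dgl.batch([g1, g2])                # 7 nodes, 5 edges in total
batched.ndata['h'] = batched.ndata['feat']   # stand-in for a GNN's node output
graph_repr = dgl.mean_nodes(batched, 'h')    # per-graph readout, shape (2, 16)
```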

Repository owner deleted a comment from shamnastv Feb 19, 2021
@zhangxin9988

In layers.py, add a line b_ij = b_ij + u_vj1 before line 143 b_max = torch.max(b_ij, dim = 2, keepdim = True)

Same problem.
At epoch 20, the accuracy is only 0.33 and the loss is around 2.5.

Bro, let's dig through his code together. I think there may be a few problems in it: 1. Before the attention module, the tensor view operation scrambles the data layout, and the view at hidden_representation does the same. 2. The attention module is a bit different from the one in the paper. 3. In the squash operation, |mag| is used as a divisor with no small epsilon added to guard against division by zero. 4. In a standard capsule network, the capsules should not receive gradients during the first two iterations of dynamic routing; they should be isolated with detach(), which is not done here.
Bro, want to add each other on QQ and discuss?

The dimension transformations in his code are really confusing, especially in the routing part. Does it really need to be this complicated?

@Wanghongyu97

In layers.py, add a line b_ij = b_ij + u_vj1 before line 143 b_max = torch.max(b_ij, dim = 2, keepdim = True)

Same problem.
At epoch 20, the accuracy is only 0.33 and the loss is around 2.5.

Bro, let's dig through his code together. I think there may be a few problems in it: 1. Before the attention module, the tensor view operation scrambles the data layout, and the view at hidden_representation does the same. 2. The attention module is a bit different from the one in the paper. 3. In the squash operation, |mag| is used as a divisor with no small epsilon added to guard against division by zero. 4. In a standard capsule network, the capsules should not receive gradients during the first two iterations of dynamic routing; they should be isolated with detach(), which is not done here.
Bro, want to add each other on QQ and discuss?

The dimension transformations in his code are really confusing, especially in the routing part. Does it really need to be this complicated?
Someone has reimplemented it on GitHub; see shamnastv/GraphCaps for reference.

@zezeze97

In layers.py, add a line b_ij = b_ij + u_vj1 before line 143 b_max = torch.max(b_ij, dim = 2, keepdim = True)

Same problem.
At epoch 20, the accuracy is only 0.33 and the loss is around 2.5.

Bro, let's dig through his code together. I think there may be a few problems in it: 1. Before the attention module, the tensor view operation scrambles the data layout, and the view at hidden_representation does the same. 2. The attention module is a bit different from the one in the paper. 3. In the squash operation, |mag| is used as a divisor with no small epsilon added to guard against division by zero. 4. In a standard capsule network, the capsules should not receive gradients during the first two iterations of dynamic routing; they should be isolated with detach(), which is not done here. Bro, want to add each other on QQ and discuss?

Want to add each other on QQ and talk it over?
