
Why batch training? #78

Open

jiaojiaoguan opened this issue Sep 16, 2022 · 2 comments


jiaojiaoguan commented Sep 16, 2022

Hello everyone:

I get an error when I run `e = Wh1 + Wh2.T`

[screenshot of the error]

Because the dimension of e is N × N, and in my graph N is about 400,000, materializing it takes 400,000 × 400,000 × 4 / 1024³ bytes ≈ 596 GB (about 600 GB). I want to know whether training in batches would help.

Thanks in advance!
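The memory estimate above can be checked with a short calculation (a minimal sketch; the function name `dense_attention_memory_gb` is hypothetical and not from this repo — it just sizes a dense N × N float32 matrix like the `e = Wh1 + Wh2.T` broadcast produces):

```python
def dense_attention_memory_gb(n, bytes_per_elem=4):
    """Memory needed to materialize a dense n x n attention-logit
    matrix (float32 by default), in GiB."""
    return n * n * bytes_per_elem / 1024**3

# For the ~400,000-node graph from the question:
print(round(dense_attention_memory_gb(400_000)))  # ~596 GiB, i.e. "about 600G"
```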


LQchen1 commented Sep 21, 2022

Hello, I think it won't help, because GAT actually needs to compute attention over the entire graph.

@Preston2jager

From my understanding, batching will work: for a shallow GAT (e.g. 2 layers), a single node only gathers information from nodes up to 2 hops away, so nodes beyond that distance have no effect on its output.
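That k-hop locality argument can be sketched with a small BFS (a hypothetical helper, not code from this repo, assuming the graph is given as an adjacency-list dict): a k-layer GAT only needs the seed nodes' k-hop subgraph in memory per batch.

```python
def k_hop_neighborhood(adj, seeds, k):
    """Return the set of nodes a k-layer GAT needs to compute
    outputs for the seed nodes: the seeds plus every node
    reachable within k hops (breadth-first expansion)."""
    frontier, needed = set(seeds), set(seeds)
    for _ in range(k):
        # Expand one hop, keeping only newly discovered nodes.
        frontier = {v for u in frontier for v in adj.get(u, [])} - needed
        needed |= frontier
    return needed

# A chain graph 0-1-2-3-4: a 2-layer GAT batch seeded at node 0
# only needs nodes within 2 hops, not the whole graph.
adj = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}
print(sorted(k_hop_neighborhood(adj, {0}, 2)))  # [0, 1, 2]
```

This is the same idea behind neighborhood-sampling mini-batch training (as in GraphSAGE-style loaders), which avoids ever materializing the dense N × N attention matrix.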

3 participants