How long for training #14

Open
ShenZheng2000 opened this issue Apr 23, 2021 · 2 comments

Comments

ShenZheng2000 commented Apr 23, 2021

Hello, authors! I have a question about how long your model takes to converge. Your manuscript states that "The proposed MBLLEN can be quickly converged after being trained for 5000 mini-batches". However, your train.py sets 200 epochs with 200 batches per epoch, so the total number of mini-batches should be 40,000 rather than 5,000. Could you clarify this? Thanks!
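
The arithmetic behind this question can be written out as a small sketch. The function names below are hypothetical and are not taken from train.py; only the numbers (200 epochs, 200 batches per epoch, 5000 mini-batches) come from the post above.

```python
def total_minibatches(epochs: int, steps_per_epoch: int) -> int:
    """Number of mini-batches processed over a full training run."""
    return epochs * steps_per_epoch


def epochs_for(minibatches: int, steps_per_epoch: int) -> float:
    """How many epochs a given number of mini-batches corresponds to."""
    return minibatches / steps_per_epoch


# Values quoted in the issue: 200 epochs, 200 batches per epoch.
configured = total_minibatches(epochs=200, steps_per_epoch=200)
print(configured)  # 40000

# The manuscript's figure of 5000 mini-batches, expressed in epochs.
print(epochs_for(5000, steps_per_epoch=200))  # 25.0
```

With these numbers, 5000 mini-batches corresponds to 25 of the 200 configured epochs, which is one possible reading of the gap between the manuscript and the script.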

@yjg123456

Yes, I feel the authors did not explain this clearly.

@Sherlock-hh

Hello, did you solve the problem? My GPU memory is not enough, so I changed the batch size from 16 to 8 and step_epoch from 200 to 400. Does this affect the final results? My loss keeps hovering around 1.2, which seems a bit off.
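
A quick check of what this change does to the amount of data seen per epoch, as a minimal sketch. The helper name is hypothetical and not from the repository; the numbers are the ones quoted in this comment.

```python
def samples_per_epoch(batch_size: int, steps_per_epoch: int) -> int:
    """Training samples consumed in one epoch."""
    return batch_size * steps_per_epoch


original = samples_per_epoch(batch_size=16, steps_per_epoch=200)
reduced = samples_per_epoch(batch_size=8, steps_per_epoch=400)

print(original, reduced)  # 3200 3200

# Weight updates per epoch equal the number of steps, so they double.
print(400 // 200)  # 2
```

Each epoch still sees the same number of samples, but it performs twice as many updates on smaller batches, so the training dynamics can still differ. This arithmetic alone does not say whether the final results change.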
