Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

预训练的时候在加载模型的过程中卡住 #917

Open
amanyara opened this issue Oct 13, 2023 · 0 comments
Open

预训练的时候在加载模型的过程中卡住 #917

amanyara opened this issue Oct 13, 2023 · 0 comments

Comments

@amanyara
Copy link

Describe the bug
A clear and concise description of what the bug is.
在我预训练模型的时候,模型加载的一晚上还在加载。显存没有跑满。

To Reproduce
Steps to reproduce the behavior:

  1. cd /home/pengc/ernie/ernie-develop/demo
  2. python pretrain.py --data_dir "../data/*.gz" --from_pretrained ../ernie-gram-zh --save_dir ./outabtrain_1012

Expected behavior
A clear and concise description of what you expected to happen.
期待模型开始训练,日志输出内容
054A01C8-4E38-4b0c-860F-060009F1ABB6

Screenshots
If applicable, add screenshots to help explain your problem.

Additional context
Add any other context about the problem here.
运行环境:Ubuntu18.04
显卡:A800*2
Python:3.8.18
cuDNN:8.4
PaddlePaddle-GPU: 2.5.1.post120
运行指令:如图

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant