Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

paddlex无法使用gpu训练,4060的显卡 #1726

Open
Sunny2884 opened this issue Jul 31, 2023 · 1 comment
Open

paddlex无法使用gpu训练,4060的显卡 #1726

Sunny2884 opened this issue Jul 31, 2023 · 1 comment
Assignees

Comments

@Sunny2884
Copy link

Checklist:

  1. 查找历史相关issue寻求解答
  2. 翻阅FAQ常见问题汇总和答疑
  3. 确认bug是否在新版本里还未修复
  4. 如果bug是由PaddleX API 2.0导致,且该bug在develop分支里已修复,参考FAQ Q4替换内置PaddleX API

描述问题

复现

  1. 请提供您出现的报错信息及相关log(log的查找见 FAQ Q2
    This log file path is D:\software\paddlex\projects\P0001\T0037\err.log
    注意:标志为WARNING/INFO类的仅为警告或提示类信息,非错误信息
    D:\files\chrome\PaddleX_GUI_2.1.0_win10\PaddleX_GUI_2.1.0_win10\paddle\tensor\creation.py:130: DeprecationWarning: np.object is a deprecated alias for the builtin object. To silence this warning, use object by itself. Doing this will not modify any behavior and is safe.
    Deprecated in NumPy 1.20; for more details and guidance: https://numpy.org/devdocs/release/1.20.0-notes.html#deprecations
    if data.dtype == np.object:
    Process Process-1:2:
    Traceback (most recent call last):
    File "multiprocessing\process.py", line 297, in _bootstrap
    File "multiprocessing\process.py", line 99, in run
    File "paddlexui\pms\model_tasks\tasks.py", line 73, in _call_paddlex_train
    File "paddlexui\pms\model_tasks\train\classification.py", line 118, in train
    File "D:\files\chrome\PaddleX_GUI_2.1.0_win10\PaddleX_GUI_2.1.0_win10\paddlex\cv\models\classifier.py", line 888, in init
    model_name=model_name, num_classes=num_classes, **params)
    File "D:\files\chrome\PaddleX_GUI_2.1.0_win10\PaddleX_GUI_2.1.0_win10\paddlex\cv\models\classifier.py", line 70, in init
    self.net = self.build_net(**params)
    File "D:\files\chrome\PaddleX_GUI_2.1.0_win10\PaddleX_GUI_2.1.0_win10\paddlex\cv\models\classifier.py", line 75, in build_net
    **params)
    File "D:\files\chrome\PaddleX_GUI_2.1.0_win10\PaddleX_GUI_2.1.0_win10\paddlex\ppcls\arch\backbone\legendary_models\pp_lcnet.py", line 352, in PPLCNet_x1_0
    model = PPLCNet(scale=1.0, **kwargs)
    File "D:\files\chrome\PaddleX_GUI_2.1.0_win10\PaddleX_GUI_2.1.0_win10\paddlex\ppcls\arch\backbone\legendary_models\pp_lcnet.py", line 183, in init
    stride=2)
    File "D:\files\chrome\PaddleX_GUI_2.1.0_win10\PaddleX_GUI_2.1.0_win10\paddlex\ppcls\arch\backbone\legendary_models\pp_lcnet.py", line 93, in init
    bias_attr=False)
    File "D:\files\chrome\PaddleX_GUI_2.1.0_win10\PaddleX_GUI_2.1.0_win10\paddle\nn\layer\conv.py", line 656, in init
    data_format=data_format)
    File "D:\files\chrome\PaddleX_GUI_2.1.0_win10\PaddleX_GUI_2.1.0_win10\paddle\nn\layer\conv.py", line 135, in init
    default_initializer=_get_default_param_initializer())
    File "D:\files\chrome\PaddleX_GUI_2.1.0_win10\PaddleX_GUI_2.1.0_win10\paddle\fluid\dygraph\layers.py", line 424, in create_parameter
    default_initializer)
    File "D:\files\chrome\PaddleX_GUI_2.1.0_win10\PaddleX_GUI_2.1.0_win10\paddle\fluid\layer_helper_base.py", line 378, in create_parameter
    **attr._to_kwargs(with_initializer=True))
    File "D:\files\chrome\PaddleX_GUI_2.1.0_win10\PaddleX_GUI_2.1.0_win10\paddle\fluid\framework.py", line 3137, in create_parameter
    initializer(param, self)
    File "D:\files\chrome\PaddleX_GUI_2.1.0_win10\PaddleX_GUI_2.1.0_win10\paddle\fluid\initializer.py", line 719, in call
    stop_gradient=True)
    File "D:\files\chrome\PaddleX_GUI_2.1.0_win10\PaddleX_GUI_2.1.0_win10\paddle\fluid\framework.py", line 3167, in append_op
    kwargs.get("stop_gradient", False))
    File "D:\files\chrome\PaddleX_GUI_2.1.0_win10\PaddleX_GUI_2.1.0_win10\paddle\fluid\dygraph\tracer.py", line 45, in trace_op
    not stop_gradient)
    OSError: (External) CUSOLVER error(7).
    [Hint: 'CUSOLVER_STATUS_INTERNAL_ERROR'. An internal cuSolver operation failed. This error is usually caused by a cudaMemcpyAsync() failure.To correct: check that the hardware, an appropriate version of the driver, and the cuSolver library are correctly installed. Also, check that the memory passed as a parameter to the routine is not being deallocated prior to the routine’s completion.] (at ..\paddle/fluid/platform/device_context.h:418)
    [operator < gaussian_random > error]

  2. 请提供您使用的GUI版本号 2.1.0

  3. 请提供您使用的操作系统信息,如Linux/Windows/MacOS windows 11

  4. 请问您使用的CUDA/cuDNN的版本号是? cuda11.2 cudnn8.5.0

@dyning
Copy link
Collaborator

dyning commented Jan 18, 2024

欢迎尝试使用PaddleX新版本试试,看看问题是否可以解决,https://aistudio.baidu.com/intro/paddlex

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants