Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CUDA Graph open fail #940

Open
tuanzhangCS opened this issue Oct 21, 2023 · 3 comments
Open

CUDA Graph open fail #940

tuanzhangCS opened this issue Oct 21, 2023 · 3 comments

Comments

@tuanzhangCS
Copy link

System information

  • OS Platform and Distribution (e.g., Linux Ubuntu 20.04): Linux
  • DeepRec version or commit id: be62ec3
  • Python version: 3.8
  • Bazel version (if compiling from source): 0.26.1
  • GCC/Compiler version (if compiling from source): gcc 9.4.0
  • CUDA/cuDNN version: cuda11.6/cuDNN8

Describe the current behavior
My usage is:

SessionOptions options;
options.config.mutable_gpu_options()->set_cuda_graph_enable_jit(true);
auto status = tensorflow::LoadSavedModel(options, tensorflow::RunOptions(), model_path, {"serve"}, &bundle);
for (int i = 0; i < loops; ++i) {
   status = bundle.session->Run(inputs, out_names, {}, &outputs);
}

But I can't observe cudaGraph used in nsight system.
image

Describe the expected behavior
Some sub graph can run by cuda Graph.

Are there any examples of using cuda Graph?

@tuanzhangCS
Copy link
Author

tuanzhangCS commented Oct 23, 2023

I found that because I didn't compile DeepRec with XLA.
After I fix it, there will be cudaGraphLaunch in the graph when running.
But sometimes the results are random error, why?

@candyzone
Copy link
Collaborator

session run error or random results?

@tuanzhangCS
Copy link
Author

session run error or random results?

Random results.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants