Add sm100 support to <nv/target> for NVCC #1745

Merged: 3 commits, May 23, 2024

Conversation

wmaxey
Member

@wmaxey wmaxey commented May 15, 2024

Description

closes: #1738

Checklist

  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@wmaxey wmaxey requested review from a team as code owners May 15, 2024 20:34
@wmaxey wmaxey requested review from miscco and elstehle May 15, 2024 20:34
copy-pr-bot bot commented May 15, 2024

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@wmaxey
Member Author

wmaxey commented May 15, 2024

pre-commit.ci autofix

@wmaxey
Member Author

wmaxey commented May 15, 2024

/ok to test

@wmaxey wmaxey requested a review from ahendriksen May 16, 2024 02:14
Contributor

🟨 CI Results [ Failed: 1 | Passed: 301 | Total: 302 ]
  • 🟩 Project libcudacxx [ Failed: 0 | Passed: 104 | Total: 104 ]

    🟩 cpu
      🟩 amd64 (0% Fail)              Failed:  0  -- Passed: 96  -- Total: 96 
      🟩 arm64 (0% Fail)              Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 ctk
      🟩 11.1 (0% Fail)               Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 11.8 (0% Fail)               Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 12.4 (0% Fail)               Failed:  0  -- Passed: 86  -- Total: 86 
    🟩 cudacxx_full
      🟩 clang-cuda16 (0% Fail)       Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc11.1 (0% Fail)           Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 nvcc11.8 (0% Fail)           Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 nvcc12.4 (0% Fail)           Failed:  0  -- Passed: 84  -- Total: 84 
    🟩 cudacxx_name
      🟩 clang-cuda (0% Fail)         Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc (0% Fail)               Failed:  0  -- Passed: 102 -- Total: 102
    🟩 cxx_full
      🟩 clang9 (0% Fail)             Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 clang10 (0% Fail)            Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 clang11 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang12 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang13 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang14 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang15 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang16 (0% Fail)            Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 gcc6 (0% Fail)               Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 gcc7 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc8 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc9 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc10 (0% Fail)              Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 gcc11 (0% Fail)              Failed:  0  -- Passed:  7  -- Total:  7 
      🟩 gcc12 (0% Fail)              Failed:  0  -- Passed: 21  -- Total: 21 
      🟩 Intel2023.2.0 (0% Fail)      Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC14.16 (0% Fail)          Failed:  0  -- Passed:  1  -- Total:  1 
      🟩 MSVC14.29 (0% Fail)          Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 MSVC14.39 (0% Fail)          Failed:  0  -- Passed:  3  -- Total:  3 
    🟩 cxx_name
      🟩 clang (0% Fail)              Failed:  0  -- Passed: 43  -- Total: 43 
      🟩 gcc (0% Fail)                Failed:  0  -- Passed: 52  -- Total: 52 
      🟩 Intel (0% Fail)              Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 gpu
      🟩 v100 (0% Fail)               Failed:  0  -- Passed: 104 -- Total: 104
    🟩 jobs
      🟩 build (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 nvrtc (0% Fail)              Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 test (0% Fail)               Failed:  0  -- Passed:  8  -- Total:  8 
      🟩 verify_codegen (0% Fail)     Failed:  0  -- Passed:  1  -- Total:  1 
    🟩 os
      🟩 ubuntu18.04 (0% Fail)        Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 ubuntu20.04 (0% Fail)        Failed:  0  -- Passed: 35  -- Total: 35 
      🟩 ubuntu22.04 (0% Fail)        Failed:  0  -- Passed: 49  -- Total: 49 
      🟩 windows2022 (0% Fail)        Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 sm
      🟩 60;70;80;90 (0% Fail)        Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 90a (0% Fail)                Failed:  0  -- Passed:  4  -- Total:  4 
    🟩 std
      🟩 11 (0% Fail)                 Failed:  0  -- Passed: 27  -- Total: 27 
      🟩 14 (0% Fail)                 Failed:  0  -- Passed: 30  -- Total: 30 
      🟩 17 (0% Fail)                 Failed:  0  -- Passed: 29  -- Total: 29 
      🟩 20 (0% Fail)                 Failed:  0  -- Passed: 17  -- Total: 17 
    
  • 🟨 Project thrust [ Failed: 1 | Passed: 98 | Total: 99 ]

    🔍 cpu: amd64 🔍
      🔍 amd64 (1% Fail)              Failed:  1  -- Passed: 90  -- Total: 91 
      🟩 arm64 (0% Fail)              Failed:  0  -- Passed:  8  -- Total:  8 
    🔍 ctk: 12.4 🔍
      🟩 11.1 (0% Fail)               Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 11.8 (0% Fail)               Failed:  0  -- Passed:  3  -- Total:  3 
      🔍 12.4 (1% Fail)               Failed:  1  -- Passed: 80  -- Total: 81 
    🔍 cudacxx_full: nvcc12.4 🔍
      🟩 clang-cuda16 (0% Fail)       Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc11.1 (0% Fail)           Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 nvcc11.8 (0% Fail)           Failed:  0  -- Passed:  3  -- Total:  3 
      🔍 nvcc12.4 (1% Fail)           Failed:  1  -- Passed: 78  -- Total: 79 
    🔍 cudacxx_name: nvcc 🔍
      🟩 clang-cuda (0% Fail)         Failed:  0  -- Passed:  2  -- Total:  2 
      🔍 nvcc (1% Fail)               Failed:  1  -- Passed: 96  -- Total: 97 
    🔍 cxx_full: clang16 🔍
      🟩 clang9 (0% Fail)             Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 clang10 (0% Fail)            Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 clang11 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang12 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang13 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang14 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang15 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🔍 clang16 (7% Fail)            Failed:  1  -- Passed: 13  -- Total: 14 
      🟩 gcc6 (0% Fail)               Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 gcc7 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc8 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc9 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc10 (0% Fail)              Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 gcc11 (0% Fail)              Failed:  0  -- Passed:  7  -- Total:  7 
      🟩 gcc12 (0% Fail)              Failed:  0  -- Passed: 16  -- Total: 16 
      🟩 Intel2023.2.0 (0% Fail)      Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC14.16 (0% Fail)          Failed:  0  -- Passed:  1  -- Total:  1 
      🟩 MSVC14.29 (0% Fail)          Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 MSVC14.39 (0% Fail)          Failed:  0  -- Passed:  3  -- Total:  3 
    🔍 cxx_name: clang 🔍
      🔍 clang (2% Fail)              Failed:  1  -- Passed: 42  -- Total: 43 
      🟩 gcc (0% Fail)                Failed:  0  -- Passed: 47  -- Total: 47 
      🟩 Intel (0% Fail)              Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
    🔍 jobs: test 🔍
      🟩 build (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🔍 test (12% Fail)              Failed:  1  -- Passed:  7  -- Total:  8 
    🔍 os: ubuntu22.04 🔍
      🟩 ubuntu18.04 (0% Fail)        Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 ubuntu20.04 (0% Fail)        Failed:  0  -- Passed: 35  -- Total: 35 
      🔍 ubuntu22.04 (2% Fail)        Failed:  1  -- Passed: 43  -- Total: 44 
      🟩 windows2022 (0% Fail)        Failed:  0  -- Passed:  6  -- Total:  6 
    🔍 std: 20 🔍
      🟩 11 (0% Fail)                 Failed:  0  -- Passed: 26  -- Total: 26 
      🟩 14 (0% Fail)                 Failed:  0  -- Passed: 29  -- Total: 29 
      🟩 17 (0% Fail)                 Failed:  0  -- Passed: 28  -- Total: 28 
      🔍 20 (6% Fail)                 Failed:  1  -- Passed: 15  -- Total: 16 
    🟨 gpu
      🟨 v100 (1% Fail)               Failed:  1  -- Passed: 98  -- Total: 99 
    🟩 sm
      🟩 60;70;80;90 (0% Fail)        Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 90a (0% Fail)                Failed:  0  -- Passed:  4  -- Total:  4 
    
  • 🟩 Project cub [ Failed: 0 | Passed: 99 | Total: 99 ]

    🟩 cpu
      🟩 amd64 (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 arm64 (0% Fail)              Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 ctk
      🟩 11.1 (0% Fail)               Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 11.8 (0% Fail)               Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 12.4 (0% Fail)               Failed:  0  -- Passed: 81  -- Total: 81 
    🟩 cudacxx_full
      🟩 clang-cuda16 (0% Fail)       Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc11.1 (0% Fail)           Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 nvcc11.8 (0% Fail)           Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 nvcc12.4 (0% Fail)           Failed:  0  -- Passed: 79  -- Total: 79 
    🟩 cudacxx_name
      🟩 clang-cuda (0% Fail)         Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc (0% Fail)               Failed:  0  -- Passed: 97  -- Total: 97 
    🟩 cxx_full
      🟩 clang9 (0% Fail)             Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 clang10 (0% Fail)            Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 clang11 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang12 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang13 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang14 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang15 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang16 (0% Fail)            Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 gcc6 (0% Fail)               Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 gcc7 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc8 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc9 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc10 (0% Fail)              Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 gcc11 (0% Fail)              Failed:  0  -- Passed:  7  -- Total:  7 
      🟩 gcc12 (0% Fail)              Failed:  0  -- Passed: 16  -- Total: 16 
      🟩 Intel2023.2.0 (0% Fail)      Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC14.16 (0% Fail)          Failed:  0  -- Passed:  1  -- Total:  1 
      🟩 MSVC14.29 (0% Fail)          Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 MSVC14.39 (0% Fail)          Failed:  0  -- Passed:  3  -- Total:  3 
    🟩 cxx_name
      🟩 clang (0% Fail)              Failed:  0  -- Passed: 43  -- Total: 43 
      🟩 gcc (0% Fail)                Failed:  0  -- Passed: 47  -- Total: 47 
      🟩 Intel (0% Fail)              Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 gpu
      🟩 v100 (0% Fail)               Failed:  0  -- Passed: 99  -- Total: 99 
    🟩 jobs
      🟩 build (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 test (0% Fail)               Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 os
      🟩 ubuntu18.04 (0% Fail)        Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 ubuntu20.04 (0% Fail)        Failed:  0  -- Passed: 35  -- Total: 35 
      🟩 ubuntu22.04 (0% Fail)        Failed:  0  -- Passed: 44  -- Total: 44 
      🟩 windows2022 (0% Fail)        Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 sm
      🟩 60;70;80;90 (0% Fail)        Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 90a (0% Fail)                Failed:  0  -- Passed:  4  -- Total:  4 
    🟩 std
      🟩 11 (0% Fail)                 Failed:  0  -- Passed: 26  -- Total: 26 
      🟩 14 (0% Fail)                 Failed:  0  -- Passed: 29  -- Total: 29 
      🟩 17 (0% Fail)                 Failed:  0  -- Passed: 28  -- Total: 28 
      🟩 20 (0% Fail)                 Failed:  0  -- Passed: 16  -- Total: 16 
    

🏃‍ Runner counts (total jobs: 302)

# Runner
232 linux-amd64-cpu16
28 linux-amd64-gpu-v100-latest-1
24 linux-arm64-cpu16
18 windows-amd64-cpu16

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental

Comment on lines +377 to +384
# undef NV_HAS_FEATURE_SM_100a
# define NV_HAS_FEATURE_SM_100a __NV_HAS_FEATURE_SM_100a
# if (defined(__CUDA_ARCH__) && (__CUDA_ARCH__ >= 1000) && defined(__CUDA_ARCH_FEAT_SM100_ALL))
# define _NV_TARGET_BOOL___NV_HAS_FEATURE_SM_100a 1
# else
# define _NV_TARGET_BOOL___NV_HAS_FEATURE_SM_100a 0
# endif

Collaborator

I don't think this is needed yet, as there is no SM100a feature.

Contributor

I do think this is needed. SM100a features can build on top of this.

Member Author

I think we possibly don't care. If sm100a doesn't exist, then it doesn't matter. The only people who would be bothered are those relying on IntelliSense.
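To make the point of this thread concrete, here is a minimal sketch of the predicate from the diff, written so it compiles without `<nv/target>`. The `SKETCH_`/`sketch_` names are hypothetical and only mirror the condition shown above; they are not part of the library. It suggests why defining the macro early is harmless: unless the compiler defines `__CUDA_ARCH_FEAT_SM100_ALL`, the predicate is simply false.

```cpp
// Preprocessor form, mirroring the #if in the diff. The SKETCH_ name is
// hypothetical; the real macro is defined by <nv/target>, which is not
// included here.
#if defined(__CUDA_ARCH__) && (__CUDA_ARCH__ >= 1000) && defined(__CUDA_ARCH_FEAT_SM100_ALL)
#  define SKETCH_HAS_FEATURE_SM_100A 1
#else
#  define SKETCH_HAS_FEATURE_SM_100A 0
#endif

// The same predicate as a plain function, so each input can be varied.
// in_device_pass stands for defined(__CUDA_ARCH__), arch for its value,
// and feat_all for defined(__CUDA_ARCH_FEAT_SM100_ALL).
constexpr bool sketch_has_feature_sm_100a(bool in_device_pass, int arch, bool feat_all) {
  return in_device_pass && arch >= 1000 && feat_all;
}

// Without the feature macro the predicate is false, whatever the architecture.
static_assert(!sketch_has_feature_sm_100a(true, 1000, false), "needs the feature macro");
// With all three conditions met it is true.
static_assert(sketch_has_feature_sm_100a(true, 1000, true), "sm_100a device pass");
// A host pass (no __CUDA_ARCH__) is always false.
static_assert(!sketch_has_feature_sm_100a(false, 0, false), "host pass");
```

Compiled by a host-only compiler, `SKETCH_HAS_FEATURE_SM_100A` expands to 0, because `__CUDA_ARCH__` is not defined there.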

Contributor

@ahendriksen ahendriksen left a comment

LGTM.

Questions:

  • Do we want to have a test to check that it works?
  • As a follow-up, should we add sm100 to the build matrix?

@wmaxey
Member Author

wmaxey commented May 19, 2024

> LGTM.
>
> Questions:
>
>   • Do we want to have a test to check that it works?
>   • As a follow-up, should we add sm100 to the build matrix?

The current tests already act as something of a canary: compiling with an unknown architecture fails because we use __CUDA_ARCH__ to synthesize the if-target query.
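As a rough illustration of that canary effect, the sketch below shows one way an architecture value that is missing from a known list can surface as a compile error. This is an assumption made for illustration, not how `<nv/target>` or its tests are implemented; every `sketch_` name is hypothetical. The arch values follow the `__CUDA_ARCH__` convention seen in the diff (1000 for SM 100).

```cpp
// Hypothetical list of architectures this sketch knows about, expressed as
// __CUDA_ARCH__-style values (SM 60, 70, 80, 90 and the newly added 100).
constexpr bool sketch_is_known_arch(int arch) {
  return arch == 600 || arch == 700 || arch == 800 || arch == 900 || arch == 1000;
}

// Derives the SM number from an arch value. Instantiating it with a value
// that is not in the list above trips the static_assert, so the build fails
// instead of silently producing a wrong query.
template <int Arch>
constexpr int sketch_exact_sm() {
  static_assert(sketch_is_known_arch(Arch), "unknown architecture: extend sketch_is_known_arch");
  return Arch / 10;
}

// Known architectures compile and map to their SM number.
static_assert(sketch_exact_sm<900>() == 90, "SM 90");
static_assert(sketch_exact_sm<1000>() == 100, "SM 100");
```

Instantiating `sketch_exact_sm<1100>()` would fail to compile in this sketch, which is the kind of early signal the comment describes.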

@wmaxey
Member Author

wmaxey commented May 22, 2024

/ok to test

Contributor

🟩 CI Results: Pass: 100%/302 | Total Time: 2d 06h | Avg Time: 10m 46s | Hits: 81%/369251
  • 🟩 libcudacxx: Pass: 100%/104 | Total Time: 19h 42m | Avg Time: 11m 21s | Hits: 76%/251167

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total Time: 18h 45m | Avg Time: 11m 43s | Hits:  75%/228797
      🟩 arm64              Pass: 100%/8   | Total Time: 56m 18s | Avg Time:  7m 02s | Hits:  83%/22370 
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total Time:  2h 00m | Avg Time:  8m 02s | Hits:  87%/39815 
      🟩 11.8               Pass: 100%/3   | Total Time: 53m 26s | Avg Time: 17m 48s | Hits:  54%/8071  
      🟩 12.4               Pass: 100%/86  | Total Time: 16h 48m | Avg Time: 11m 43s | Hits:  75%/203281
    🟩 cudacxx_full
      🟩 clang-cuda16       Pass: 100%/2   | Total Time: 45m 07s | Avg Time: 22m 33s | Hits:  37%/6109  
      🟩 nvcc11.1           Pass: 100%/15  | Total Time:  2h 00m | Avg Time:  8m 02s | Hits:  87%/39815 
      🟩 nvcc11.8           Pass: 100%/3   | Total Time: 53m 26s | Avg Time: 17m 48s | Hits:  54%/8071  
      🟩 nvcc12.4           Pass: 100%/84  | Total Time: 16h 02m | Avg Time: 11m 27s | Hits:  76%/197172
    🟩 cudacxx_name
      🟩 clang-cuda         Pass: 100%/2   | Total Time: 45m 07s | Avg Time: 22m 33s | Hits:  37%/6109  
      🟩 nvcc               Pass: 100%/102 | Total Time: 18h 56m | Avg Time: 11m 08s | Hits:  77%/245058
    🟩 cxx_full
      🟩 clang9             Pass: 100%/6   | Total Time: 33m 41s | Avg Time:  5m 36s | Hits:  92%/16174 
      🟩 clang10            Pass: 100%/3   | Total Time: 26m 50s | Avg Time:  8m 56s | Hits:  86%/8116  
      🟩 clang11            Pass: 100%/4   | Total Time: 27m 04s | Avg Time:  6m 46s | Hits:  89%/11193 
      🟩 clang12            Pass: 100%/4   | Total Time: 41m 18s | Avg Time: 10m 19s | Hits:  83%/11193 
      🟩 clang13            Pass: 100%/4   | Total Time: 41m 13s | Avg Time: 10m 18s | Hits:  78%/11193 
      🟩 clang14            Pass: 100%/4   | Total Time: 55m 18s | Avg Time: 13m 49s | Hits:  73%/11193 
      🟩 clang15            Pass: 100%/4   | Total Time: 42m 58s | Avg Time: 10m 44s | Hits:  74%/11185 
      🟩 clang16            Pass: 100%/14  | Total Time:  2h 53m | Avg Time: 12m 23s | Hits:  74%/28479 
      🟩 gcc6               Pass: 100%/2   | Total Time:  4m 14s | Avg Time:  2m 07s | Hits:  99%/5047  
      🟩 gcc7               Pass: 100%/6   | Total Time: 14m 48s | Avg Time:  2m 28s | Hits:  99%/16160 
      🟩 gcc8               Pass: 100%/6   | Total Time:  1h 14m | Avg Time: 12m 27s | Hits:  88%/16168 
      🟩 gcc9               Pass: 100%/6   | Total Time: 40m 44s | Avg Time:  6m 47s | Hits:  82%/16172 
      🟩 gcc10              Pass: 100%/4   | Total Time: 41m 54s | Avg Time: 10m 28s | Hits:  83%/11193 
      🟩 gcc11              Pass: 100%/7   | Total Time:  1h 32m | Avg Time: 13m 16s | Hits:  71%/19256 
      🟩 gcc12              Pass: 100%/21  | Total Time:  3h 40m | Avg Time: 10m 28s | Hits:  92%/33938 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total Time:  1h 03m | Avg Time: 21m 13s | Hits:   3%/8106  
      🟩 MSVC14.16          Pass: 100%/1   | Total Time: 32m 12s | Avg Time: 32m 12s | Hits:   3%/2541  
      🟩 MSVC14.29          Pass: 100%/2   | Total Time: 58m 34s | Avg Time: 29m 17s | Hits:   2%/5444  
      🟩 MSVC14.39          Pass: 100%/3   | Total Time:  1h 36m | Avg Time: 32m 09s | Hits:   2%/8416  
    🟩 cxx_name
      🟩 clang              Pass: 100%/43  | Total Time:  7h 21m | Avg Time: 10m 16s | Hits:  80%/108726
      🟩 gcc                Pass: 100%/52  | Total Time:  8h 09m | Avg Time:  9m 24s | Hits:  87%/117934
      🟩 Intel              Pass: 100%/3   | Total Time:  1h 03m | Avg Time: 21m 13s | Hits:   3%/8106  
      🟩 MSVC               Pass: 100%/6   | Total Time:  3h 07m | Avg Time: 31m 12s | Hits:   2%/16401 
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total Time: 19h 42m | Avg Time: 11m 21s | Hits:  76%/251167
    🟩 jobs
      🟩 build              Pass: 100%/91  | Total Time: 15h 59m | Avg Time: 10m 32s | Hits:  76%/251147
      🟩 nvrtc              Pass: 100%/4   | Total Time:  1h 17m | Avg Time: 19m 15s | Hits: 100%/20    
      🟩 test               Pass: 100%/8   | Total Time:  2h 24m | Avg Time: 18m 00s
      🟩 verify_codegen     Pass: 100%/1   | Total Time:  1m 45s | Avg Time:  1m 45s
    🟩 os
      🟩 ubuntu18.04        Pass: 100%/14  | Total Time:  1h 28m | Avg Time:  6m 18s | Hits:  93%/37274 
      🟩 ubuntu20.04        Pass: 100%/35  | Total Time:  5h 13m | Avg Time:  8m 57s | Hits:  84%/96528 
      🟩 ubuntu22.04        Pass: 100%/49  | Total Time:  9h 52m | Avg Time: 12m 06s | Hits:  74%/100964
      🟩 windows2022        Pass: 100%/6   | Total Time:  3h 07m | Avg Time: 31m 12s | Hits:   2%/16401 
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total Time: 53m 26s | Avg Time: 17m 48s | Hits:  54%/8071  
      🟩 90a                Pass: 100%/4   | Total Time: 11m 40s | Avg Time:  2m 55s | Hits:  99%/11548 
    🟩 std
      🟩 11                 Pass: 100%/27  | Total Time:  4h 38m | Avg Time: 10m 18s | Hits:  88%/53654 
      🟩 14                 Pass: 100%/30  | Total Time:  6h 08m | Avg Time: 12m 17s | Hits:  68%/76233 
      🟩 17                 Pass: 100%/29  | Total Time:  4h 52m | Avg Time: 10m 06s | Hits:  78%/78222 
      🟩 20                 Pass: 100%/17  | Total Time:  4h 00m | Avg Time: 14m 08s | Hits:  71%/43058 
    
  • 🟩 thrust: Pass: 100%/99 | Total Time: 13h 09m | Avg Time: 7m 58s | Hits: 90%/50817

    🟩 cpu
      🟩 amd64              Pass: 100%/91  | Total Time: 12h 46m | Avg Time:  8m 25s | Hits:  89%/46709 
      🟩 arm64              Pass: 100%/8   | Total Time: 22m 57s | Avg Time:  2m 52s | Hits:  99%/4108  
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total Time:  1h 23m | Avg Time:  5m 35s | Hits:  93%/7700  
      🟩 11.8               Pass: 100%/3   | Total Time:  9m 41s | Avg Time:  3m 13s | Hits:  99%/1542  
      🟩 12.4               Pass: 100%/81  | Total Time: 11h 36m | Avg Time:  8m 35s | Hits:  89%/41575 
    🟩 cudacxx_full
      🟩 clang-cuda16       Pass: 100%/2   | Total Time:  6m 35s | Avg Time:  3m 17s | Hits: 100%/1026  
      🟩 nvcc11.1           Pass: 100%/15  | Total Time:  1h 23m | Avg Time:  5m 35s | Hits:  93%/7700  
      🟩 nvcc11.8           Pass: 100%/3   | Total Time:  9m 41s | Avg Time:  3m 13s | Hits:  99%/1542  
      🟩 nvcc12.4           Pass: 100%/79  | Total Time: 11h 29m | Avg Time:  8m 43s | Hits:  88%/40549 
    🟩 cudacxx_name
      🟩 clang-cuda         Pass: 100%/2   | Total Time:  6m 35s | Avg Time:  3m 17s | Hits: 100%/1026  
      🟩 nvcc               Pass: 100%/97  | Total Time: 13h 02m | Avg Time:  8m 04s | Hits:  89%/49791 
    🟩 cxx_full
      🟩 clang9             Pass: 100%/6   | Total Time: 20m 32s | Avg Time:  3m 25s | Hits: 100%/3078  
      🟩 clang10            Pass: 100%/3   | Total Time: 11m 46s | Avg Time:  3m 55s | Hits: 100%/1539  
      🟩 clang11            Pass: 100%/4   | Total Time: 12m 33s | Avg Time:  3m 08s | Hits: 100%/2052  
      🟩 clang12            Pass: 100%/4   | Total Time: 12m 31s | Avg Time:  3m 07s | Hits: 100%/2052  
      🟩 clang13            Pass: 100%/4   | Total Time: 13m 21s | Avg Time:  3m 20s | Hits: 100%/2052  
      🟩 clang14            Pass: 100%/4   | Total Time: 13m 01s | Avg Time:  3m 15s | Hits: 100%/2052  
      🟩 clang15            Pass: 100%/4   | Total Time: 12m 52s | Avg Time:  3m 13s | Hits: 100%/2052  
      🟩 clang16            Pass: 100%/14  | Total Time:  1h 32m | Avg Time:  6m 36s | Hits:  99%/7182  
      🟩 gcc6               Pass: 100%/2   | Total Time:  5m 14s | Avg Time:  2m 37s | Hits:  99%/1026  
      🟩 gcc7               Pass: 100%/6   | Total Time: 17m 29s | Avg Time:  2m 54s | Hits:  99%/3084  
      🟩 gcc8               Pass: 100%/6   | Total Time: 17m 25s | Avg Time:  2m 54s | Hits:  99%/3084  
      🟩 gcc9               Pass: 100%/6   | Total Time: 18m 20s | Avg Time:  3m 03s | Hits:  99%/3084  
      🟩 gcc10              Pass: 100%/4   | Total Time: 13m 06s | Avg Time:  3m 16s | Hits:  99%/2056  
      🟩 gcc11              Pass: 100%/7   | Total Time: 22m 26s | Avg Time:  3m 12s | Hits:  99%/3598  
      🟩 gcc12              Pass: 100%/16  | Total Time:  1h 48m | Avg Time:  6m 48s | Hits:  94%/8224  
      🟩 Intel2023.2.0      Pass: 100%/3   | Total Time:  1h 39m | Avg Time: 33m 17s | Hits:   5%/1548  
      🟩 MSVC14.16          Pass: 100%/1   | Total Time: 45m 03s | Avg Time: 45m 03s | Hits:   0%/509   
      🟩 MSVC14.29          Pass: 100%/2   | Total Time:  1h 42m | Avg Time: 51m 04s | Hits:   0%/1018  
      🟩 MSVC14.39          Pass: 100%/3   | Total Time:  2h 30m | Avg Time: 50m 08s | Hits:   0%/1527  
    🟩 cxx_name
      🟩 clang              Pass: 100%/43  | Total Time:  3h 09m | Avg Time:  4m 23s | Hits:  99%/22059 
      🟩 gcc                Pass: 100%/47  | Total Time:  3h 22m | Avg Time:  4m 18s | Hits:  97%/24156 
      🟩 Intel              Pass: 100%/3   | Total Time:  1h 39m | Avg Time: 33m 17s | Hits:   5%/1548  
      🟩 MSVC               Pass: 100%/6   | Total Time:  4h 57m | Avg Time: 49m 35s | Hits:   0%/3054  
    🟩 gpu
      🟩 v100               Pass: 100%/99  | Total Time: 13h 09m | Avg Time:  7m 58s | Hits:  90%/50817 
    🟩 jobs
      🟩 build              Pass: 100%/91  | Total Time: 10h 54m | Avg Time:  7m 11s | Hits:  90%/46709 
      🟩 test               Pass: 100%/8   | Total Time:  2h 14m | Avg Time: 16m 50s | Hits:  89%/4108  
    🟩 os
      🟩 ubuntu18.04        Pass: 100%/14  | Total Time: 38m 42s | Avg Time:  2m 45s | Hits:  99%/7191  
      🟩 ubuntu20.04        Pass: 100%/35  | Total Time:  1h 56m | Avg Time:  3m 19s | Hits:  99%/17968 
      🟩 ubuntu22.04        Pass: 100%/44  | Total Time:  5h 36m | Avg Time:  7m 38s | Hits:  91%/22604 
      🟩 windows2022        Pass: 100%/6   | Total Time:  4h 57m | Avg Time: 49m 35s | Hits:   0%/3054  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total Time:  9m 41s | Avg Time:  3m 13s | Hits:  99%/1542  
      🟩 90a                Pass: 100%/4   | Total Time: 10m 44s | Avg Time:  2m 41s | Hits:  99%/2056  
    🟩 std
      🟩 11                 Pass: 100%/26  | Total Time:  2h 01m | Avg Time:  4m 41s | Hits:  96%/13354 
      🟩 14                 Pass: 100%/29  | Total Time:  5h 01m | Avg Time: 10m 24s | Hits:  83%/14881 
      🟩 17                 Pass: 100%/28  | Total Time:  4h 03m | Avg Time:  8m 41s | Hits:  89%/14372 
      🟩 20                 Pass: 100%/16  | Total Time:  2h 02m | Avg Time:  7m 40s | Hits:  93%/8210  
    
  • 🟩 cub: Pass: 100%/99 | Total Time: 21h 24m | Avg Time: 12m 58s | Hits: 91%/67267

    🟩 cpu
      🟩 amd64              Pass: 100%/91  | Total Time: 20h 54m | Avg Time: 13m 46s | Hits:  91%/61635 
      🟩 arm64              Pass: 100%/8   | Total Time: 29m 59s | Avg Time:  3m 44s | Hits:  99%/5632  
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total Time:  1h 56m | Avg Time:  7m 45s | Hits:  90%/9350  
      🟩 11.8               Pass: 100%/3   | Total Time: 12m 12s | Avg Time:  4m 04s | Hits:  99%/2112  
      🟩 12.4               Pass: 100%/81  | Total Time: 19h 15m | Avg Time: 14m 16s | Hits:  91%/55805 
    🟩 cudacxx_full
      🟩 clang-cuda16       Pass: 100%/2   | Total Time:  6m 45s | Avg Time:  3m 22s | Hits: 100%/1116  
      🟩 nvcc11.1           Pass: 100%/15  | Total Time:  1h 56m | Avg Time:  7m 45s | Hits:  90%/9350  
      🟩 nvcc11.8           Pass: 100%/3   | Total Time: 12m 12s | Avg Time:  4m 04s | Hits:  99%/2112  
      🟩 nvcc12.4           Pass: 100%/79  | Total Time: 19h 08m | Avg Time: 14m 32s | Hits:  91%/54689 
    🟩 cudacxx_name
      🟩 clang-cuda         Pass: 100%/2   | Total Time:  6m 45s | Avg Time:  3m 22s | Hits: 100%/1116  
      🟩 nvcc               Pass: 100%/97  | Total Time: 21h 17m | Avg Time: 13m 10s | Hits:  91%/66151 
    🟩 cxx_full
      🟩 clang9             Pass: 100%/6   | Total Time: 24m 13s | Avg Time:  4m 02s | Hits: 100%/4002  
      🟩 clang10            Pass: 100%/3   | Total Time: 13m 57s | Avg Time:  4m 39s | Hits: 100%/2118  
      🟩 clang11            Pass: 100%/4   | Total Time: 16m 12s | Avg Time:  4m 03s | Hits: 100%/2824  
      🟩 clang12            Pass: 100%/4   | Total Time: 16m 08s | Avg Time:  4m 02s | Hits: 100%/2824  
      🟩 clang13            Pass: 100%/4   | Total Time: 16m 10s | Avg Time:  4m 02s | Hits: 100%/2824  
      🟩 clang14            Pass: 100%/4   | Total Time: 16m 44s | Avg Time:  4m 11s | Hits: 100%/2824  
      🟩 clang15            Pass: 100%/4   | Total Time: 16m 14s | Avg Time:  4m 03s | Hits: 100%/2816  
      🟩 clang16            Pass: 100%/14  | Total Time:  5h 36m | Avg Time: 24m 01s | Hits: 100%/9564  
      🟩 gcc6               Pass: 100%/2   | Total Time:  6m 13s | Avg Time:  3m 06s | Hits:  99%/1256  
      🟩 gcc7               Pass: 100%/6   | Total Time: 21m 18s | Avg Time:  3m 33s | Hits:  99%/4005  
      🟩 gcc8               Pass: 100%/6   | Total Time: 41m 47s | Avg Time:  6m 57s | Hits:  92%/4005  
      🟩 gcc9               Pass: 100%/6   | Total Time: 21m 40s | Avg Time:  3m 36s | Hits:  99%/4005  
      🟩 gcc10              Pass: 100%/4   | Total Time: 16m 35s | Avg Time:  4m 08s | Hits:  99%/2824  
      🟩 gcc11              Pass: 100%/7   | Total Time: 28m 21s | Avg Time:  4m 03s | Hits:  99%/4928  
      🟩 gcc12              Pass: 100%/16  | Total Time:  5h 09m | Avg Time: 19m 20s | Hits:  99%/11264 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total Time:  1h 48m | Avg Time: 36m 12s | Hits:   3%/1890  
      🟩 MSVC14.16          Pass: 100%/1   | Total Time: 50m 38s | Avg Time: 50m 38s | Hits:   0%/549   
      🟩 MSVC14.29          Pass: 100%/2   | Total Time:  1h 31m | Avg Time: 45m 41s | Hits:   0%/1098  
      🟩 MSVC14.39          Pass: 100%/3   | Total Time:  2h 12m | Avg Time: 44m 04s | Hits:   0%/1647  
    🟩 cxx_name
      🟩 clang              Pass: 100%/43  | Total Time:  7h 36m | Avg Time: 10m 36s | Hits: 100%/29796 
      🟩 gcc                Pass: 100%/47  | Total Time:  7h 25m | Avg Time:  9m 28s | Hits:  98%/32287 
      🟩 Intel              Pass: 100%/3   | Total Time:  1h 48m | Avg Time: 36m 12s | Hits:   3%/1890  
      🟩 MSVC               Pass: 100%/6   | Total Time:  4h 34m | Avg Time: 45m 42s | Hits:   0%/3294  
    🟩 gpu
      🟩 v100               Pass: 100%/99  | Total Time: 21h 24m | Avg Time: 12m 58s | Hits:  91%/67267 
    🟩 jobs
      🟩 build              Pass: 100%/91  | Total Time: 12h 00m | Avg Time:  7m 55s | Hits:  91%/61635 
      🟩 test               Pass: 100%/8   | Total Time:  9h 23m | Avg Time:  1h 10m | Hits:  99%/5632  
    🟩 os
      🟩 ubuntu18.04        Pass: 100%/14  | Total Time:  1h 05m | Avg Time:  4m 41s | Hits:  96%/8801  
      🟩 ubuntu20.04        Pass: 100%/35  | Total Time:  2h 25m | Avg Time:  4m 09s | Hits:  99%/24710 
      🟩 ubuntu22.04        Pass: 100%/44  | Total Time: 13h 18m | Avg Time: 18m 09s | Hits:  93%/30462 
      🟩 windows2022        Pass: 100%/6   | Total Time:  4h 34m | Avg Time: 45m 42s | Hits:   0%/3294  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total Time: 12m 12s | Avg Time:  4m 04s | Hits:  99%/2112  
      🟩 90a                Pass: 100%/4   | Total Time: 13m 09s | Avg Time:  3m 17s | Hits:  99%/2816  
    🟩 std
      🟩 11                 Pass: 100%/26  | Total Time:  4h 03m | Avg Time:  9m 21s | Hits:  96%/17873 
      🟩 14                 Pass: 100%/29  | Total Time:  6h 57m | Avg Time: 14m 24s | Hits:  88%/19520 
      🟩 17                 Pass: 100%/28  | Total Time:  6h 12m | Avg Time: 13m 18s | Hits:  89%/18901 
      🟩 20                 Pass: 100%/16  | Total Time:  4h 09m | Avg Time: 15m 37s | Hits:  94%/10973 
    

@wmaxey wmaxey requested a review from miscco May 23, 2024 04:42
@miscco miscco merged commit 5c07723 into NVIDIA:main May 23, 2024
340 checks passed
Labels
None yet
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

[FEA]: Add support for SM_100 and SM_100a in <nv/target>
3 participants