Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Backport to 2.8: Review/Deprecate CUB util.ptx for CCCL 2.x (#3342) #3389

Merged
merged 1 commit into from
Jan 15, 2025

Conversation

bernhardmgruber
Copy link
Contributor

No description provided.

Copy link
Contributor

🟩 CI finished in 1h 47m: Pass: 100%/96 | Total: 2d 17h | Avg: 40m 49s | Max: 1h 16m | Hits: 189%/12392
  • 🟩 cub: Pass: 100%/47 | Total: 1d 15h | Avg: 50m 38s | Max: 1h 16m | Hits: 114%/3132

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  1d 13h | Avg: 50m 16s | Max:  1h 16m | Hits: 114%/3132  
      🟩 arm64              Pass: 100%/2   | Total:  1h 57m | Avg: 58m 53s | Max: 59m 57s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  5h 46m | Avg: 49m 30s | Max: 59m 55s | Hits: 113%/783   
      🟩 12.5               Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 06m
      🟩 12.6               Pass: 100%/38  | Total:  1d 07h | Avg: 50m 05s | Max:  1h 16m | Hits: 114%/2349  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 00m
      🟩 nvcc11.1           Pass: 100%/7   | Total:  5h 46m | Avg: 49m 30s | Max: 59m 55s | Hits: 113%/783   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 06m
      🟩 nvcc12.6           Pass: 100%/36  | Total:  1d 05h | Avg: 49m 30s | Max:  1h 16m | Hits: 114%/2349  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 00m
      🟩 nvcc               Pass: 100%/45  | Total:  1d 13h | Avg: 50m 12s | Max:  1h 16m | Hits: 114%/3132  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total:  3h 41m | Avg: 55m 24s | Max:  1h 02m
      🟩 Clang10            Pass: 100%/1   | Total: 54m 26s | Avg: 54m 26s | Max: 54m 26s
      🟩 Clang11            Pass: 100%/1   | Total: 58m 28s | Avg: 58m 28s | Max: 58m 28s
      🟩 Clang12            Pass: 100%/1   | Total: 58m 47s | Avg: 58m 47s | Max: 58m 47s
      🟩 Clang13            Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
      🟩 Clang14            Pass: 100%/1   | Total: 56m 46s | Avg: 56m 46s | Max: 56m 46s
      🟩 Clang15            Pass: 100%/1   | Total: 55m 15s | Avg: 55m 15s | Max: 55m 15s
      🟩 Clang16            Pass: 100%/1   | Total: 56m 42s | Avg: 56m 42s | Max: 56m 42s
      🟩 Clang17            Pass: 100%/1   | Total: 54m 37s | Avg: 54m 37s | Max: 54m 37s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 45m | Avg: 49m 19s | Max:  1h 00m
      🟩 GCC6               Pass: 100%/2   | Total:  1h 35m | Avg: 47m 30s | Max: 47m 30s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 49m | Avg: 54m 32s | Max: 56m 25s
      🟩 GCC8               Pass: 100%/1   | Total: 56m 42s | Avg: 56m 42s | Max: 56m 42s
      🟩 GCC9               Pass: 100%/3   | Total:  2h 25m | Avg: 48m 25s | Max: 53m 12s
      🟩 GCC10              Pass: 100%/1   | Total: 56m 48s | Avg: 56m 48s | Max: 56m 48s
      🟩 GCC11              Pass: 100%/1   | Total: 59m 38s | Avg: 59m 38s | Max: 59m 38s
      🟩 GCC12              Pass: 100%/3   | Total:  1h 37m | Avg: 32m 39s | Max: 55m 05s
      🟩 GCC13              Pass: 100%/8   | Total:  4h 38m | Avg: 34m 47s | Max:  1h 00m
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 59m 47s | Avg: 59m 47s | Max: 59m 47s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 59m 55s | Avg: 59m 55s | Max: 59m 55s | Hits: 113%/783   
      🟩 MSVC14.29          Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m | Hits: 118%/783   
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 16m | Hits: 113%/1566  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 06m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total: 17h 02m | Avg: 53m 48s | Max:  1h 02m
      🟩 GCC                Pass: 100%/21  | Total: 14h 58m | Avg: 42m 47s | Max:  1h 00m
      🟩 Intel              Pass: 100%/1   | Total: 59m 47s | Avg: 59m 47s | Max: 59m 47s
      🟩 MSVC               Pass: 100%/4   | Total:  4h 29m | Avg:  1h 07m | Max:  1h 16m | Hits: 114%/3132  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 06m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 42m 52s | Avg: 21m 26s | Max: 26m 41s
      🟩 v100               Pass: 100%/45  | Total:  1d 14h | Avg: 51m 56s | Max:  1h 16m | Hits: 114%/3132  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  1d 13h | Avg: 55m 59s | Max:  1h 16m | Hits: 114%/3132  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 22m 36s | Avg: 22m 36s | Max: 22m 36s
      🟩 GraphCapture       Pass: 100%/1   | Total: 15m 04s | Avg: 15m 04s | Max: 15m 04s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 01m | Avg: 20m 25s | Max: 22m 56s
      🟩 TestGPU            Pass: 100%/2   | Total: 41m 40s | Avg: 20m 50s | Max: 23m 11s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 42m 52s | Avg: 21m 26s | Max: 26m 41s
      🟩 90a                Pass: 100%/1   | Total: 24m 50s | Avg: 24m 50s | Max: 24m 50s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total:  4h 19m | Avg: 51m 53s | Max: 59m 51s
      🟩 14                 Pass: 100%/4   | Total:  3h 42m | Avg: 55m 33s | Max:  1h 02m | Hits: 113%/783   
      🟩 17                 Pass: 100%/12  | Total: 11h 32m | Avg: 57m 42s | Max:  1h 08m | Hits: 116%/1566  
      🟩 20                 Pass: 100%/26  | Total: 20h 06m | Avg: 46m 23s | Max:  1h 16m | Hits: 111%/783   
    
  • 🟩 thrust: Pass: 100%/46 | Total: 1d 01h | Avg: 32m 39s | Max: 1h 04m | Hits: 215%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 41m 38s | Avg: 20m 49s | Max: 29m 50s
    🟩 cpu
      🟩 amd64              Pass: 100%/44  | Total: 23h 58m | Avg: 32m 41s | Max:  1h 04m | Hits: 215%/9260  
      🟩 arm64              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 52s | Max: 34m 02s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  3h 32m | Avg: 30m 22s | Max: 54m 24s | Hits: 182%/1852  
      🟩 12.5               Pass: 100%/2   | Total:  1h 46m | Avg: 53m 19s | Max: 55m 47s
      🟩 12.6               Pass: 100%/37  | Total: 19h 43m | Avg: 31m 58s | Max:  1h 04m | Hits: 223%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 58m 39s | Avg: 29m 19s | Max: 29m 21s
      🟩 nvcc11.1           Pass: 100%/7   | Total:  3h 32m | Avg: 30m 22s | Max: 54m 24s | Hits: 182%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 46m | Avg: 53m 19s | Max: 55m 47s
      🟩 nvcc12.6           Pass: 100%/35  | Total: 18h 44m | Avg: 32m 07s | Max:  1h 04m | Hits: 223%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 58m 39s | Avg: 29m 19s | Max: 29m 21s
      🟩 nvcc               Pass: 100%/44  | Total:  1d 00h | Avg: 32m 48s | Max:  1h 04m | Hits: 215%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total:  1h 49m | Avg: 27m 25s | Max: 30m 37s
      🟩 Clang10            Pass: 100%/1   | Total: 34m 15s | Avg: 34m 15s | Max: 34m 15s
      🟩 Clang11            Pass: 100%/1   | Total: 32m 40s | Avg: 32m 40s | Max: 32m 40s
      🟩 Clang12            Pass: 100%/1   | Total: 30m 33s | Avg: 30m 33s | Max: 30m 33s
      🟩 Clang13            Pass: 100%/1   | Total: 35m 12s | Avg: 35m 12s | Max: 35m 12s
      🟩 Clang14            Pass: 100%/1   | Total: 31m 38s | Avg: 31m 38s | Max: 31m 38s
      🟩 Clang15            Pass: 100%/1   | Total: 34m 56s | Avg: 34m 56s | Max: 34m 56s
      🟩 Clang16            Pass: 100%/1   | Total: 35m 38s | Avg: 35m 38s | Max: 35m 38s
      🟩 Clang17            Pass: 100%/1   | Total: 31m 30s | Avg: 31m 30s | Max: 31m 30s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 51m | Avg: 24m 30s | Max: 31m 58s
      🟩 GCC6               Pass: 100%/2   | Total: 51m 39s | Avg: 25m 49s | Max: 27m 26s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 01m | Avg: 30m 45s | Max: 34m 53s
      🟩 GCC8               Pass: 100%/1   | Total: 31m 40s | Avg: 31m 40s | Max: 31m 40s
      🟩 GCC9               Pass: 100%/3   | Total:  1h 28m | Avg: 29m 21s | Max: 33m 48s
      🟩 GCC10              Pass: 100%/1   | Total: 34m 40s | Avg: 34m 40s | Max: 34m 40s
      🟩 GCC11              Pass: 100%/1   | Total: 33m 52s | Avg: 33m 52s | Max: 33m 52s
      🟩 GCC12              Pass: 100%/1   | Total: 40m 26s | Avg: 40m 26s | Max: 40m 26s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 12m | Avg: 24m 05s | Max: 41m 03s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 42m 50s | Avg: 42m 50s | Max: 42m 50s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 54m 24s | Avg: 54m 24s | Max: 54m 24s | Hits: 182%/1852  
      🟩 MSVC14.29          Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m | Hits: 178%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 35m | Avg: 51m 59s | Max:  1h 04m | Hits: 238%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 46m | Avg: 53m 19s | Max: 55m 47s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  9h 07m | Avg: 28m 49s | Max: 35m 38s
      🟩 GCC                Pass: 100%/19  | Total:  8h 54m | Avg: 28m 08s | Max: 41m 03s
      🟩 Intel              Pass: 100%/1   | Total: 42m 50s | Avg: 42m 50s | Max: 42m 50s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 30m | Avg: 54m 09s | Max:  1h 04m | Hits: 215%/9260  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 46m | Avg: 53m 19s | Max: 55m 47s
    🟩 gpu
      🟩 v100               Pass: 100%/46  | Total:  1d 01h | Avg: 32m 39s | Max:  1h 04m | Hits: 215%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total: 23h 37m | Avg: 35m 26s | Max:  1h 04m | Hits: 177%/7408  
      🟩 TestCPU            Pass: 100%/3   | Total: 49m 11s | Avg: 16m 23s | Max: 33m 47s | Hits: 365%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total: 35m 20s | Avg: 11m 46s | Max: 12m 12s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 19m 11s | Avg: 19m 11s | Max: 19m 11s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total:  2h 05m | Avg: 25m 02s | Max: 26m 42s
      🟩 14                 Pass: 100%/4   | Total:  2h 27m | Avg: 36m 50s | Max: 54m 24s | Hits: 182%/1852  
      🟩 17                 Pass: 100%/12  | Total:  7h 53m | Avg: 39m 29s | Max:  1h 00m | Hits: 176%/3704  
      🟩 20                 Pass: 100%/23  | Total: 11h 54m | Avg: 31m 03s | Max:  1h 04m | Hits: 270%/3704  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 59s | Avg: 4m 59s | Max: 7m 45s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 59s | Avg:  4m 59s | Max:  7m 45s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 59s | Avg:  4m 59s | Max:  7m 45s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 59s | Avg:  4m 59s | Max:  7m 45s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 59s | Avg:  4m 59s | Max:  7m 45s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 59s | Avg:  4m 59s | Max:  7m 45s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 59s | Avg:  4m 59s | Max:  7m 45s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 59s | Avg:  4m 59s | Max:  7m 45s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 14s | Avg:  2m 14s | Max:  2m 14s
      🟩 Test               Pass: 100%/1   | Total:  7m 45s | Avg:  7m 45s | Max:  7m 45s
    
  • 🟩 python: Pass: 100%/1 | Total: 25m 52s | Avg: 25m 52s | Max: 25m 52s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 25m 52s | Avg: 25m 52s | Max: 25m 52s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 25m 52s | Avg: 25m 52s | Max: 25m 52s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 25m 52s | Avg: 25m 52s | Max: 25m 52s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 25m 52s | Avg: 25m 52s | Max: 25m 52s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 25m 52s | Avg: 25m 52s | Max: 25m 52s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 25m 52s | Avg: 25m 52s | Max: 25m 52s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 25m 52s | Avg: 25m 52s | Max: 25m 52s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 25m 52s | Avg: 25m 52s | Max: 25m 52s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 96)

# Runner
71 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16
4 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@miscco miscco merged commit 9092760 into NVIDIA:branch/2.8.x Jan 15, 2025
113 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

3 participants