Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Backport to 2.8: Deprecate cub::DeviceSpmv (#3320) #3374

Open
wants to merge 2 commits into
base: branch/2.8.x
Choose a base branch
from

Conversation

bernhardmgruber
Copy link
Contributor

No description provided.

@bernhardmgruber bernhardmgruber requested review from a team as code owners January 14, 2025 08:45
@bernhardmgruber
Copy link
Contributor Author

That error is truly haunting me these days:
/home/coder/cccl/lib/cmake/cub/../../../cub/cub/device/dispatch/dispatch_spmv_orig.cuh(89): internal error #2656: assertion failed: alloc_copy_of_pending_pragma: copied pragma has source sequence entry (pragma.c, line 518 in alloc_copy_of_pending_pragma)

Copy link
Contributor

🟨 CI finished in 1h 23m: Pass: 95%/96 | Total: 16h 32m | Avg: 10m 20s | Max: 51m 26s | Hits: 422%/12392
  • 🟨 cub: Pass: 91%/47 | Total: 8h 53m | Avg: 11m 21s | Max: 51m 26s | Hits: 589%/3132

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  91%/45  | Total:  8h 43m | Avg: 11m 37s | Max: 51m 26s | Hits: 589%/3132  
      🟩 arm64              Pass: 100%/2   | Total: 10m 31s | Avg:  5m 15s | Max:  5m 16s
    🔍 ctk: 11.1 🔍
      🔍 11.1               Pass:  42%/7   | Total:  1h 00m | Avg:  8m 41s | Max: 42m 56s | Hits: 589%/783   
      🟩 12.5               Pass: 100%/2   | Total: 18m 26s | Avg:  9m 13s | Max:  9m 22s
      🟩 12.6               Pass: 100%/38  | Total:  7h 34m | Avg: 11m 57s | Max: 51m 26s | Hits: 589%/2349  
    🔍 cudacxx: nvcc11.1 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 34s | Avg:  4m 47s | Max:  4m 54s
      🔍 nvcc11.1           Pass:  42%/7   | Total:  1h 00m | Avg:  8m 41s | Max: 42m 56s | Hits: 589%/783   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 18m 26s | Avg:  9m 13s | Max:  9m 22s
      🟩 nvcc12.6           Pass: 100%/36  | Total:  7h 25m | Avg: 12m 21s | Max: 51m 26s | Hits: 589%/2349  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 34s | Avg:  4m 47s | Max:  4m 54s
      🔍 nvcc               Pass:  91%/45  | Total:  8h 44m | Avg: 11m 39s | Max: 51m 26s | Hits: 589%/3132  
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/19  | Total:  2h 39m | Avg:  8m 22s | Max: 33m 05s
      🔍 GCC                Pass:  80%/21  | Total:  3h 34m | Avg: 10m 13s | Max: 51m 26s
      🟩 Intel              Pass: 100%/1   | Total:  6m 53s | Avg:  6m 53s | Max:  6m 53s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 14m | Avg: 33m 38s | Max: 42m 56s | Hits: 589%/3132  
      🟩 NVHPC              Pass: 100%/2   | Total: 18m 26s | Avg:  9m 13s | Max:  9m 22s
    🔍 gpu: v100 🔍
      🟩 h100               Pass: 100%/2   | Total: 20m 40s | Avg: 10m 20s | Max: 16m 16s
      🔍 v100               Pass:  91%/45  | Total:  8h 33m | Avg: 11m 24s | Max: 51m 26s | Hits: 589%/3132  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  90%/40  | Total:  5h 35m | Avg:  8m 22s | Max: 42m 56s | Hits: 589%/3132  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 30m 04s | Avg: 30m 04s | Max: 30m 04s
      🟩 GraphCapture       Pass: 100%/1   | Total: 15m 48s | Avg: 15m 48s | Max: 15m 48s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 08m | Avg: 22m 49s | Max: 26m 49s
      🟩 TestGPU            Pass: 100%/2   | Total:  1h 24m | Avg: 42m 15s | Max: 51m 26s
    🟨 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 23m 16s | Avg:  5m 49s | Max:  7m 14s
      🟩 Clang10            Pass: 100%/1   | Total:  9m 39s | Avg:  9m 39s | Max:  9m 39s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 29s | Avg:  5m 29s | Max:  5m 29s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 38s | Avg:  5m 38s | Max:  5m 38s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 39s | Avg:  5m 39s | Max:  5m 39s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 48s | Avg:  5m 48s | Max:  5m 48s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 40s | Avg:  5m 40s | Max:  5m 40s
      🟩 Clang16            Pass: 100%/1   | Total:  6m 07s | Avg:  6m 07s | Max:  6m 07s
      🟩 Clang17            Pass: 100%/1   | Total:  6m 20s | Avg:  6m 20s | Max:  6m 20s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 25m | Avg: 12m 14s | Max: 33m 05s
      🟥 GCC6               Pass:   0%/2   | Total:  3m 54s | Avg:  1m 57s | Max:  1m 59s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 41s | Avg:  5m 20s | Max:  5m 36s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 57s | Avg:  5m 57s | Max:  5m 57s
      🟨 GCC9               Pass:  33%/3   | Total: 10m 03s | Avg:  3m 21s | Max:  5m 44s
      🟩 GCC10              Pass: 100%/1   | Total:  6m 10s | Avg:  6m 10s | Max:  6m 10s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 56s | Avg:  5m 56s | Max:  5m 56s
      🟩 GCC12              Pass: 100%/3   | Total: 26m 33s | Avg:  8m 51s | Max: 16m 16s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 25m | Avg: 18m 12s | Max: 51m 26s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  6m 53s | Avg:  6m 53s | Max:  6m 53s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 42m 56s | Avg: 42m 56s | Max: 42m 56s | Hits: 589%/783   
      🟩 MSVC14.29          Pass: 100%/1   | Total: 30m 10s | Avg: 30m 10s | Max: 30m 10s | Hits: 589%/783   
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 01m | Avg: 30m 43s | Max: 32m 52s | Hits: 589%/1566  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 18m 26s | Avg:  9m 13s | Max:  9m 22s
    🟨 std
      🟨 11                 Pass:  60%/5   | Total: 19m 55s | Avg:  3m 59s | Max:  6m 18s
      🟨 14                 Pass:  75%/4   | Total: 57m 41s | Avg: 14m 25s | Max: 42m 56s | Hits: 589%/783   
      🟨 17                 Pass:  91%/12  | Total:  2h 00m | Avg: 10m 04s | Max: 30m 10s | Hits: 589%/1566  
      🟩 20                 Pass: 100%/26  | Total:  5h 35m | Avg: 12m 54s | Max: 51m 26s | Hits: 589%/783   
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 20m 40s | Avg: 10m 20s | Max: 16m 16s
      🟩 90a                Pass: 100%/1   | Total:  4m 13s | Avg:  4m 13s | Max:  4m 13s
    
  • 🟩 thrust: Pass: 100%/46 | Total: 7h 02m | Avg: 9m 11s | Max: 34m 45s | Hits: 366%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 23m 01s | Avg: 11m 30s | Max: 17m 22s
    🟩 cpu
      🟩 amd64              Pass: 100%/44  | Total:  6h 53m | Avg:  9m 23s | Max: 34m 45s | Hits: 366%/9260  
      🟩 arm64              Pass: 100%/2   | Total:  9m 24s | Avg:  4m 42s | Max:  4m 55s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 51m 53s | Avg:  7m 24s | Max: 26m 57s | Hits: 368%/1852  
      🟩 12.5               Pass: 100%/2   | Total: 27m 56s | Avg: 13m 58s | Max: 14m 07s
      🟩 12.6               Pass: 100%/37  | Total:  5h 42m | Avg:  9m 15s | Max: 34m 45s | Hits: 365%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 00s | Avg:  5m 00s | Max:  5m 08s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 51m 53s | Avg:  7m 24s | Max: 26m 57s | Hits: 368%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 27m 56s | Avg: 13m 58s | Max: 14m 07s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  5h 32m | Avg:  9m 30s | Max: 34m 45s | Hits: 365%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 00s | Avg:  5m 00s | Max:  5m 08s
      🟩 nvcc               Pass: 100%/44  | Total:  6h 52m | Avg:  9m 22s | Max: 34m 45s | Hits: 366%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 20m 48s | Avg:  5m 12s | Max:  6m 25s
      🟩 Clang10            Pass: 100%/1   | Total:  6m 37s | Avg:  6m 37s | Max:  6m 37s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 34s | Avg:  5m 34s | Max:  5m 34s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 16s | Avg:  5m 16s | Max:  5m 16s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 10s | Avg:  5m 10s | Max:  5m 10s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 38s | Avg:  5m 38s | Max:  5m 38s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 44s | Avg:  5m 44s | Max:  5m 44s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 43s | Avg:  5m 43s | Max:  5m 43s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 24s | Avg:  5m 24s | Max:  5m 24s
      🟩 Clang18            Pass: 100%/7   | Total: 52m 02s | Avg:  7m 26s | Max: 18m 39s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 03s | Avg:  4m 01s | Max:  4m 04s
      🟩 GCC7               Pass: 100%/2   | Total:  9m 28s | Avg:  4m 44s | Max:  5m 01s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 26s | Avg:  5m 26s | Max:  5m 26s
      🟩 GCC9               Pass: 100%/3   | Total: 13m 57s | Avg:  4m 39s | Max:  5m 25s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 18s | Avg:  5m 18s | Max:  5m 18s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 41s | Avg:  5m 41s | Max:  5m 41s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 56s | Avg:  5m 56s | Max:  5m 56s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 10m | Avg:  8m 47s | Max: 18m 39s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  6m 54s | Avg:  6m 54s | Max:  6m 54s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 26m 57s | Avg: 26m 57s | Max: 26m 57s | Hits: 368%/1852  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 26m 59s | Avg: 26m 59s | Max: 26m 59s | Hits: 365%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  1h 31m | Avg: 30m 35s | Max: 34m 45s | Hits: 365%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 27m 56s | Avg: 13m 58s | Max: 14m 07s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  1h 57m | Avg:  6m 12s | Max: 18m 39s
      🟩 GCC                Pass: 100%/19  | Total:  2h 04m | Avg:  6m 32s | Max: 18m 39s
      🟩 Intel              Pass: 100%/1   | Total:  6m 54s | Avg:  6m 54s | Max:  6m 54s
      🟩 MSVC               Pass: 100%/5   | Total:  2h 25m | Avg: 29m 08s | Max: 34m 45s | Hits: 366%/9260  
      🟩 NVHPC              Pass: 100%/2   | Total: 27m 56s | Avg: 13m 58s | Max: 14m 07s
    🟩 gpu
      🟩 v100               Pass: 100%/46  | Total:  7h 02m | Avg:  9m 11s | Max: 34m 45s | Hits: 366%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  5h 18m | Avg:  7m 57s | Max: 28m 31s | Hits: 366%/7408  
      🟩 TestCPU            Pass: 100%/3   | Total: 49m 54s | Avg: 16m 38s | Max: 34m 45s | Hits: 365%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total: 54m 40s | Avg: 18m 13s | Max: 18m 39s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 26s | Avg:  4m 26s | Max:  4m 26s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 22m 16s | Avg:  4m 27s | Max:  6m 02s
      🟩 14                 Pass: 100%/4   | Total: 42m 27s | Avg: 10m 36s | Max: 26m 57s | Hits: 368%/1852  
      🟩 17                 Pass: 100%/12  | Total:  1h 59m | Avg:  9m 57s | Max: 28m 30s | Hits: 365%/3704  
      🟩 20                 Pass: 100%/23  | Total:  3h 35m | Avg:  9m 22s | Max: 34m 45s | Hits: 365%/3704  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 01s | Avg: 5m 00s | Max: 8m 00s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  8m 00s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  8m 00s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  8m 00s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  8m 00s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  8m 00s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  8m 00s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  8m 00s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 01s | Avg:  2m 01s | Max:  2m 01s
      🟩 Test               Pass: 100%/1   | Total:  8m 00s | Avg:  8m 00s | Max:  8m 00s
    
  • 🟩 python: Pass: 100%/1 | Total: 25m 47s | Avg: 25m 47s | Max: 25m 47s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 25m 47s | Avg: 25m 47s | Max: 25m 47s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 25m 47s | Avg: 25m 47s | Max: 25m 47s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 25m 47s | Avg: 25m 47s | Max: 25m 47s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 25m 47s | Avg: 25m 47s | Max: 25m 47s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 25m 47s | Avg: 25m 47s | Max: 25m 47s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 25m 47s | Avg: 25m 47s | Max: 25m 47s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 25m 47s | Avg: 25m 47s | Max: 25m 47s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 25m 47s | Avg: 25m 47s | Max: 25m 47s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 96)

# Runner
71 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16
4 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: In Review
Development

Successfully merging this pull request may close these issues.

3 participants