hipCUB-2.10.13 for ROCm 5.0.0
Fixed
- Added missing includes to hipcub.hpp
Added
- Bfloat16 support to test cases (device_reduce & device_radix_sort)
- Device merge sort
- Block merge sort
- API update to CUB 1.14.0
Changed
- The SetupNVCC.cmake automatic target selector select all of the capabalities of all available card for NVIDIA backend.