[BUG]: block-level InclusiveScan
with initial value doesn't compute the block aggregate correctly
#3383
Labels
bug
Something isn't working right.
Is this a duplicate?
Type of Bug
Silent Failure
Component
CUB
Describe the bug
The recently-added block-level
InclusiveScan
with initial value support seems to not be taking into account such initial value for the block aggregate calculation.How to Reproduce
to catch2_test_block_scan_api.cu.
It should result in a failure due to the block aggregate computed being 0 and not 1.
FYI: a similar test is implemented in catch2_test_block_scan_api.cu, but it uses the
cub::Max()
operator so it does seem to oversee the aggregates miscalculation.Expected behavior
The block InclusiveScan with initial values should also include the initial value in the aggregates computation.
Reproduction link
No response
Operating System
No response
nvidia-smi output
No response
NVCC version
No response
The text was updated successfully, but these errors were encountered: