-
Notifications
You must be signed in to change notification settings - Fork 179
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Specialize relevant cuda::(std::)
types for __half/bfloat16/fp8
#525
Comments
Hi @jrhemstad, |
Hey @srinivasyadav18, thanks for your interest in helping make CCCL better! @griwes was just starting to look into this issue. He'll have a better idea of the details of what will be required and what parts you could help out with. For example, specializing |
@jrhemstad Thanks! I will coordinate with @griwes to see what I can help. |
I'm guessing we're going to have to be more careful about how we include those headers because we support versions of the CTK that may not have those headers yet. So it'll require some careful ifdefs. Here's an example from CUB: https://github.com/NVIDIA/cub/blob/0fc3c3701632a4be906765b73be20a9ad0da603d/cub/util_type.cuh#L43C1-L48 @miscco @gevtushenko may be able to help figure out the right way to guard including those headers. |
cuda::(std::)
types for __half/bfloat16
cuda::(std::)
types for __half/bfloat16/fp8
@gevtushenko says that CUB already has some of the relevant values for |
Is there a reason this needs to happen in
|
Hello team, |
Yes, this is |
Yes see #2749 where I started adding more implementations for extended floating point types |
Adding extended floating-point to |
The CUDA extended floating point types
__half
and__nv_bfloat16
and fp8 (and others) are important types for many CUDA C++ developers.As a CUDA C++ developer, I'd like it if relevant CCCL utilities like
<type_traits>
,atomic<T>
,complex<T>
all worked with these types.Tasks
complex<T>
for half and bfloat #1139<limits>
for half and bfloat16 #3044<type_traits>
atomic<T>
The text was updated successfully, but these errors were encountered: