Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Task/rhornung67/device numeric limits #1196

Merged
merged 18 commits into from
Aug 12, 2024

Conversation

rhornung67
Copy link
Member

@rhornung67 rhornung67 commented Oct 4, 2023

Summary

  • This PR adds a type alias layer to be able to access C++ std::numeric_limits functionality in host and device code (CUDA and HIP) in a syntactically identical way.
  • This will enable us to replace our usage of std::numeric_limits with axom::numeric_limits for consistency across the code and to prevent unseemly compiler warnings.

Resolves #1195

Also, see #1338

@rhornung67
Copy link
Member Author

rhornung67 commented Oct 5, 2023

With the changes in this PR, is there any reason we need this stuff: https://github.com/LLNL/axom/blob/develop/src/axom/core/numerics/floating_point_limits.hpp ? We are currently not consistent across the code.


namespace axom
{
#if defined(AXOM_USE_CUDA) && defined(AXOM_DEVICE_CODE)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would it make sense to remove if defined(AXOM_DEVICE_CODE) and just have the AXOM_USE_CUDA guard? It's my impression that cuda::std should work on both the host and device.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If it does, that would be preferable. I need to look into that. @kennyweiss discussed this PR yesterday. That resulted in some concerns about several things in the code that we should discuss as a team. Unfortunately, that will have to wait for a couple of weeks as next Monday is a LLNL holiday and NECDC is the week after that.

Copy link
Member Author

@rhornung67 rhornung67 Oct 6, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@publixsubfan cuda::std does work in device and host code. I ran into some issues with some Axom tests where cuda::std::numeric_limits does not support long double. The intent of my change was to use std::numeric_limits in host code for all builds. However, it's not clear to me that we need to support long double. long double is automatically converted to double in device code and attempting to pass long double data between host and device code is problematic.

Copy link
Contributor

@gunney1 gunney1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @rhornung67 This looks useful. I hope the long double issue is not a blocker.

@rhornung67 rhornung67 marked this pull request as ready for review July 11, 2024 21:13
@rhornung67 rhornung67 changed the title [WIP] Task/rhornung67/device numeric limits Task/rhornung67/device numeric limits Jul 15, 2024
@rhornung67
Copy link
Member Author

Folks, could I get another pair of eyeballs (at least) to review this PR?

// configured with CUDA enabled. No need to rely on two different
// header files in that case.
template <typename T>
using numeric_limits = cuda::std::numeric_limits<T>;
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is the point of using CUDA-specific numeric limits to get host device on them? Or, does that not matter since they are probably all constexpr? Does HIP have a header like this? Thanks.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The main point of this is to prevent compiler warnings (calling host function from host-device function) that a few users have reported. It is not needed for HIP since the amdclang compiler is a unified host-device compiler and know how to sort everything out.

@rhornung67 rhornung67 merged commit f036036 into develop Aug 12, 2024
13 checks passed
@rhornung67 rhornung67 deleted the task/rhornung67/device-numeric-limits branch August 12, 2024 22:25
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Replace C++ 'std' methods with Axom core methods in host-device functions.
5 participants