Version 0.5.5: Minor changes
Changes since v0.5.4:
Run-time compilation functionality
- #397 : The NVRTC compilation options class now supports passing extra options to PTXAS, and also supports
--dopt
- #403 : The program builder class can now accept named header additions using
std::string
's for the name and/or header source (rather than only C-styleconst char*
strings).
Bug fixes
- #396 :
scoped_existence_ensurer_t
, the gadget for ensuring there is some current context (regardless of which) will now make sure the driver has been initialized. - #395 : Can now start profiling with our nvtx component even if the driver not yet being initialized.
Other changes
- #400 : Added an alias for waiting/synchronizing on an event: You can now execute
cuda::wait(my_event)
, not justcuda::synchronize(my_event)
. - #399 :
time_elapsed_between()
can now acceptstd::pair
's of events. - #398 : Added another example program, the CUDA sample
bandwidthtest
- #401 : Made all stream enqueuing methods
const
(so you can now enqueue on a stream passed by const-reference). - #404 : Can now construct
grid::overall_dimensions_t
from adim3
object, so that they're more interoperable with CUDA-related values you obtained elsewhere.