Skip to content

Releases: NVIDIA/TorchFort

v0.2.0

06 Sep 17:48
30b5c6f
Compare
Choose a tag to compare
v0.2.0 Pre-release
Pre-release

What's Changed

This release includes several major updates to TorchFort, including:

  • Enabling compilation of library with alternative compilers to NVHPC (e.g., GNU)
  • Enabling model/RL system training and inference on CPU
  • Enabling CPU-only builds without CUDA/NCCL
  • New reinforcement learning features, including PPO algorithms and on-policy algorithms
  • Improvements to build scripts

Breaking Changes

#14 enables placing and running models/RL systems on CPU. To enable this, an additional device argument was added to the model/system creation APIs (e.g., torchfort_model_create). Please refer to the documentation for more details.

PRs included in this release

  • Enable support for complex gradient reduction in distributed cases. (#2)
  • remove extraneous mpi call from cmake (#4)
  • generalize cmake to build for different cuda archs (#7)
  • remove hardcoded yaml-cpp path from CMakeLists.txt (#5)
  • Build updates and improvements (#10)
  • Update setup.cpp (#11)
  • merging rl changes ( #13)
  • Enable model training/inference on CPU or GPU devices. Enabling usage of alternative compilers to NVHPC, (#14)
  • Tkurth/rl ppo (#12)
  • Tkurth/rl tests (#15)
  • Add train and inference functions for 5d Fortran arrays. (#17)
  • Fix interface issues in Fortran module with gfortran. (#18)
  • Enable builds without CUDA/GPU support (#19)
  • Fixing up documentation. (#20)
  • 0.2.0 release (#21)

Full Changelog: v0.1.0...v0.2.0

v0.1.0

01 Aug 21:55
Compare
Choose a tag to compare
v0.1.0 Pre-release
Pre-release

Initial release of TorchFort.