Skip to content

v1.4: multi-node training and inference mode

Compare
Choose a tag to compare
@regisss regisss released this 12 Feb 23:05
· 4 commits to v1.4-release since this release

Multi-node training

This release adds support for multi-node training through DeepSpeed. This enables you to scale out up to thousands of nodes to speed up your trainings even more!

  • Add support for multi-node training #116

Check out the documentation to get started.

Inference through HPU graphs

You can now perform inference faster on Gaudi with HPU graphs.

  • Add support for inference through HPU graphs in GaudiTrainer #151

HPU graphs are currently only supported for single-device runs. Check out the documentation for more information.

Synapse AI 1.8

This release is fully compatible with SynapseAI 1.8.0, which is the latest version. Check out Habana's documentation for more information about the new features.

DeepSpeed's gradient checkpointing

DeepSpeed's gradient checkpointing is now automatically used when setting gradient_checkpointing=True in a DeepSpeed run.

  • Enable DeepSpeed activation checkpointing #142