v1.4: multi-node training and inference mode
Multi-node training
This release adds support for multi-node training through DeepSpeed. This enables you to scale out up to thousands of nodes to speed up your trainings even more!
- Add support for multi-node training #116
Check out the documentation to get started.
Inference through HPU graphs
You can now perform inference faster on Gaudi with HPU graphs.
- Add support for inference through HPU graphs in GaudiTrainer #151
HPU graphs are currently only supported for single-device runs. Check out the documentation for more information.
Synapse AI 1.8
This release is fully compatible with SynapseAI 1.8.0, which is the latest version. Check out Habana's documentation for more information about the new features.
DeepSpeed's gradient checkpointing
DeepSpeed's gradient checkpointing is now automatically used when setting gradient_checkpointing=True
in a DeepSpeed run.
- Enable DeepSpeed activation checkpointing #142