Skip to content

v0.7.0: SDK3.3, Whisper on 1 IPU, MT5, transformers 4.29

Compare
Choose a tag to compare
@jimypbr jimypbr released this 13 Jul 11:29

What's Changed

  • Optimum has been updated to support Poplar SDK 3.3.
  • A new feature in that SDK is the poptorch.cond operation, which enables conditional compute. This enabled us to implement some new optimisations.
  • Using the the new cond operation we are able to fit Whisper-tiny encoder and decoder on a single IPU. To enable, pass the option use_cond_encoder to Whisper's parallelize method.
  • Added the option for cross-attention KV caching in Whisper, also using the cond op. To enable, pass the option use_cross_cache to Whisper's parallelize method.
  • We added support for the MT5 model for summarisation and translation tasks.
  • The version of transformers has been updated to 4.29. One of the things this enables in Optimum is Whisper timestamp decoding.
  • Added optimum.graphcore.models.whisper.WhisperProcessorTorch - a faster, drop-in replacement for transformers.WhisperProcessor.
  • The pod_type argument, which was deprecated in 0.6.1, has been removed.

Commits

Full Changelog: v0.6.1...v0.7.0