Skip to content

Release Notes

Latest
Compare
Choose a tag to compare
@YifanShenSZ YifanShenSZ released this 22 Jan 03:00
· 5 commits to main since this release
1a0d051
  • Increase conversion support coverage for models produced by torch.export
    • All common ops have been covered, that is 83% parity with our mature torch.jit.trace converter
    • A new model supported: torchaudio wav2vec
  • A new transformers graph pass "common::scaled_dot_product_attention_sliced_q" for improved performance of sdpa calculation with long sequence lengths (e.g. Depth-Anything model, with sequence length 1814, transformed by the graph pass was 34% faster and used 45% less memory on ANE)
  • New torch ops supported
    • native_group_norm
    • bool type triu
  • Bug fixes
    • Incorrect behaviour of torch.linspace
    • Wrong power 2 - sqrt fusion when power exponent is a tensor

Special thanks to our open source community contributors for this release: @RGooBS24 @fukatani @twoertwein @kasper0406