Skip to content

Tensile 4.30.0 for ROCm 4.5.0

Compare
Choose a tag to compare
@lawruble13 lawruble13 released this 27 Oct 21:30
bb19eec

Added

  • Custom Kernel mechanism for adding custom assembly kernels to Tensile
  • New assertions for problems sizes, alpha/beta values, and C equals D
  • Support setting VectorWidth in M dimension in MFMA SourceSwap configuration

Fixed

  • Fix merge.py keeping duplicate solutions
  • Fix ScheduleIterAlg 2,3 cases for aldebaran