From 7818d0355e947bc923a5e85472416b277572fe29 Mon Sep 17 00:00:00 2001 From: Alex Liu <35415350+zaliu@users.noreply.github.com> Date: Thu, 6 Jan 2022 15:22:08 -0800 Subject: [PATCH] Changelog update for ROCm 5.0 --- CHANGELOG.md | 20 +++++++++++++++++++- 1 file changed, 19 insertions(+), 1 deletion(-) diff --git a/CHANGELOG.md b/CHANGELOG.md index bd963a675..3e467f348 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -1,6 +1,24 @@ # Change Log for Tensile -## (Unreleased) Tensile 4.31.0 +## Tensile 4.31.0 for ROCm 5.0.0 +### Added +- DirectToLds support (x2/x4) +- DirectToVgpr support for DGEMM +- Parameter to control number of files kernels are merged into to better parallelize kernel compilation +- FP16 alternate implementation for HPA HGEMM on aldebaran +### Optimized +- Add DGEMM NN custom kernel for HPL on aldebaran +### Changed +- Update tensile_client executable to std=c++14 +### Removed +- Remove unused old Tensile client code +### Fixed +- Fix hipErrorInvalidHandle during benchmarks +- Fix addrVgpr for atomic GSU +- Fix for Python 3.8: add case for Constant nodeType +- Fix architecture mapping for gfx1011 and gfx1012 +- Fix PrintSolutionRejectionReason verbiage in KernelWriter.py +- Fix vgpr alignment problem when enabling flat buffer load ## Tensile 4.30.0 for ROCm 4.5.0 ### Added