Releases: ROCm/omniperf

v2.0.0 (17 May 2024)

17 May 19:35
5f0978c

Updates

  • improved logging that spans all modes (#177) (#317) (#335) (#341)
  • overhauled CI/CD that spans all modes (#179)
  • extensible SoC classes to support easy addition of new hardware configs (#180)
  • --kernel-verbose no longer overwrites kernel names (#193)
  • general cleanup and improved organization of source code (#200) (#210)
  • separate requirement files for docs and testing dependencies (#205) (#262) (#358)
  • add support for MI300 hardware (#231)
  • upgrade Grafana assets and build script to latest release (#235)
  • update minimum ROCm and Python requirements (#277)
  • sort rocprofiler input files prior to profiling (#304)
  • new --quiet option will suppress verbose output and show a progress bar (#308)
  • roofline support for Ubuntu 22.04 (#319)

Documentation available at https://rocm.github.io/omniperf/
Associated release tarball: omniperf-v2.0.0.tar.gz

v2.0.0 RC #1 (15 March 2024)

15 Mar 22:58
Pre-release

This is the first release candidate for a forthcoming v2.0.0 release expected in March 2024. The 2.0 release provides a significant refactor of the underlying code base and introduces support for MI300.

Tech Preview documentation available at: https://amdresearch.github.io/omniperf/2.x
Associated release tarball: omniperf-2.0.0-RC1.tar.gz

v2.0.0 Tech Preview #1 (03 March 2024)

03 Mar 17:55
Pre-release

This is a tech preview release for a forthcoming v2.0.0 release expected in March 2024. The 2.0 release provides a significant refactor of the underlying code base and introduces support for MI300.

Tech Preview documentation available at: https://rocm.github.io/omniperf/2.x
Associated release tarball: omniperf-2.0.0-Tech-Preview1.tar.gz

v1.1.0-PR1 (13 October 2023)

13 Oct 18:53
720369f
Pre-release

Updates

  • standardize headers to use 'avg' instead of 'mean'
  • add color code thresholds to standalone GUI to match Grafana
  • modify kernel name shortener to use cpp_filt (#168)
  • enable stochastic kernel dispatch selection (#183)
  • patch Grafana plugin module to address a known issue in the latest version (#186)
  • enhanced communication between analyze mode kernel flags (#187)

Documentation available at https://rocm.github.io/omniperf/1.x/
Associated release tarball: omniperf-v1.1.0-PR1.tar.gz

v1.0.10 (22 August 2023)

22 Aug 19:06
b65aacf

Updates

  • critical patch for detection of LLVM in ROCm installs on SLURM systems

Documentation available at https://rocm.github.io/omniperf/1.x/
Associated release tarball: omniperf-v1.0.10.tar.gz

v1.0.9 (17 August 2023)

17 Aug 19:43
eb3dc98

Updates

  • add units to L2 per-channel panel (#133)
  • new quickstart guide for Grafana setup in docs (#135)
  • more detail on kernel and dispatch filtering in docs (#136, #137)
  • patch manual join utility for ROCm >5.2.x (#139)
  • add % of peak values to low level speed-of-light panels (#140)
  • patch critical bug in Grafana by removing a deprecated plugin (#141)
  • enhancements to KernelName demangler (#142)
  • general metric updates and enhancements (#144, #155, #159)
  • add min/max/avg breakdown to instruction mix panel (#154)

Documentation available at https://rocm.github.io/omniperf/1.x/
Associated release tarball: omniperf-v1.0.9.tar.gz

v1.0.8 (30 May 2023)

31 May 00:01

PR1 Updates

  • add --kernel-names option to toggle kernelName overlay in standalone roofline plot (#93)
  • remove unused python modules (#96)
  • fix empirical roofline calculation for single dispatch workloads (#97)
  • match color of arithmetic intensity points to corresponding bw lines

PR2 Updates

  • ux improvements in standalone GUI (#101)
  • enhanced readability for filtering dropdowns in standalone GUI (#102)
  • new logfile to capture rocprofiler output (#106)
  • roofline support for sles15 sp4 and future service packs (#109)
  • adding dockerfiles for all supported Linux distros
  • new examples for --roof-only and --kernel options added to documentation

Additional Updates

  • enable CLI analysis in Windows (#110)
  • optional random port number in standalone GUI (#111)
  • limit length of visible kernelName in --kernel-names option (#115)
  • adjust metric definitions (#117, #130)
  • manually merge rocprof runs, overriding default rocprofiler implementation (#125)
  • fixed compatibility issues with Python 3.11 (#131)

Documentation available at https://rocm.github.io/omniperf/1.x/
Associated release tarball: omniperf-v1.0.8.tar.gz

v1.0.8-PR2 (17 April 2023)

18 Apr 02:20
Pre-release

Updates

  • ux improvements in standalone GUI (#101)
  • enhanced readability for filtering dropdowns in standalone GUI (#102)
  • new logfile to capture rocprofiler output (#106)
  • roofline support for sles15 sp4 and future service packs (#109)
  • adding dockerfiles for all supported Linux distros
  • new examples for --roof-only and --kernel options added to documentation

Documentation available at https://rocm.github.io/omniperf/1.x/
Associated release tarball: omniperf-v1.0.8-PR2.tar.gz

v1.0.8-PR1 (13 March 2023)

13 Mar 21:52
ac10ad2
Pre-release

Updates

  • add --kernel-names option to toggle kernelName overlay in standalone roofline plot (#93)
  • remove unused python modules (#96)
  • fix empirical roofline calculation for single dispatch workloads (#97)
  • match color of arithmetic intensity points to corresponding bw lines

Documentation available at https://rocm.github.io/omniperf/1.x/
Associated release tarball: omniperf-v1.0.8-PR1.tar.gz

v1.0.7 (22 February 2023)

22 Feb 21:08

Updates

  • update documentation (#52, #64)
  • improved detection of invalid command line arguments (#58, #76)
  • enhancements to standalone roofline (#61)
  • enable Omniperf on systems with X-server (#62)
  • raise minimum version requirement for ROCm (#64)
  • enable baseline comparison in CLI analysis (#65)
  • add multi-normalization to new metrics (#68, #81)
  • support alternative profilers (#70)
  • add MI100 configs to override rocprofiler's incomplete default (#75)
  • improve error message when no GPU(s) detected (#85)
  • separate CI tests by Linux distro and add status badges

Documentation available at https://rocm.github.io/omniperf/1.x/
Associated release tarball: omniperf-v1.0.7.tar.gz