Skip to content

New Neural Net Architecture! (a few more followup bugfixes)

Compare
Choose a tag to compare
@lightvector lightvector released this 11 Jan 05:09
· 462 commits to master since this release

This release is not the latest release, see newer release v1.12.3 for further bugfixes!

This is a bugfix release following a release of KataGo that supports a new neural net architecture, v1.12.0!
If you want to know more about the improvements and/or other API changes, check the release notes there!*

Users of the TensorRT version upgrading to this version of KataGo will also need to upgrade from TensorRT 8.2 to TensorRT 8.5

If you're a new user, don't forget to check out this section for getting started and basic usage! If you don't know which version to choose (OpenCL, CUDA, TensorRT, Eigen, Eigen AVX2), read this: https://github.com/lightvector/KataGo#opencl-vs-cuda-vs-tensorrt-vs-eigen

Also, KataGo is continuing to improve at https://katagotraining.org/ and if you'd like to donate your spare GPU cycles and support it, it could use your help there!

Changes

In addition to the bugfix to TensorRT computing incorrect values in v1.12.1, this release:

  • Fixes some major issues in OpenCL (not just TensorRT) where the OpenCL tuner may select extremely poorly performing or even outright bad or failing parameters sometimes.
  • Upgrades TensorRT from 8.2 to 8.5 and substantially improves loading and timing-cache initialization times for multi-GPU machines, and removes dependency of TensorRT on CUDNN, also supports newer GPUs. Thanks to @hyln9 for all of this work!
  • Adds some support in config parsing to be able to specify file paths, passwords, or other strings with hash signs or trailing spaces.
  • Adds some better internal tests and error checking for contributing data to the public run.