v3.8.3
v3.8.3
Improvements
- Add support for CUDA 12 #3352
- Modernize documentation style and content #3351
- memcpy performance improvements #3144
- JIT performance improvements #3144
- join performance improvements #3144
- Improve support for Intel and newer Clang compilers #3334
- CCache support on Windows #3257
Fixes
- Fix issue with some locales with OpenCL kernel generation #3294
- Internal improvements
- Fix leak in clfft on exit.
- Fix some cases where ndims was incorrectly used ot calculate shape #3277
- Fix issue when setDevice was not called in new threads #3269
- Restrict initializer list to just fundamental types #3264
Contributions
Special thanks to our contributors:
Carlo Cabrera
Guillaume Schmid
Willy Born
ktdq