Releases: ProjectPhysX/OpenCL-Benchmark
Releases · ProjectPhysX/OpenCL-Benchmark
OpenCL-Benchmark v1.3
- workaround for Nvidia driver bug:
enqueueFillBuffer
is broken for large buffers on Nvidia GPUs - fixed slow numeric drift issues
- fixed terrible performance on ARM GPUs by macro-replacing fused-multiply-add (
fma
) witha*b+c
- added automatic OS detection in
make.sh
OpenCL-Benchmark v1.2
- corrected TFlops/s estimate for Intel Data Center GPU Max series
- made correction of wrong memory reporting on Intel Arc more robust
- made CPU/GPU buffer initialization significantly faster with
std::fill
andenqueueFillBuffer
- added operating system info to OpenCL device driver version printout
- bug fix in
print_message()
function inutilities.hpp
OpenCL-Benchmark v1.1
- fixed several issues with macOS
OpenCL-Benchmark v1.0
Initial Release. Have fun!