Skip to content

1.1.0

Latest
Compare
Choose a tag to compare
@jan-wassenberg jan-wassenberg released this 18 Feb 01:33
· 345 commits to master since this release
  • Add BitCastScalar, DispatchedTarget, Foreach
  • Add Div/Mod and MaskedDiv/ModOr, SaturatedAbs, SaturatedNeg
  • Add InterleaveWholeLower/Upper, Dup128VecFromValues
  • Add IsInteger, IsIntegerLaneType, RemoveVolatile, RemoveCvRef
  • Add MaskedAdd/Sub/Mul/Div/Gather/Min/Max/SatAdd/SatSubOr
  • Add MaskFalse, IfNegativeThenNegOrUndefIfZero, PromoteEven/OddTo
  • Add ReduceMin/Max, 8-bit reductions, f16 <-> f64 conversions
  • Add Span, AlignedArray, matrix-vector mul
  • Add SumsOf2/4, I8 SumsOf8, SumsOfAdjQuadAbsDiff, SumsOfShuffledQuadAbsDiff
  • Add ThreadPool, hierarchical profiler
  • Build: use bazel_platforms
  • Enable clang16 Arm/PPC runtime dispatch, F16 for GCC AVX3_SPR
  • Extend Dot to f32*bf16, FMA to integer
  • Fix: RVV 8-bit overflow, UB in vqsort, big-endian bugs, PPC HTM
  • Improved codegen in various ops, fp16/bf16 tests and conversions
  • New targets: HWY_Z14, HWY_Z15
  • Test: add foreign_arch builders, CodeQL