Skip to content

Feature Release

Compare
Choose a tag to compare
@shehzan10 shehzan10 released this 28 Aug 19:40
· 3198 commits to master since this release

The source code with submodules can be downloaded directly from the following link:
http://arrayfire.com/arrayfire_source/arrayfire-full-3.1.0.tar.bz2

Release Notes {#releasenotes}

v3.1.0

Function Additions

  • Computer Vision Functions
    • nearestNeighbour() - Nearest Neighbour with SAD, SSD and SHD distances
    • harris() - Harris Corner Detector
    • susan() - Susan Corner Detector
    • sift() - Scale Invariant Feature Transform (SIFT)
      • Method and apparatus for identifying scale invariant features"
        "in an image and use of same for locating an object in an image," David"
        "G. Lowe, US Patent 6,711,293 (March 23, 2004). Provisional application"
        "filed March 8, 1999. Asignee: The University of British Columbia. For"
        "further details, contact David Lowe (lowe@cs.ubc.ca) or the"
        "University-Industry Liaison Office of the University of British"
        "Columbia.")
      • SIFT is available for compiling but does not ship with ArrayFire
        hosted installers/pre-built libraries
    • dog() - Difference of Gaussians
  • Image Processing Functions
    • ycbcr2rgb() and rgb2ycbcr() - RGB <->YCbCr color space conversion
    • wrap() and unwrap() Wrap and Unwrap
    • sat() - Summed Area Tables
    • loadImageMem() and saveImageMem() - Load and Save images to/from memory
      • af_image_format - Added imageFormat (af_image_format) enum
  • Array & Data Handling
    • copy() - Copy
    • array::lock() and array::unlock() - Lock and Unlock
    • select() and replace() - Select and Replace
    • Get array reference count (af_get_data_ref_count)
  • Signal Processing
    • fftInPlace() - 1D in place FFT
    • fft2InPlace() - 2D in place FFT
    • fft3InPlace() - 3D in place FFT
    • ifftInPlace() - 1D in place Inverse FFT
    • ifft2InPlace() - 2D in place Inverse FFT
    • ifft3InPlace() - 3D in place Inverse FFT
    • fftR2C() - Real to complex FFT
    • fftC2R() - Complex to Real FFT
  • Linear Algebra
    • svd() and svdInPlace() - Singular Value Decomposition
  • Other operations
    • sigmoid() - Sigmoid
    • Sum (with option to replace NaN values)
    • Product (with option to replace NaN values)
  • Graphics
    • Window::setSize() - Window resizing using Forge API
  • Utility
    • Allow users to set print precision (print, af_print_array_gen)
    • saveArray() and readArray() - Stream arrays to binary files
    • toString() - toString function returns the array and data as a string
  • CUDA specific functionality
    • getStream() - Returns default CUDA stream ArrayFire uses for the current device
    • getNativeId() - Returns native id of the CUDA device

Improvements

  • dot
    • Allow complex inputs with conjugate option
  • AF_INTERP_LOWER interpolation
    • For resize, rotate and transform based functions
  • 64-bit integer support
    • For reductions, random, iota, range, diff1, diff2, accum, join, shift
      and tile
  • convolve
    • Support for non-overlapping batched convolutions
  • Complex Arrays
    • Fix binary ops on complex inputs of mixed types
    • Complex type support for exp
  • tile
    • Performance improvements by using JIT when possible.
  • Add AF_API_VERSION macro
    • Allows disabling of API to maintain consistency with previous versions
  • Other Performance Improvements
    • Use reference counting to reduce unnecessary copies
  • CPU Backend
    • Device properties for CPU
    • Improved performance when all buffers are indexed linearly
  • CUDA Backend
    • Use streams in CUDA (no longer using default stream)
    • Using async cudaMem ops
    • Add 64-bit integer support for JIT functions
    • Performance improvements for CUDA JIT for non-linear 3D and 4D arrays
  • OpenCL Backend
    • Improve compilation times for OpenCL backend
    • Performance improvements for non-linear JIT kernels on OpenCL
    • Improved shared memory load/store in many OpenCL kernels (PR 933)
    • Using cl.hpp v1.2.7

Bug Fixes

  • Common
    • Fix compatibility of c32/c64 arrays when operating with scalars
    • Fix median for all values of an array
    • Fix double free issue when indexing (30cbbc7)
    • Fix bug in rank
    • Fix default values for scale throwing exception
    • Fix conjg raising exception on real input
    • Fix bug when using conjugate transpose for vector input
    • Fix issue with const input for array_proxy::get()
  • CPU Backend
    • Fix randn generating same sequence for multiple calls
    • Fix setSeed for randu
    • Fix casting to and from complex
    • Check NULL values when allocating memory
    • Fix offset issue for CPU element-wise operations

The source code with submodules can be downloaded directly from the following link:
http://arrayfire.com/arrayfire_source/arrayfire-full-3.1.0.tar.bz2

New Examples

  • Match Template
  • Susan
  • Heston Model (contributed by Michael Nowotny)

Distribution Changes

  • Fixed automatic detection of ArrayFire when using with CMake in the Windows
    Installer
  • Compiling ArrayFire with FreeImage as a static library for Linux x86
    installers

Known Issues

  • OpenBlas can cause issues with QR factorization in CPU backend
  • FreeImage older than 3.10 can cause issues with loadImageMem and
    saveImageMem
  • OpenCL backend issues on OSX
    • AMD GPUs not supported because of driver issues
    • Intel CPUs not supported
    • Linear algebra functions do not work on Intel GPUs.
  • Stability and correctness issues with open source OpenCL implementations such as Beignet, GalliumCompute.