The source code with submodules can be downloaded directly from the following link:
http://arrayfire.com/arrayfire_source/arrayfire-full-3.1.0.tar.bz2

Release Notes {#releasenotes}

v3.1.0

Function Additions

Computer Vision Functions
- nearestNeighbour() - Nearest Neighbour with SAD, SSD and SHD distances
- harris() - Harris Corner Detector
- susan() - Susan Corner Detector
- sift() - Scale Invariant Feature Transform (SIFT)
  - Method and apparatus for identifying scale invariant features"
    "in an image and use of same for locating an object in an image," David"
    "G. Lowe, US Patent 6,711,293 (March 23, 2004). Provisional application"
    "filed March 8, 1999. Asignee: The University of British Columbia. For"
    "further details, contact David Lowe (lowe@cs.ubc.ca) or the"
    "University-Industry Liaison Office of the University of British"
    "Columbia.")
  - SIFT is available for compiling but does not ship with ArrayFire
    hosted installers/pre-built libraries
- dog() - Difference of Gaussians
Image Processing Functions
- ycbcr2rgb() and rgb2ycbcr() - RGB <->YCbCr color space conversion
- wrap() and unwrap() Wrap and Unwrap
- sat() - Summed Area Tables
- loadImageMem() and saveImageMem() - Load and Save images to/from memory
  - af_image_format - Added imageFormat (af_image_format) enum
Array & Data Handling
- copy() - Copy
- array::lock() and array::unlock() - Lock and Unlock
- select() and replace() - Select and Replace
- Get array reference count (af_get_data_ref_count)
Signal Processing
- fftInPlace() - 1D in place FFT
- fft2InPlace() - 2D in place FFT
- fft3InPlace() - 3D in place FFT
- ifftInPlace() - 1D in place Inverse FFT
- ifft2InPlace() - 2D in place Inverse FFT
- ifft3InPlace() - 3D in place Inverse FFT
- fftR2C() - Real to complex FFT
- fftC2R() - Complex to Real FFT
Linear Algebra
- svd() and svdInPlace() - Singular Value Decomposition
Other operations
- sigmoid() - Sigmoid
- Sum (with option to replace NaN values)
- Product (with option to replace NaN values)
Graphics
- Window::setSize() - Window resizing using Forge API
Utility
- Allow users to set print precision (print, af_print_array_gen)
- saveArray() and readArray() - Stream arrays to binary files
- toString() - toString function returns the array and data as a string
CUDA specific functionality
- getStream() - Returns default CUDA stream ArrayFire uses for the current device
- getNativeId() - Returns native id of the CUDA device

Improvements

dot
- Allow complex inputs with conjugate option
AF_INTERP_LOWER interpolation
- For resize, rotate and transform based functions
64-bit integer support
- For reductions, random, iota, range, diff1, diff2, accum, join, shift
  and tile
convolve
- Support for non-overlapping batched convolutions
Complex Arrays
- Fix binary ops on complex inputs of mixed types
- Complex type support for exp
tile
- Performance improvements by using JIT when possible.
Add AF_API_VERSION macro
- Allows disabling of API to maintain consistency with previous versions
Other Performance Improvements
- Use reference counting to reduce unnecessary copies
CPU Backend
- Device properties for CPU
- Improved performance when all buffers are indexed linearly
CUDA Backend
- Use streams in CUDA (no longer using default stream)
- Using async cudaMem ops
- Add 64-bit integer support for JIT functions
- Performance improvements for CUDA JIT for non-linear 3D and 4D arrays
OpenCL Backend
- Improve compilation times for OpenCL backend
- Performance improvements for non-linear JIT kernels on OpenCL
- Improved shared memory load/store in many OpenCL kernels (PR 933)
- Using cl.hpp v1.2.7

Bug Fixes

Common
- Fix compatibility of c32/c64 arrays when operating with scalars
- Fix median for all values of an array
- Fix double free issue when indexing (30cbbc7)
- Fix bug in rank
- Fix default values for scale throwing exception
- Fix conjg raising exception on real input
- Fix bug when using conjugate transpose for vector input
- Fix issue with const input for array_proxy::get()
CPU Backend
- Fix randn generating same sequence for multiple calls
- Fix setSeed for randu
- Fix casting to and from complex
- Check NULL values when allocating memory
- Fix offset issue for CPU element-wise operations

The source code with submodules can be downloaded directly from the following link:
http://arrayfire.com/arrayfire_source/arrayfire-full-3.1.0.tar.bz2

New Examples

Match Template
Susan
Heston Model (contributed by Michael Nowotny)

Distribution Changes

Fixed automatic detection of ArrayFire when using with CMake in the Windows
Installer
Compiling ArrayFire with FreeImage as a static library for Linux x86
installers

Known Issues

OpenBlas can cause issues with QR factorization in CPU backend
FreeImage older than 3.10 can cause issues with loadImageMem and
saveImageMem
OpenCL backend issues on OSX
- AMD GPUs not supported because of driver issues
- Intel CPUs not supported
- Linear algebra functions do not work on Intel GPUs.
Stability and correctness issues with open source OpenCL implementations such as Beignet, GalliumCompute.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature Release