Published Roadmap #163

sirinath · 2015-11-12T03:20:43Z

Can you publish a road map to get an idea on the direction which project is going?

sirinath · 2015-11-12T03:22:11Z

Same as: #162

* Add -Wno-c++11-narrowing to ComputeCpp device compiler flags to avoid build errors on 32-bit targets. * Added SYCL support to DeviceSpec.parse_from_string - fixes a regression in running the Resnet sample from the TensorFlow models repository with SYCL. * Bumped Eigen version. * [OpenCL] Adds option to disable SYCL vectorization (tensorflow#161) Adds an option to the configure script to disable SYCL vectorization. This also rewrites and cleans up the computecpp.tpl build script, though the actual behaviour has not changed. * [OpenCL] Fixes Variable Resource op for SYCL (tensorflow#162) Recent changes to the VariableResource ops were broken for SYCL. This fixes the errors introduced by those changes. * [OpenCL] Alignment fixed in Eigen Don't need to use the alignment workaround any more, as the underlying problem is fixed in Eigen. * [OpenCL] Adds Eigen changes for new RC * [OpenCL] Adds support for SYCL devices to nn_ops_test * [OpenCL] Fixes multiple registrations of same op The registration of `ReadVariableOp` does not depend on the datatype, so we were registering more than ne of the same op. * [OpenCL] Adds naive forward pass Conv2D kernel Provides a very naive unoptimised forward convolution SYCL kernel. * [OpenCL] Adds naive backprop for SYCL Conv2D Adds both filter and input backprop * [OpenCL] Fixes multiple registrations of same op (tensorflow#163) The registration of `ReadVariableOp` does not depend on the datatype, so we were registering more than ne of the same op. * [ACL] Adding ARM Compute Library * [ACL] Adds gemm code * [ACL] Adds ARM_NO_EXCEPTIONS * [ACL] Don't register half for ARM * [ACL] Adds linking to OpenCL * Tidied up formatting of ACL integration. * Bug fixes to ARM Compute Library GEMM integration into matmul, from Duncan McBain. * Fixed typos in configure.py help messages. * Reverted formatting and logging changes that aren't related to ACL.

The GPUIndexIntrinsicOpLowering template is currently used by the code in both the GPUToNVVM and GPUToROCDL dirs. Moving it to a common location to remove code duplication. Closes #163 COPYBARA_INTEGRATE_REVIEW=tensorflow/mlir#163 from deven-amd:deven-refactor-gpu-index-ops-lowering b8dc2a5f5353df196039b6ff2ad42106028693ed PiperOrigin-RevId: 272863297

sirinath closed this as completed Nov 12, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Published Roadmap #163

Published Roadmap #163

sirinath commented Nov 12, 2015

sirinath commented Nov 12, 2015

Published Roadmap #163

Published Roadmap #163

Comments

sirinath commented Nov 12, 2015

sirinath commented Nov 12, 2015