Could this be ported to OpenCL? #28
Could the project be ported to OpenCL? Currently it only supports CUDA, which is proprietary. OpenCL would be more widespread and portable.
Comments
Thanks for the feedback -- this is tracked in #22
benoitsteiner pushed a commit to benoitsteiner/tensorflow that referenced this issue on May 22, 2017:

* Fixed AVX-512 intrinsic implementation.
* OR'ed LIBXSMM_DNN_CONV_OPTION_OVERWRITE into convolution options, which folds zeroing the input buffer into its first use. This removes the call to libxsmm_dnn_zero_buffer in the case of LIBXSMM_DNN_COMPUTE_KIND_FWD.
* Rely on libxsmm_hash rather than std::hash. Brought xsmm_conv2d.cc up to date with TF/master.
* Code cleanup: use LIBXSMM_DNN_CONV_OPTION_WU_EXT_FILTER_REDUCE_OVERWRITE rather than assembling the option from separate flags.
* Avoid destroying the handle in the case of LIBXSMM_DNN_WARN_FALLBACK, since the next iteration may double-delete the same handle. One would need to update the handle cache to allow destruction at this point; however, all handles are destroyed when TF terminates (cache cleanup).
* Rely on default configuration arguments, thereby lowering the dependence on LIBXSMM internals.
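For readers unfamiliar with the flag-folding mentioned above, the sketch below shows the general pattern. The enum values and the descriptor field are illustrative stand-ins, not the real LIBXSMM headers; only the identifier names come from the commit message.

```cpp
// Minimal sketch of OR'ing an overwrite option into the convolution
// descriptor, under assumed type definitions.
enum libxsmm_dnn_conv_option {
  LIBXSMM_DNN_CONV_OPTION_NONE      = 0,
  LIBXSMM_DNN_CONV_OPTION_OVERWRITE = 1 << 0  // kernel zeroes the buffer itself
};

struct libxsmm_dnn_conv_desc {
  libxsmm_dnn_conv_option options;
  // ... remaining descriptor fields elided ...
};

void fold_overwrite_into_options(libxsmm_dnn_conv_desc& desc) {
  // OR the flag into whatever options are already set. With the flag in
  // place the convolution kernel zero-initializes the buffer on its first
  // use, so the separate libxsmm_dnn_zero_buffer() pass for
  // LIBXSMM_DNN_COMPUTE_KIND_FWD can be dropped.
  desc.options = static_cast<libxsmm_dnn_conv_option>(
      desc.options | LIBXSMM_DNN_CONV_OPTION_OVERWRITE);
}
```

The payoff of this design is that the zeroing is fused into the compute kernel, eliminating a separate pass over the buffer and its memory traffic.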
benoitsteiner added a commit that referenced this issue on May 22, 2017:

Fixed AVX-512 intrinsic layer (sparse_matmul_op.h). Incorporated LIBXSMM_DNN_CONV_OPTION_OVERWRITE. (#26)

* Fixed AVX-512 intrinsic implementation.
* OR'ed LIBXSMM_DNN_CONV_OPTION_OVERWRITE into convolution options, which folds zeroing the input buffer into its first use. This removes the call to libxsmm_dnn_zero_buffer in the case of LIBXSMM_DNN_COMPUTE_KIND_FWD.
* Made xsmm_conv2d.cc up to date with TF/master, avoid double-free in the case of LIBXSMM_DNN_WARN_FALLBACK, use libxsmm_hash instead of std::hash, code cleanup (#27)
  * Fixed AVX-512 intrinsic implementation.
  * OR'ed LIBXSMM_DNN_CONV_OPTION_OVERWRITE into convolution options, which folds zeroing the input buffer into its first use. This removes the call to libxsmm_dnn_zero_buffer in the case of LIBXSMM_DNN_COMPUTE_KIND_FWD.
  * Rely on libxsmm_hash rather than std::hash. Brought xsmm_conv2d.cc up to date with TF/master.
  * Code cleanup: use LIBXSMM_DNN_CONV_OPTION_WU_EXT_FILTER_REDUCE_OVERWRITE rather than assembling the option from separate flags.
  * Avoid destroying the handle in the case of LIBXSMM_DNN_WARN_FALLBACK, since the next iteration may double-delete the same handle. One would need to update the handle cache to allow destruction at this point; however, all handles are destroyed when TF terminates (cache cleanup).
* Configure LIBXSMM with default arguments (#28)
  * Fixed AVX-512 intrinsic implementation.
  * OR'ed LIBXSMM_DNN_CONV_OPTION_OVERWRITE into convolution options, which folds zeroing the input buffer into its first use. This removes the call to libxsmm_dnn_zero_buffer in the case of LIBXSMM_DNN_COMPUTE_KIND_FWD.
  * Rely on libxsmm_hash rather than std::hash. Brought xsmm_conv2d.cc up to date with TF/master.
  * Code cleanup: use LIBXSMM_DNN_CONV_OPTION_WU_EXT_FILTER_REDUCE_OVERWRITE rather than assembling the option from separate flags.
  * Avoid destroying the handle in the case of LIBXSMM_DNN_WARN_FALLBACK, since the next iteration may double-delete the same handle. One would need to update the handle cache to allow destruction at this point; however, all handles are destroyed when TF terminates (cache cleanup).
  * Rely on default configuration arguments, thereby lowering the dependence on LIBXSMM internals.
benoitsteiner pushed a commit to benoitsteiner/tensorflow that referenced this issue on Jun 15, 2017:

* Fixed AVX-512 intrinsic implementation.
* OR'ed LIBXSMM_DNN_CONV_OPTION_OVERWRITE into convolution options, which folds zeroing the input buffer into its first use. This removes the call to libxsmm_dnn_zero_buffer in the case of LIBXSMM_DNN_COMPUTE_KIND_FWD.
* Rely on libxsmm_hash rather than std::hash. Brought xsmm_conv2d.cc up to date with TF/master.
* Code cleanup: use LIBXSMM_DNN_CONV_OPTION_WU_EXT_FILTER_REDUCE_OVERWRITE rather than assembling the option from separate flags.
* Avoid destroying the handle in the case of LIBXSMM_DNN_WARN_FALLBACK, since the next iteration may double-delete the same handle. One would need to update the handle cache to allow destruction at this point; however, all handles are destroyed when TF terminates (cache cleanup).
* Rely on default configuration arguments, thereby lowering the dependence on LIBXSMM internals.
tarasglek pushed a commit to tarasglek/tensorflow that referenced this issue on Jun 20, 2017:

Fix typo mistake, fixes tensorflow#27.
lukeiwanski referenced this issue in codeplaysoftware/tensorflow on Oct 26, 2017:

* [OpenCL] Registers Conv2DBackpropFilter
* Aligned '\'
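As context for the registration mentioned above: TensorFlow builds of that era with SYCL support exposed kernels on the OpenCL-backed device through the usual registration macro. The sketch below is a hypothetical outline of that pattern, not the actual codeplaysoftware implementation; the kernel class and its body are placeholders.

```cpp
// Hypothetical sketch of registering a kernel for the SYCL (OpenCL)
// device in TensorFlow circa 2017; only the registration pattern is
// the point here.
#include "tensorflow/core/framework/op_kernel.h"

namespace tensorflow {

class Conv2DBackpropFilterSyclOp : public OpKernel {
 public:
  explicit Conv2DBackpropFilterSyclOp(OpKernelConstruction* ctx)
      : OpKernel(ctx) {}

  void Compute(OpKernelContext* ctx) override {
    // ... enqueue the SYCL implementation of the filter gradient ...
  }
};

#ifdef TENSORFLOW_USE_SYCL
REGISTER_KERNEL_BUILDER(Name("Conv2DBackpropFilter")
                            .Device(DEVICE_SYCL)
                            .TypeConstraint<float>("T"),
                        Conv2DBackpropFilterSyclOp);
#endif  // TENSORFLOW_USE_SYCL

}  // namespace tensorflow
```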
hfp added a commit to hfp/tensorflow that referenced this issue on Jan 4, 2019:

* Fixed AVX-512 intrinsic implementation.
* OR'ed LIBXSMM_DNN_CONV_OPTION_OVERWRITE into convolution options, which folds zeroing the input buffer into its first use. This removes the call to libxsmm_dnn_zero_buffer in the case of LIBXSMM_DNN_COMPUTE_KIND_FWD.
* Rely on libxsmm_hash rather than std::hash. Brought xsmm_conv2d.cc up to date with TF/master.
* Code cleanup: use LIBXSMM_DNN_CONV_OPTION_WU_EXT_FILTER_REDUCE_OVERWRITE rather than assembling the option from separate flags.
* Avoid destroying the handle in the case of LIBXSMM_DNN_WARN_FALLBACK, since the next iteration may double-delete the same handle. One would need to update the handle cache to allow destruction at this point; however, all handles are destroyed when TF terminates (cache cleanup).
* Rely on default configuration arguments, thereby lowering the dependence on LIBXSMM internals.
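The handle-cache point in these commit messages is subtle enough to merit a sketch. The following is a minimal illustration under assumed names (the cache type and key are invented for the example; only the LIBXSMM identifiers come from the messages above): a handle that hit LIBXSMM_DNN_WARN_FALLBACK stays in the cache rather than being destroyed eagerly, because an eager destroy would leave a stale entry for the next iteration to delete again.

```cpp
#include <cstddef>
#include <unordered_map>

// Stand-ins: the real libxsmm_dnn_layer is opaque and destroyed through
// the LIBXSMM API; these stubs just make the sketch self-contained.
struct libxsmm_dnn_layer {};
static void libxsmm_dnn_destroy_conv_layer(libxsmm_dnn_layer* handle) {
  delete handle;
}

// Handles that triggered LIBXSMM_DNN_WARN_FALLBACK stay in the cache:
// destroying one eagerly would leave a dangling entry that a later
// lookup could delete a second time. Every handle is instead destroyed
// exactly once, when the cache itself is torn down (i.e. when TF
// terminates).
class ConvHandleCache {
 public:
  libxsmm_dnn_layer*& slot(std::size_t key) { return cache_[key]; }

  ~ConvHandleCache() {
    for (auto& entry : cache_) libxsmm_dnn_destroy_conv_layer(entry.second);
  }

 private:
  std::unordered_map<std::size_t, libxsmm_dnn_layer*> cache_;
};
```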
eggonlea pushed a commit to eggonlea/tensorflow that referenced this issue on Mar 12, 2019:

Revert "Add Boost.Locale as a dependency for the build"
cjolivier01 pushed a commit to Cerebras/tensorflow that referenced this issue on Dec 6, 2019:

…ch_to_eigen_fork Switch to using the ROCm fork for eigen.
nammbash referenced this issue in Intel-tensorflow/tensorflow on May 18, 2020 (closed).