[PLAN] Improve performance with dimension compaction and indexer #105

Open
sonots opened this issue Nov 12, 2018 · 0 comments

sonots commented Nov 12, 2018

  1. Stop using ndloop and compute each operation in a single CUDA kernel that uses an indexer
  2. Compact dimensions so that the indexer's index computation is fast
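
The two steps above can be sketched as follows. This is only an illustration in plain Python, not cumo's actual C/CUDA code: the names `compact_dims` and `offset_of` are hypothetical, strides are assumed to be counted in elements (not bytes), and arrays are assumed to be non-empty. `compact_dims` merges adjacent dimensions that are laid out contiguously with respect to each other, and `offset_of` plays the role of an indexer by turning a flat thread index into a memory offset. Fewer dimensions mean fewer divisions and modulo operations per element.

```python
def compact_dims(shape, strides):
    """Merge adjacent dimensions that are contiguous with each other.

    Dimension i can be merged into dimension i+1 when
    strides[i] == shape[i+1] * strides[i+1] (strides in elements).
    """
    new_shape, new_strides = [], []
    for dim, stride in zip(shape, strides):
        if dim == 1:
            continue  # size-1 dimensions never contribute to the offset
        if new_shape and new_strides[-1] == dim * stride:
            # The previous dimension steps exactly over this one: fuse them.
            new_shape[-1] *= dim
            new_strides[-1] = stride
        else:
            new_shape.append(dim)
            new_strides.append(stride)
    if not new_shape:  # scalar, or every dimension had size 1
        return [1], [0]
    return new_shape, new_strides


def offset_of(i, shape, strides):
    """Indexer: map flat row-major index i to an element offset."""
    offset = 0
    # Peel off the innermost dimension first.
    for dim, stride in zip(reversed(shape), reversed(strides)):
        i, r = divmod(i, dim)
        offset += r * stride
    return offset


# A fully contiguous 2x3x4 array collapses to one dimension.
print(compact_dims([2, 3, 4], [12, 4, 1]))
# A 2x3x4 view into a 2x3x8 buffer collapses to two dimensions.
print(compact_dims([2, 3, 4], [24, 8, 1]))
```

In a single-kernel design, each GPU thread would run the equivalent of `offset_of` on its own flat index, so the compacted shape directly reduces the per-thread arithmetic while addressing the same elements.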

Element-wise (binary) operations were already done in #64.
However, reductions and other operations such as store_from are not yet done.

Without this, cumo (and red-chainer) cannot compete with cupy (and chainer).

Current performance comparison on a K80 machine:

  • chainer mnist: 5 sec / epoch
  • red-chainer mnist: 13 sec / epoch
naitoh pushed a commit to naitoh/cumo that referenced this issue Jan 18, 2019
sonots changed the title from "Improve performance with dimension compaction and indexer" to "[PLAN] Improve performance with dimension compaction and indexer" on Mar 23, 2019