question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

[CUDA] Support popcount intrinsic

See original GitHub issue

Using tvm.popcount in a function seems to cause problem when a gpu schedule is applied. For example, the function matmul = tvm.compute((batch, out_dim), lambda i, j: \ tvm.sum(tvm.popcount(data[i, k] ^ weight[j, k]), axis=k), \ tag='binary_dense') runs fine on cpu but throws the error TVMError: [04:35:49] src/codegen/llvm/codegen_llvm.cc:630: unknown intrinsic popcount when scheduled for the GPU despite a popcount intrinsic existing for cuda. This is quite an important feature for supporting binarization which is an increasingly popular method of network optimization.

Issue Analytics

  • State:closed
  • Created 6 years ago
  • Comments:9 (9 by maintainers)

github_iconTop GitHub Comments

1reaction
Huyuweicommented, Jan 11, 2018
0reactions
tqchencommented, Nov 28, 2017

any update on this?

Read more comments on GitHub >

github_iconTop Results From Across the Web

[CUDA] Support popcount intrinsic · Issue #644 · apache/tvm
Using tvm.popcount in a function seems to cause problem when a gpu schedule is applied. For example, the function
Read more >
1.10. Integer Intrinsics - NVIDIA Documentation Center
This section describes integer intrinsic functions that are only supported in device code. To use these functions you do not need to include...
Read more >
Compute bit parity in CUDA - Stack Overflow
CUDA has popcount intrinsics for 32-bit and 64-bit types: __popc() and __popcll() . Does CUDA also have intrinsics to get the parity of...
Read more >
Intel® Intrinsics Guide
Intel® Intrinsics Guide includes C-style functions that provide access to other instructions without writing assembly code.
Read more >
Installation — Numba 0.50.1 documentation
$ conda install numba. or: · $ conda update numba. Note that Numba, like Anaconda, only supports PPC in 64-bit little-endian mode. ·...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found