[CUDA] Support popcount intrinsic
See original GitHub issueUsing tvm.popcount in a function seems to cause problem when a gpu schedule is applied. For example, the function
matmul = tvm.compute((batch, out_dim), lambda i, j: \ tvm.sum(tvm.popcount(data[i, k] ^ weight[j, k]), axis=k), \ tag='binary_dense')
runs fine on cpu but throws the error
TVMError: [04:35:49] src/codegen/llvm/codegen_llvm.cc:630: unknown intrinsic popcount
when scheduled for the GPU despite a popcount intrinsic existing for cuda. This is quite an important feature for supporting binarization which is an increasingly popular method of network optimization.
Issue Analytics
- State:
- Created 6 years ago
- Comments:9 (9 by maintainers)
Top Results From Across the Web
[CUDA] Support popcount intrinsic · Issue #644 · apache/tvm
Using tvm.popcount in a function seems to cause problem when a gpu schedule is applied. For example, the function
Read more >1.10. Integer Intrinsics - NVIDIA Documentation Center
This section describes integer intrinsic functions that are only supported in device code. To use these functions you do not need to include...
Read more >Compute bit parity in CUDA - Stack Overflow
CUDA has popcount intrinsics for 32-bit and 64-bit types: __popc() and __popcll() . Does CUDA also have intrinsics to get the parity of...
Read more >Intel® Intrinsics Guide
Intel® Intrinsics Guide includes C-style functions that provide access to other instructions without writing assembly code.
Read more >Installation — Numba 0.50.1 documentation
$ conda install numba. or: · $ conda update numba. Note that Numba, like Anaconda, only supports PPC in 64-bit little-endian mode. ·...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
#774
any update on this?