Automatically infer HCC_AMDGPU_TARGET
See original GitHub issue- build: Support
CUPY_NVCC_GENERATE_CODE=current
equivalent in ROCm to setHCC_AMDGPU_TARGET
automatically - runtime: Automatically infer
HCC_AMDGPU_TARGET
(this environment variable is only needed for hipcc case?)
Issue Analytics
- State:
- Created 3 years ago
- Comments:6 (6 by maintainers)
Top Results From Across the Web
New Features of C++: Automatic Type Inference
Automatic type inference (or type deduction) is where you allow the compiler to infer the type information used for a declaration by ...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
I skimmed hipcc code (it’s written in perl 😅 ), it seems the output from
/opt/rocm/bin/rocm_agent_enumerator -t GPU
(which prints arch for each GPU installed) is used whenHCC_AMDGPU_TARGET
is unset. I’m curious how hiprtc would behave, though.I just did a quick test locally. It seems if only building for local use, as mentioned in the above linked comment
hipcc
can also inferHCC_AMDGPU_TARGET
correctly. I confirmed this by removing thecub
andthrust
modules, unset bothHCC_AMDGPU_TARGET
and__HIP_PLATFORM_HCC__
, and rebuild. It builds just fine.After that, we can examine the binary by ROCm’s built-in tool (similar to
cuobjdump
on CUDA):which generates three files, two of which are for my arch (gfx906):
Could be relevant to #4720.