question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

[BUG] Using AVX512 for CPU-Offload

See original GitHub issue

Hi, I have a question on AVX512 support for CPU-offload. My server configuration shows that it is capable of using AVX512, but I found that CPUAdam uses AVX2, instead. Training message explicitly prints out CPUAdam uses AVX2 instead of AVX512 from this code.

Looked into where __AVX512__ and __AVX256__ are defined and found this.

In the above code, “cpu_info[‘flags’]” looks for ‘avx512’ but not for ‘avx512f’, ‘avx512dq’, ‘avx512cd’, ‘avx512ow’, or ‘avx512vl’. I do not know what these different types of avx512 are, but am curious if they cannot be used for CPU-Offload with AVX512 support.

My machine shows : /proc/cpuinfo:641:flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cdp_l3 invpcid_single pti intel_ppin ssbd mba ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm mpx rdt_a avx512f avx512dq rdseed adx smap clflushopt clwb intel_pt avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local dtherm ida arat pln pts pku ospke md_clear flush_l1d arch_capabilities for grep avx /proc/cpuinfo, which includes different types of avx512 (‘f’, ‘dq’, and so on). My feeling says that the machine has some sort of AVX512 support, but it just does not have the __AVX512__ flag.

I was wondering if builder.py needs modification. Thanks

Issue Analytics

  • State:closed
  • Created 9 months ago
  • Comments:6 (2 by maintainers)

github_iconTop GitHub Comments

1reaction
cmikeh2commented, Dec 17, 2022

Just pushed a commit on the PR with a fix along those lines you suggested. Please feel free to try!

0reactions
taehyunzzzcommented, Dec 17, 2022

image Works like a charm! Thanks!

Read more comments on GitHub >

github_iconTop Results From Across the Web

Why is Intel obsessed with AVX512 and beyond when ... - Reddit
In these applications, SIMD and AVX(51)2 are invaluable because the workload is so conducive to SIMD and AVX(51)2 parallelization. However, we ...
Read more >
Core Specialization for AVX-512 Using Fault-and-Migrate
Heavy use of AVX-512 causes a CPU core to reduce its clock frequency as the core ... which cores should be specialized cores...
Read more >
Bug #1739665 “[FFE][Feature] KVM CLX avx512_vnni”
So getting a Feature Freeze ack by the release team is very likely. But lets stay formal and do that. Afterwards I can...
Read more >
What are some scalability problems when using Intel's AVX ...
So the presence of AVX-512 is a scalability issue, but it is not really a problem. It is something to consider, along with...
Read more >
Alder Lake AVX-512 undocummented support with ASUS ...
Just watched Derbauer's video, and it's a bugged CPU-Z that was allowing validating of these CPU's with an incorrect setting in the BIOS....
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found