pbc with multigrid DF benchmark
See original GitHub issueHi PySCF team,
I am attempting to run some larger-scale DFT calculations within the pbc/df module. Specifically, I am hoping to reproduce some benchmarks in the 2020 paper (https://aip.scitation.org/doi/pdf/10.1063/5.0006074) Table II. It appears that pyscf/examples/2-benchmark/fock_multigrid.py sets up this benchmark. However, I ran into a few problems/questions:
-
For the nao = 2560 water box, using LDA and 96 threads @ 2.00GHz I get a Fock matrix build time of 100s (see line 546 of attached output) which seems quite off from the paper result (20 sec with 32 threads (?) @ 2.3 GHz).
-
Attempting to run a full DFT calculation, I run out of memory (tries to allocate a 470gb array! see end of attached file).
-
In Table II, it is explained that “Integral screening and lattice summation cutoff were controlled by an overall threshold of 10^−6 a.u.”. I assume setting
precision = 1e-6
does exactly this? Or do I need to set the integral screening separately?
For reproduceability, I ran the calculation within the latest Docker image (pyscf/pyscf:latest).
Issue Analytics
- State:
- Created 2 years ago
- Comments:5 (2 by maintainers)
As you go to a larger number of threads sometimes the overhead cost of parallelization can slow things down especially as you get to more than 80 threads. A lot of factors, like the problem size and CPU type, can affect this. If you’re interesting in finding the most performant settings on your hardware, I’d suggest trying several different values for the number of threads to find what’s best for you.
Thanks James and Xing! I’ll try out the fix.