Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

pbc with multigrid DF benchmark

See original GitHub issue

Hi PySCF team,

I am attempting to run some larger-scale DFT calculations within the pbc/df module. Specifically, I am hoping to reproduce some benchmarks in the 2020 paper (https://aip.scitation.org/doi/pdf/10.1063/5.0006074) Table II. It appears that pyscf/examples/2-benchmark/fock_multigrid.py sets up this benchmark. However, I ran into a few problems/questions:

For the nao = 2560 water box, using LDA and 96 threads @ 2.00GHz I get a Fock matrix build time of 100s (see line 546 of attached output) which seems quite off from the paper result (20 sec with 32 threads (?) @ 2.3 GHz).
Attempting to run a full DFT calculation, I run out of memory (tries to allocate a 470gb array! see end of attached file).
In Table II, it is explained that “Integral screening and lattice summation cutoff were controlled by an overall threshold of 10^−6 a.u.”. I assume setting precision = 1e-6 does exactly this? Or do I need to set the integral screening separately?

For reproduceability, I ran the calculation within the latest Docker image (pyscf/pyscf:latest).

multigrid_test.txt

Issue Analytics

State:
Created 2 years ago
Comments:5 (2 by maintainers)

Top GitHub Comments

1reaction

jamesETsmithcommented, Jul 30, 2021

As you go to a larger number of threads sometimes the overhead cost of parallelization can slow things down especially as you get to more than 80 threads. A lot of factors, like the problem size and CPU type, can affect this. If you’re interesting in finding the most performant settings on your hardware, I’d suggest trying several different values for the number of threads to find what’s best for you.

0reactions

pedersorcommented, Aug 2, 2021

Thanks James and Xing! I’ll try out the fix.