[Question] Best practices regarding Sobol sampling
In the gpei_hartmann_service.ipynb tutorial, a Sobol strategy is used before switching to GPEI. Just curious whether there is any reference on best practices regarding the Sobol strategy: mainly, what is the relationship between the search space and the number of Sobol trials?
Issue Analytics
- Created: a year ago
- Reactions: 2
- Comments: 5 (4 by maintainers)
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
In general, you can get away with relatively few Sobol points. This is really just an initialization, mostly in place to prime the model so it can fit to some data. If there are too few points to get a good model fit, the model uncertainty should be high across the entire search space, which causes any reasonable acquisition function to behave more or less like random exploration. So if you have a small iteration budget, it is reasonable to use a small number of Sobol points. Using a few more points will probably reduce the chances of the model running into numerical issues and make the model fitting more stable (this is more an intuition / hypothesis than proven fact, though; I've never formally investigated this).
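As an illustration of the point above (not part of the original issue), a small Sobol initialization sample can be drawn with SciPy's quasi-Monte Carlo module; the dimension of 6 is assumed here to match the Hartmann6 search space from the tutorial:

```python
# Hedged sketch: drawing a Sobol initialization sample with SciPy.
from scipy.stats import qmc

dim = 6  # assumed: the 6-dimensional Hartmann6 search space
sampler = qmc.Sobol(d=dim, scramble=True, seed=0)

# Powers of two preserve the balance properties of the Sobol sequence,
# so 2**m points are preferred over arbitrary sample sizes.
points = sampler.random_base2(m=3)  # 8 points in the unit hypercube

# Rescale from [0, 1)^dim to the actual parameter bounds.
lower, upper = [0.0] * dim, [1.0] * dim
scaled = qmc.scale(points, lower, upper)
print(scaled.shape)  # (8, 6)
```

A hypothetical budget split would then hand these 8 points to the optimizer as initial trials before the GP-based steps take over.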
Interestingly, for GPEI there were nearly identical results using a 10/90 Sobol/GPEI split vs. a 46/54 Sobol/GPEI split with 23 hyperparameters; see Figure 2 of https://doi.org/10.1016/j.commatsci.2022.111505.