Selective noise inference for some observations in field experiment
Hi! We are a group of social scientists trying to use Bayesian optimization in our experiment. We are running the optimization entirely as a field experiment, which means that in some cases we can only obtain a very small number of observations for certain parameter configurations. This means that we cannot get a good estimate of the SEM (with n = 1, bootstrapping naturally gives SEM = 0). The data we input to the Ax experiment might therefore look something like this (all numbers are arbitrary):
| | arm_name | metric_name | mean | sem | trial_index | n |
|---|---|---|---|---|---|---|
| 0 | 0_0 | score | 6.51 | 0.94 | 0 | 4 |
| 1 | 0_1 | score | 7.33 | 0.55 | 0 | 4 |
| 2 | 0_2 | score | 6.94 | 0.53 | 0 | 3 |
| 3 | 0_3 | score | 9.42 | 0.91 | 0 | 2 |
| 4 | 0_4 | score | 3.91 | 0 | 0 | 1 |
| 5 | 0_5 | score | 2.50 | 0 | 0 | 1 |
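For concreteness, here is a minimal sketch of how a table like this could be attached through the Ax Developer API. This is an illustration rather than code from the original issue: the `experiment` object (with a trial 0 containing arms 0_0 through 0_5) is assumed to already exist, and the metric name `score` and all numbers are taken from the table above.

```python
# Minimal sketch (illustrative): attaching the table above as an Ax `Data`
# object. Assumes an existing `experiment` whose trial 0 contains arms
# 0_0 through 0_5 and which tracks a metric called "score".
import pandas as pd
from ax.core.data import Data

df = pd.DataFrame({
    "arm_name": ["0_0", "0_1", "0_2", "0_3", "0_4", "0_5"],
    "metric_name": ["score"] * 6,
    "mean": [6.51, 7.33, 6.94, 9.42, 3.91, 2.50],
    "sem": [0.94, 0.55, 0.53, 0.91, 0.0, 0.0],  # SEM of 0 for the n = 1 arms
    "trial_index": [0] * 6,
    "n": [4, 4, 3, 2, 1, 1],
})
experiment.attach_data(Data(df=df))  # `experiment` is assumed to exist
```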
We were wondering whether Ax accounts for the fact that an SEM of 0 when n = 1 does not mean we are fully confident that we have the right value. If it does not, what is the best way to proceed? More generally, what can we do when we are relatively less confident about the observed values for some parameter configurations?
We know that we can indicate unknown variance by passing in np.nan, but it looks like we cannot apply it selectively to just the arms we are not confident about. When we tried, Ax raised this error:
```
ValueError: Mix of known and unknown variances indicates valuation function errors. Variances should all be specified, or none should be.
```
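For reference, a hypothetical sketch of the selective-NaN approach described above, continuing from the earlier sketch (again an illustration, not code from the original issue):

```python
# Sketch of the selective attempt: unknown variance (NaN SEM) only for the
# n = 1 arms, finite SEMs everywhere else. Mixing the two like this is what
# leads to the "Mix of known and unknown variances" error quoted above.
import numpy as np

df_mixed = df.copy()  # `df` from the sketch above
df_mixed.loc[df_mixed["n"] == 1, "sem"] = np.nan  # unknown variance, selectively
experiment.attach_data(Data(df=df_mixed))
```

Taking the error message at face value ("Variances should all be specified, or none should be"), the all-or-nothing alternative would be to set every SEM to np.nan and let the model infer a single noise level across all arms.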
Let us know if anything is unclear, and thank you so much in advance!
Top GitHub Comments
@nwrim That indeed makes sense, and inasmuch as your setup does this, it should already do the right thing.
I don’t have much additional wisdom to dispense, other than that if you’re trying to estimate an SEM from a very small number of observations, your error will likely not be Gaussian. So technically you’re going to be violating some of the modeling assumptions, but from a practical perspective you’re probably going to be fine (at least from an optimization perspective; just be careful not to trust the model too much).
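To make the small-n caveat concrete, here is a quick simulation sketch (illustrative only, not from the thread): it draws many small samples from a Normal distribution with known spread and shows how widely the estimated SEM itself varies when n is only 2 to 4.

```python
# Illustration: how unstable a sample-based SEM estimate is for tiny n.
# Draw many small samples from a known Normal and look at the spread of
# the estimated SEM across those samples.
import numpy as np

rng = np.random.default_rng(0)
true_sd = 1.0
for n in (2, 3, 4, 20):
    samples = rng.normal(0.0, true_sd, size=(10_000, n))
    sems = samples.std(axis=1, ddof=1) / np.sqrt(n)  # per-sample SEM estimate
    lo, hi = np.quantile(sems, [0.1, 0.9])
    print(f"n={n:2d}: true SEM = {true_sd / np.sqrt(n):.2f}, "
          f"80% of SEM estimates fall in [{lo:.2f}, {hi:.2f}]")
```

For n = 1 there is no within-arm spread to estimate at all, which is why the bootstrap SEM of 0 in the table above carries essentially no information about the observation noise.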
Thank you so much again @Balandat!