question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

FixedNoiseGaussianLikelihood with z-scored data

See original GitHub issue

Howdy folks,

I am doing some sensor/data fusion using an exact single task regression GP. The targets come from various sensors, each of which has its own 1-sigma uncertainty. I am grouping the targets according to the sensors Y = [y1, y1, ... y1, y2, ...y2, .... yn, yn, ...., yn] for sensors 1 through n.

Then I am grouping my 1-sigma noises as Y_std = [d1, d1, ..., d1, d2, d2, ..., d2, ..., dn, dn, ... dn], where d1 is the 1-sigma noise of sensor 1, d2 the 1-sigma noise of sensor 2, etc.

Then I am z-scoring my data:

Y'= (Y-mean(Y))/std(Y)
X'= (X-mean(X))/std(X)

Then before passing Y_std**2 to the noise of the FixedNoiseGaussianLikelihood constructor, I obviously need to correctly adjust for the target z-scoring done on the targets.

The right way of doing this is the main premise of my thread and is what I am searching for.

I started out with z-scoring the sensor standard deviations using the mean and std of the target data

Y_std' = (Y_std-mean(Y))/std(Y)

However, this doesn’t feel right to me, and I get counter intuitive results when I experiment with different settings for the sensor noise values.

I changed this by only dividing the noise by the std of the targets :

Y_std'=Y_std/std(Y)

which appears to work a lot better and both noise and target is undergoing the same scaling.

z-scoring the sensor noise using it’s own mean and std didn’t feel like the right thing to do, as I was afraid I would loose the correct relationship to the target data (essentially I’d be scaling the targets and the sensor noise by different values which doesn’t feel right to me)

I guess I am asking what the right way is for adjusting the sensor noises before passing them off as variances in the FixedNoiseGaussianLikelihood?

Thanks

Galto

Issue Analytics

  • State:open
  • Created 4 years ago
  • Comments:8 (1 by maintainers)

github_iconTop GitHub Comments

1reaction
Balandatcommented, Jan 11, 2020

If I use a standard GaussianLikelihood won’t I loose influence on how I weigh my sources?

Yes, that’s true. I guess Geoff is wondering to what extent that’s necessary (if the sensors aren’t all that different it may not be).

is s_n^2, i.e. the variance of the noise, the same noise we are talking about for the FixedNoiseGaussianLikelihood?

Yes, it is.

0reactions
Galto2000commented, Jan 11, 2020

Thanks @Balandat

Gotcha, so yes, in my case, there are very good reasons to “weigh” one data source over another 😃 (or so, at least I think so).

Read more comments on GitHub >

github_iconTop Results From Across the Web

FixedNoiseGaussianLikelihood Vs GaussianLikelihood with ...
Hello, I am working on implementing in GPyTorch a Gaussian process emulator (GPE). In my case, this is essentially a Gaussian process with ......
Read more >
How to calculate Z-scores (formula review) (article)
A z-score measures exactly how many standard deviations above or below the mean a data point is. Here's the formula for calculating a...
Read more >
Z-score: Definition, Formula, and Uses - Statistics By Jim
Z-scores describe how data values compare to the mean by indicating how many standard deviations a value falls above or below the mean....
Read more >
Z-score Calculator
Calculator to find out the z-score of a normal distribution, convert between z-score and probability, and find the probability between 2 z-scores.
Read more >
Z-Score: Definition, Formula and Calculation - Statistics How To
Simply put, a z-score (also called a standard score) gives you an idea of how far from the mean a data point is....
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found