Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

FixedNoiseGaussianLikelihood with z-scored data

See original GitHub issue

Howdy folks,

I am doing some sensor/data fusion using an exact single task regression GP. The targets come from various sensors, each of which has its own 1-sigma uncertainty. I am grouping the targets according to the sensors Y = [y1, y1, ... y1, y2, ...y2, .... yn, yn, ...., yn] for sensors 1 through n.

Then I am grouping my 1-sigma noises as Y_std = [d1, d1, ..., d1, d2, d2, ..., d2, ..., dn, dn, ... dn], where d1 is the 1-sigma noise of sensor 1, d2 the 1-sigma noise of sensor 2, etc.

Then I am z-scoring my data:

Y'= (Y-mean(Y))/std(Y)
X'= (X-mean(X))/std(X)

Then before passing Y_std**2 to the noise of the FixedNoiseGaussianLikelihood constructor, I obviously need to correctly adjust for the target z-scoring done on the targets.

The right way of doing this is the main premise of my thread and is what I am searching for.

I started out with z-scoring the sensor standard deviations using the mean and std of the target data

Y_std' = (Y_std-mean(Y))/std(Y)

However, this doesn’t feel right to me, and I get counter intuitive results when I experiment with different settings for the sensor noise values.

I changed this by only dividing the noise by the std of the targets :

Y_std'=Y_std/std(Y)

which appears to work a lot better and both noise and target is undergoing the same scaling.

z-scoring the sensor noise using it’s own mean and std didn’t feel like the right thing to do, as I was afraid I would loose the correct relationship to the target data (essentially I’d be scaling the targets and the sensor noise by different values which doesn’t feel right to me)

I guess I am asking what the right way is for adjusting the sensor noises before passing them off as variances in the FixedNoiseGaussianLikelihood?

Thanks

Galto

Issue Analytics

State:
Created 4 years ago
Comments:8 (1 by maintainers)

Top GitHub Comments

1reaction

Balandatcommented, Jan 11, 2020

If I use a standard GaussianLikelihood won’t I loose influence on how I weigh my sources?

Yes, that’s true. I guess Geoff is wondering to what extent that’s necessary (if the sensors aren’t all that different it may not be).

is s_n^2, i.e. the variance of the noise, the same noise we are talking about for the FixedNoiseGaussianLikelihood?

Yes, it is.

0reactions

Galto2000commented, Jan 11, 2020

Thanks @Balandat

Gotcha, so yes, in my case, there are very good reasons to “weigh” one data source over another 😃 (or so, at least I think so).

Top Results From Across the Web

FixedNoiseGaussianLikelihood Vs GaussianLikelihood with ...

Hello, I am working on implementing in GPyTorch a Gaussian process emulator (GPE). In my case, this is essentially a Gaussian process with ......

How to calculate Z-scores (formula review) (article)

A z-score measures exactly how many standard deviations above or below the mean a data point is. Here's the formula for calculating a...

Z-score: Definition, Formula, and Uses - Statistics By Jim

Z-scores describe how data values compare to the mean by indicating how many standard deviations a value falls above or below the mean....

Z-score Calculator

Calculator to find out the z-score of a normal distribution, convert between z-score and probability, and find the probability between 2 z-scores.

Z-Score: Definition, Formula and Calculation - Statistics How To

Simply put, a z-score (also called a standard score) gives you an idea of how far from the mean a data point is....

Troubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.

Start Free

Top Related Reddit Thread

No results found

Top Related Tweet

No results found

Top Related Dev.to Post

No results found

FixedNoiseGaussianLikelihood with z-scored data

Issue Analytics

Top GitHub Comments

Top Results From Across the Web

Top Related Medium Post

Top Related StackOverflow Question

Troubleshoot Live Code

Top Related Reddit Thread

Top Related Hackernoon Post

Top Related Tweet

Top Related Dev.to Post

Top Related Hashnode Post

Arc Kernel for Conditional Spaces

Bulding multiclass classification model