question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Thompson Sampling for Gaussian priors?

See original GitHub issue

Hi!

I learned that mabwiser does not implement Thompson Sampling for Gaussian priors. As far as I know (please note that I’m quite new to multi-armed bandits), the Gaussian distribution is a conjugate prior and it’s possible to apply the procedure of Thompson Sampling on Gaussian priors. Would the maintainers be open to adding that as a learning policy? Or, is it the case that the optimality of Thompson sampling is only proved for Beta priors?

Thanks a lot!

Issue Analytics

  • State:closed
  • Created 2 years ago
  • Comments:6 (4 by maintainers)

github_iconTop GitHub Comments

1reaction
skadiocommented, Mar 14, 2022

@jaywonchung good luck with your submission! This of course takes high priority with or without mabwiser.

I am enjoying the technical brainstorming on the thread, and others might also find it useful. If it works out, especially it works well in your own experiments/paper, I would encourage the introduction of a new bandit as suggested.

We have a detailed write-up that shows a step-by-step intro to how to add a new bandit to mabwiser here https://fidelity.github.io/mabwiser/new_bandit.html. When you go that route, I can see us collaborating on the same PR --which might be fun! 😃

Good luck with your paper!

Serdar

0reactions
skadiocommented, May 7, 2022

Hope all is well, and your submission is now all good! Let’s us archive this issue now for future reference inc case it comes handy for others. Thank you again for the positive feedback!

Read more comments on GitHub >

github_iconTop Results From Across the Web

Thompson Sampling using Conjugate Priors | by Steve Roberts
Thompson sampling overcomes this problem, and introduces exploration, by sampling from the prior probability distribution. The variance of the ...
Read more >
Introduction to Thompson Sampling 1 Bayesian learning
We present a Thompson Sampling algorithm using Gaussian priors. As proved earlier, in this case, we can compute a closed form of the...
Read more >
Optimality of Thompson Sampling for Gaussian Bandits ...
In this paper we discuss the optimality of TS for the model of normal distributions with unknown means and variances as one of...
Read more >
Optimality of Thompson Sampling for Gaussian Bandits ... - arXiv
Abstract: In stochastic bandit problems, a Bayesian policy called Thompson sampling (TS) has recently attracted much attention for its ...
Read more >
A Tutorial on Thompson Sampling - Stanford University
Thompson sampling is an algorithm for online decision prob- ... φ from a Gaussian distribution with mean µ and covariance matrix Σ, and...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found