Thompson Sampling for Gaussian priors?
See original GitHub issueHi!
I learned that mabwiser
does not implement Thompson Sampling for Gaussian priors. As far as I know (please note that I’m quite new to multi-armed bandits), the Gaussian distribution is a conjugate prior and it’s possible to apply the procedure of Thompson Sampling on Gaussian priors. Would the maintainers be open to adding that as a learning policy? Or, is it the case that the optimality of Thompson sampling is only proved for Beta priors?
Thanks a lot!
Issue Analytics
- State:
- Created 2 years ago
- Comments:6 (4 by maintainers)
Top Results From Across the Web
Thompson Sampling using Conjugate Priors | by Steve Roberts
Thompson sampling overcomes this problem, and introduces exploration, by sampling from the prior probability distribution. The variance of the ...
Read more >Introduction to Thompson Sampling 1 Bayesian learning
We present a Thompson Sampling algorithm using Gaussian priors. As proved earlier, in this case, we can compute a closed form of the...
Read more >Optimality of Thompson Sampling for Gaussian Bandits ...
In this paper we discuss the optimality of TS for the model of normal distributions with unknown means and variances as one of...
Read more >Optimality of Thompson Sampling for Gaussian Bandits ... - arXiv
Abstract: In stochastic bandit problems, a Bayesian policy called Thompson sampling (TS) has recently attracted much attention for its ...
Read more >A Tutorial on Thompson Sampling - Stanford University
Thompson sampling is an algorithm for online decision prob- ... φ from a Gaussian distribution with mean µ and covariance matrix Σ, and...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
@jaywonchung good luck with your submission! This of course takes high priority with or without mabwiser.
I am enjoying the technical brainstorming on the thread, and others might also find it useful. If it works out, especially it works well in your own experiments/paper, I would encourage the introduction of a new bandit as suggested.
We have a detailed write-up that shows a step-by-step intro to how to add a new bandit to mabwiser here https://fidelity.github.io/mabwiser/new_bandit.html. When you go that route, I can see us collaborating on the same PR --which might be fun! 😃
Good luck with your paper!
Serdar
Hope all is well, and your submission is now all good! Let’s us archive this issue now for future reference inc case it comes handy for others. Thank you again for the positive feedback!