question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

[Question] Thoughts on using Ax for stochastic trials

See original GitHub issue

This is a fairly open an vague question apologies, but I’d like to see if anyone has thoughts on using the approach implemented by Ax for “stochastic trials”, or stated another way “policy learning”.

What I mean by this is instead of running a trial with a fixed set of parameters, it is common in reinforcement learning to deploy a stochastic policy (for example a policy defined to output a value from a parameterized distribution). This allows for applying algorithms like REINFORCE.

I am trying to understand if there is a bridge between these two approaches which is useful. One way of modelling this would be to just have the Trial define the parameters of a stochastic policy. This seems okay, but applying the rest of the BO toolbox to this kind of data seems tricky.

Any thoughts from people using Ax?

Issue Analytics

  • State:closed
  • Created a year ago
  • Comments:5 (3 by maintainers)

github_iconTop GitHub Comments

1reaction
Balandatcommented, Oct 11, 2022

Cool. Closing this out for now then, feel free to reopen if needed.

0reactions
Padarncommented, Oct 11, 2022

You’re completely right. Evaluation is cheap, but potential correlated across evaluations (which is why BO was the initial thought).

Interesting thought on reward shaping. I’ll need to think a bit more about it. It sounds very sensible but perhaps not immediately necessary.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Stochastic Modeling - Science topic - ResearchGate
Explore the latest questions and answers in Stochastic Modeling, and find Stochastic ... I am new to ground motion simulation using stochastic methods....
Read more >
An Introduction to Stochastic Modeling, Third Edition
mathematical and statistical studies. This book is intended as a beginning text in stochastic processes for stu- dents familiar with elementary probability ......
Read more >
Stochastic Calculus: An Introduction with Applications
If X is a random variable, then its expectation, E[X] can be thought of as the best guess for X given no information...
Read more >
Stochastic Choice - Harvard University
Question: • Are “close” decisions faster or slower? Intuitions: • People “overthink” decision problems which don't matter,. “underthink” those ...
Read more >
SC505 STOCHASTIC PROCESSES Class Notes | MIT
event occurs out of n independent trials. Then, X is a random variable with discrete range {0, 1,...,n}. A simple representation of x...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found