[Question] Thoughts on using Ax for stochastic trials
This is a fairly open and vague question, apologies, but I'd like to see if anyone has thoughts on using the approach implemented by Ax for "stochastic trials", or, stated another way, "policy learning".
What I mean by this is that instead of running a trial with a fixed set of parameters, it is common in reinforcement learning to deploy a stochastic policy (for example, a policy defined to output a value drawn from a parameterized distribution). This allows for applying algorithms like REINFORCE, as sketched below.
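(For reference, REINFORCE estimates the policy gradient by sampling actions from the policy itself,

$$\nabla_\theta J(\theta) = \mathbb{E}_{a \sim \pi_\theta}\big[\nabla_\theta \log \pi_\theta(a)\, R(a)\big],$$

which is only possible because the policy $\pi_\theta$ is stochastic.)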
I am trying to understand whether there is a useful bridge between these two approaches. One way of modelling this would be to have the Trial define the parameters of a stochastic policy. This seems okay, but applying the rest of the BO toolbox to this kind of data seems tricky.
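To make the idea concrete, here is a minimal sketch of what I have in mind, using Ax's Service API. Everything problem-specific here is hypothetical: `rollout_reward` stands in for the real environment, the Gaussian policy parameterization (`mu`, `log_sigma`) is just one example, and the exact `create_experiment` signature may differ across Ax versions (newer releases take an `objectives` argument instead of `objective_name`):

```python
import numpy as np
from ax.service.ax_client import AxClient

# Hypothetical black-box environment: the reward for one action.
# Stands in for a single rollout of the deployed policy.
def rollout_reward(action: float) -> float:
    return -(action - 0.3) ** 2 + np.random.normal(scale=0.1)

# Each Ax trial fixes the *policy parameters* (a Gaussian's mean and
# log-std in this sketch), not the action itself; actions are sampled
# per rollout, which is what makes the trial "stochastic".
def evaluate_policy(params, n_rollouts: int = 32):
    mu, log_sigma = params["mu"], params["log_sigma"]
    actions = np.random.normal(mu, np.exp(log_sigma), size=n_rollouts)
    rewards = np.array([rollout_reward(a) for a in actions])
    # Report mean and SEM so the surrogate model sees the observation
    # noise induced by both the environment and the stochastic policy.
    sem = rewards.std(ddof=1) / np.sqrt(n_rollouts)
    return {"reward": (rewards.mean(), sem)}

ax_client = AxClient()
ax_client.create_experiment(
    name="stochastic_policy_bo",
    parameters=[
        {"name": "mu", "type": "range", "bounds": [-1.0, 1.0]},
        {"name": "log_sigma", "type": "range", "bounds": [-3.0, 0.0]},
    ],
    objective_name="reward",
    minimize=False,
)

for _ in range(20):
    params, trial_index = ax_client.get_next_trial()
    ax_client.complete_trial(trial_index=trial_index, raw_data=evaluate_policy(params))
```

Under this framing the GP models expected return as a function of the policy parameters, with the per-trial SEM capturing the extra noise from the stochastic policy; whether the rest of the BO toolbox behaves well on top of that is exactly the open question.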
Any thoughts from people using Ax?
Top GitHub Comments
Cool. Closing this out for now then; feel free to reopen if needed.
You’re completely right. Evaluation is cheap, but potentially correlated across evaluations (which is why BO was the initial thought).
Interesting thought on reward shaping. I’ll need to think a bit more about it. It sounds very sensible but perhaps not immediately necessary.