Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Q1: How do I remove a trial from a scheduler's experiment data?; Q2: How do I update the Generation strategy for a scheduler?

See original GitHub issue

Hi, I wanted to ask for help for the following cases:

Q1: Let’s assume that I ran a scheduler for 5 iterations. On the 5th iteration, due to errors in server, the job fails on the server. See image attached below: Screen Shot 2022-06-13 at 14 09 27

If I resume my scheduler, I obtain an error in the get_job_result api (based on the scheduler tutorial, my server provides the get_job_result api). This happens because the server returns a None object due to failed job.

Is there a way for me to remove the last trial data or overwrite it with a (custom Trial) in the scheduler.experiment data? (assuming I don’t want the trial to be abandoned, or failed). I am just wondering if its possible to edit the experiment data in the scheduler and if it’s ok to do so?

Q2: Also, suppose, I would like to change the generation strategy that I passed initially to the scheduler, is it possible to update it? for eg. can I run a few iterations of SOBOL trials let’s say at trial 16 from the above case?

Thanks for the help!

Issue Analytics

State:
Created a year ago
Comments:5 (3 by maintainers)

Top GitHub Comments

1reaction

bernardbeckermancommented, Jun 23, 2022

@mango-picket the experiment is designed to be a record of all trials run, including failed/abandoned trials, so setting a status (e.g., FAILED) that causes Ax to ignore the trial is currently the best way to accomplish what you’re trying to do. As things are currently set up, if you try to simply delete a trial, the experiment could become erroneous or get into an invalid state.

0reactions

bernardbeckermancommented, Aug 22, 2022

The basic difference is that FAILED trials can be retried by the experiment, whereas ABANDONED trials will not be retried. See here for more details. Also, closing this out since it’s been a couple months.