question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Q1: How do I remove a trial from a scheduler's experiment data?; Q2: How do I update the Generation strategy for a scheduler?

See original GitHub issue

Hi, I wanted to ask for help for the following cases:

Q1: Let’s assume that I ran a scheduler for 5 iterations. On the 5th iteration, due to errors in server, the job fails on the server. See image attached below: Screen Shot 2022-06-13 at 14 09 27

If I resume my scheduler, I obtain an error in the get_job_result api (based on the scheduler tutorial, my server provides the get_job_result api). This happens because the server returns a None object due to failed job.

Is there a way for me to remove the last trial data or overwrite it with a (custom Trial) in the scheduler.experiment data? (assuming I don’t want the trial to be abandoned, or failed). I am just wondering if its possible to edit the experiment data in the scheduler and if it’s ok to do so?

Q2: Also, suppose, I would like to change the generation strategy that I passed initially to the scheduler, is it possible to update it? for eg. can I run a few iterations of SOBOL trials let’s say at trial 16 from the above case?

Thanks for the help!

Issue Analytics

  • State:closed
  • Created a year ago
  • Comments:5 (3 by maintainers)

github_iconTop GitHub Comments

1reaction
bernardbeckermancommented, Jun 23, 2022

@mango-picket the experiment is designed to be a record of all trials run, including failed/abandoned trials, so setting a status (e.g., FAILED) that causes Ax to ignore the trial is currently the best way to accomplish what you’re trying to do. As things are currently set up, if you try to simply delete a trial, the experiment could become erroneous or get into an invalid state.

0reactions
bernardbeckermancommented, Aug 22, 2022

The basic difference is that FAILED trials can be retried by the experiment, whereas ABANDONED trials will not be retried. See here for more details. Also, closing this out since it’s been a couple months.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Trial Schedulers (tune.schedulers) — Ray 2.2.0
These Trial Schedulers can early terminate bad trials, pause trials, clone trials, and alter hyperparameters of a running trial.
Read more >
Scheduling Operations - PJM
Spinning Reserve and Regulation Scheduling Software (SPREGO) . ... generation schedules updates to specific generation owners only, as required.
Read more >
Schedule Management Handbook - NASA
This chapter provides an introduction to key elements of NASA's strategic framework for Schedule Management, tying best practices to Agency P/p ...
Read more >
Process Schedulers in Operating System - GeeksforGeeks
The process scheduling is the activity of the process manager that handles the removal of the running process from the CPU and the...
Read more >
8 Updating Exadata Software - Oracle Help Center
There are different types of software used on an Exadata Database Machine that ... required for Exadata Database Machine helps you plan an...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found