Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Max Backtracking Option and print out current failure casues

See original GitHub issue

What’s the problem this feature will solve? When a user has a complex requirements set it’s possible that the backtracking can take hours / days / years to find a solution.

In a large requirements list it can be unclear why pip is backtracking as the conflict may exist many layers deep that the user does not know about.

Adding a max backtracking option attempts to solve 2 use cases:

When a user is debugging they can set max backtracking to 0 and be able to manually inspect the failure causes and improve their requirements
When a user is paying for CPU time, e.g. in a cloud environment, they may prefer pip to fail early in backtracking rather than run for hours. In this case they could set say a “reasonable” max backtracking such as 100 or 1000

Describe the solution you’d like

Add a maximum backtrack count to pip CLI and pass to the Resolution object
Add a self._backtrack_count to Resolution object
When backtracking increment self._backtrack_count and check if exceeds the maximum backtrack count
If it exceeds the maximum backtrack count log an error message that this happened and raise raise ResolutionImpossible(causes) so the user can inspect the current error was causing the backtracking

Additional context

This requires adding to the pip CLI, updating resolvelib, adding many test cases, and updating the documentation. I currently don’t have a strong enough understanding of pip’s code base to implement all of this. But if no one else works on this I will try and eventually submit the relevant PRs.

Issue Analytics

State:
Created 2 years ago
Comments:5 (5 by maintainers)

Top GitHub Comments

1reaction

notatallshawcommented, Aug 29, 2021

In principle, I’m +1 on this. However, backtracking is done a lot in normal processing (as I imagine you are aware) and it will be extremely hard to give good advice on what would be a “reasonable” number to set the backtracking parameter to.

I think the hardest part of doing this is likely to be to document it in a way that will (a) help users not familiar with the backtracking algorithm when they hit complex resolution issues, and (b) improve the quality of issue reports we receive when users cannot work out what’s going wrong.

I agree, my thought here is it’s “better than nothing” which is currently the situation for some users. For example in https://github.com/pypa/pip/issues/10373 pip is not helping the user that much, it backtracks seemingly forever with no useful messages for the user as to why. Personally I could spend an hour or 2 on the issue and inject print statements and breakpoints in to pips codebase and eventually figure out how to change the order of the requirements to something that will resolve quicker, but this doesn’t help the general user.

I feel like particularly “–max-backtracks 0” or a “–no-backtrack” flag would be extremely useful for projects which ship very tight requirements such as Airflow, so they can put it in to their ci/cd pipeline and resolve the requirements file as issues come up. It may also help tools that want to build on top of pip but not want pip to do the resolution.

My idea to suggest “–max-backtracks” rather than just “–no-backtrack” was to give the users flexibility in their approach. It would be difficult for pip to suggest a “reasonable” number but a motivated enough user could figure out their own “reasonable” number e.g. they may determine for their particular project under normal circumstances there are less than N backtracks and if there are more than 10*N backtracks something has gone terribly wrong.

Another idea I had to improve information to users / quality of bug reports to pip is to log the failure causes when they change, similar to how raising ResolutionImpossible(causes) displays the final conflicts, this could also be logged each time the causes change (as not to overwhelm the user with 1000s of the same message). It looks like https://github.com/sarugaku/resolvelib/pull/81 and https://github.com/pypa/pip/pull/10258 makes a start on this.

0reactions

uranusjrcommented, Sep 3, 2021

Yeah, it does not cover use case 2. But personally IMO it’s lost cause to even try to do that. Installing any kind of requirement that does not do strict pinning (==) in unattended environments without explicitly capping the resource usage is a recipe to disaster in the first place. Perhaps we could do e.g. automatically detect CI environment variables and switch to a “CI mode” that refuse those inputs outright, or something like that, but I don’t think we can effectively rescue a user from backtracking after it happens (but I can very well be wrong so don’t let my opinion stop you).

Top Results From Across the Web

Give the user (optional) control over the backtracking ... - GitHub

Limiting the number of backtracking similar to Max Backtracking Option and print out current failure casues #10417.

Backtracking Algorithms - GeeksforGeeks

Backtracking can be defined as a general algorithmic technique that considers searching every possible combination in order to solve a computational problem.

Lecture 15: Backtracking - Stony Brook Computer Science

At each step in the backtracking algorithm, we start from a given partial solution, ... If not, the critical issue is whether the...

What is Backtracking Algorithm with Examples & its Application

Backtracking is a general algorithm for solving some computational problems, most notably constraint satisfaction problems, ...

Backtracking

options recursively. . n Queens. The prototypical backtracking problem is the classical n Queens Problem, first proposed by German chess enthusiast Max ......