"Terminate" does not stop pipeline run execution

What steps did you take:

Clicking “Terminate” does not actually stop pipeline execution if any step has a retryStrategy or a running init container.

What happened:

We are running a pipeline with roughly 50 parallel step executions. To stop a pipeline run, we have to delete the Kubernetes Workflow resource instead of using the “Terminate” button in the UI, because the button does not stop the run in either of these cases:

  1. A step (container) has a retryStrategy: it is restarted instead of being terminated.
  2. A step has init containers: the step is not stopped until they finish and the main container takes over.
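
The workaround described above, deleting the Workflow resource, could be scripted roughly as follows. This is a minimal sketch, not part of the original report: it assumes `kubectl` is installed and configured for the cluster, that the run's Argo Workflow lives in a namespace called `kubeflow`, and the helper names are hypothetical.

```python
import subprocess
from typing import List


def build_delete_workflow_cmd(workflow_name: str, namespace: str = "kubeflow") -> List[str]:
    """Return the kubectl argv that deletes one Workflow resource.

    The default namespace is an assumption; adjust it for your deployment.
    """
    if not workflow_name:
        raise ValueError("workflow_name must not be empty")
    return ["kubectl", "delete", "workflow", workflow_name, "--namespace", namespace]


def delete_workflow(workflow_name: str, namespace: str = "kubeflow") -> None:
    """Run the delete command; requires kubectl and access to the cluster."""
    subprocess.run(build_delete_workflow_cmd(workflow_name, namespace), check=True)
```

Calling `delete_workflow("<workflow-name>")` removes the Workflow and its pods, which is the behaviour the report expects from “Terminate”. Note that this deletes the resource rather than marking the run as terminated.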

What did you expect to happen:

All pipeline steps/pods are terminated the instant the user hits “Terminate”, with no retries, as if the corresponding Workflow had been deleted.

How did you deploy Kubeflow Pipelines (KFP)?

Kubeflow Pipelines is installed as part of the Kubeflow project.

KFP version: Build commit: 743746b

KFP SDK version: 0.5.1

Anything else you would like to add:

Please let me know if additional info is required.

/kind bug
/area frontend
/area backend

Issue Analytics

  • State: open
  • Created: 3 years ago
  • Reactions: 9
  • Comments: 25 (13 by maintainers)

Top GitHub Comments

1 reaction
axeltidemann commented, Apr 29, 2022

Creating a minimal reproducing pipeline would require a lot of GitHub archeology and recurrent scheduling, and it would probably not provoke the error anyway, since the Kubeflow cluster has since been upgraded.

What worked in the end was to remove the row that described the pipeline run from the MySQL database.

[Screenshot: the MySQL database]

In the run_details table, delete the row whose UUID matches the one shown in the Kubeflow UI.
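
As an illustration only, the row deletion might look like the sketch below. It assumes the key column is literally named `UUID`, as the comment suggests. The `DisplayName` column, the sample rows, and the `delete_run_row` helper are hypothetical, and an in-memory SQLite table stands in for the real MySQL database so the example runs on its own. With a MySQL driver you would pass that driver's cursor and, typically, the `%s` placeholder. Back up the database before trying anything like this on a real deployment.

```python
import sqlite3


def delete_run_row(cursor, run_uuid: str, placeholder: str = "?") -> int:
    """Delete the run_details row with the given UUID; return the rows removed.

    `placeholder` is the DB-API parameter marker: "?" for sqlite3,
    "%s" for common MySQL drivers.
    """
    if placeholder not in ("?", "%s"):
        raise ValueError("unsupported placeholder")
    # The UUID value is passed as a bound parameter, never interpolated.
    cursor.execute(f"DELETE FROM run_details WHERE UUID = {placeholder}", (run_uuid,))
    return cursor.rowcount


# Stand-in database: a tiny table with hypothetical contents.
conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE run_details (UUID TEXT PRIMARY KEY, DisplayName TEXT)")
cur.executemany(
    "INSERT INTO run_details VALUES (?, ?)",
    [("run-1", "stuck run"), ("run-2", "healthy run")],
)

removed = delete_run_row(cur, "run-1")
conn.commit()
remaining = [row[0] for row in cur.execute("SELECT UUID FROM run_details")]
```

Checking the returned row count before committing is a cheap guard: anything other than 1 suggests the UUID was copied incorrectly.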

