Pipeline node output artifacts should not be overridden if a node is executed multiple times

Scenario:

  • Create a notebook pipeline.
  • Create a notebook (abc.ipynb) that produces a unique result during each execution.
  • Add the same [notebook] node twice to the pipeline.
  • Run the pipeline.
  • Inspect the output artifacts abc.ipynb and abc.html.

The outputs of notebook execution 1 are overridden by the outputs of notebook execution 2.
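
The behavior can be illustrated with a minimal sketch. It assumes the output artifacts are stored under keys derived only from the notebook's file name; the `bucket` dict and the `upload_outputs` helper are hypothetical stand-ins for the object store and the upload step, not the project's actual code.

```python
# Simulated object store: a dict keyed by object name (hypothetical stand-in
# for the bucket that receives the pipeline's output artifacts).
bucket = {}


def upload_outputs(bucket, notebook_name, executed_notebook, html):
    """Store a node's artifacts using only the notebook's file name as the key."""
    stem = notebook_name.rsplit(".", 1)[0]
    bucket[f"{stem}.ipynb"] = executed_notebook
    bucket[f"{stem}.html"] = html


# Two nodes in the same pipeline reference the same notebook.
upload_outputs(bucket, "abc.ipynb", "result of execution 1", "<html>1</html>")
upload_outputs(bucket, "abc.ipynb", "result of execution 2", "<html>2</html>")

print(sorted(bucket))        # ['abc.html', 'abc.ipynb']
print(bucket["abc.ipynb"])   # result of execution 2
```

Because both nodes write to the same two keys, only the artifacts of the second execution survive.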

Issue Analytics

  • State: open
  • Created 3 years ago
  • Comments: 6 (5 by maintainers)

Top GitHub Comments

1 reaction
ptitzler commented, Sep 16, 2020

"The overwrite doesn’t occur when the node is processed, but rather when the outputs are uploaded to COS."

Yes, that’s what I was trying to raise.
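
One possible way to avoid the collision at upload time is to include a per-node identifier in the object key. The sketch below is only an illustration under that assumption: `artifact_key`, the run name `my-run`, and the node ids `node-1` and `node-2` are hypothetical names, and the dict stands in for the object store.

```python
from pathlib import PurePosixPath


def artifact_key(run_name: str, node_id: str, filename: str) -> str:
    """Build an object key that is unique per pipeline run and per node."""
    return str(PurePosixPath(run_name) / node_id / filename)


# Simulated object store (hypothetical stand-in for the real bucket).
bucket = {}
for node_id, content in [("node-1", "result 1"), ("node-2", "result 2")]:
    bucket[artifact_key("my-run", node_id, "abc.ipynb")] = content

print(sorted(bucket))
# ['my-run/node-1/abc.ipynb', 'my-run/node-2/abc.ipynb']
```

With the node id in the key, both executions of the same notebook keep their own artifacts.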

0 reactions
MLnick commented, Oct 19, 2020

@ptitzler Yes, hyperparameter tuning generally uses the same input data; the only difference between runs is the input parameters to the ML pipeline / model. The results (the trained model) would generally be different.

And yes, random initialization of models will also lead to slightly different results on each training run (unless a fixed random seed is used, which is not uncommon for pipeline reproducibility).

But yes, this would cover any case where the output of the node depends on the input parameters / environment variables.
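
For the hyperparameter-tuning case, one option is to derive the output name from the parameters themselves, so that runs with different parameters never share an artifact name. This is a rough sketch of that idea, not something proposed in the thread: `params_suffix` and `output_name` are hypothetical helpers, and the parameter names are made up for illustration.

```python
import hashlib
import json


def params_suffix(params: dict, length: int = 8) -> str:
    """Return a short, stable hash of the parameters (independent of key order)."""
    canonical = json.dumps(params, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()[:length]


def output_name(notebook: str, params: dict) -> str:
    """Insert the parameter hash between the notebook's stem and its extension."""
    stem, ext = notebook.rsplit(".", 1)
    return f"{stem}-{params_suffix(params)}.{ext}"


# Same notebook, different (hypothetical) hyperparameters: distinct names.
run_a = output_name("abc.ipynb", {"learning_rate": 0.1, "seed": 42})
run_b = output_name("abc.ipynb", {"learning_rate": 0.01, "seed": 42})
print(run_a != run_b)  # True
```

Note that two runs with identical parameters would still map to the same name, so this only helps when the parameters differ; runs that differ only through random initialization would need a per-node or per-run identifier instead.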
