guide: concerns with "Persisting" experiments

Follow-up to https://github.com/iterative/dvc.org/pull/2845

Term/Concept of “persisting”

_Per https://github.com/iterative/dvc.org/pull/2845#issuecomment-937151219_

First, “persisting” usually means continuing something despite difficulty. We want it to mean making something (that was ephemeral) persistent. So I’m not sure it’s a good title to begin with.

One option is to be explicit, avoid the term “persist”, and say “commit experiments to Git”. This is probably a good first step regardless.

Then we should consider all the reasons why people want to commit experiments and generalize them into a term. Currently we only mention sharing experiments, but in that case this whole content should live inside https://dvc.org/doc/user-guide/experiment-management/sharing-experiments instead (which I doubt is right, but maybe?)

Other

Issue Analytics

  • State: open
  • Created 2 years ago
  • Comments: 24 (24 by maintainers)

Top GitHub Comments

1 reaction
iesahin commented, Mar 12, 2022

I think persist (as @dberenbaum said in an earlier discussion) was a bit broader than what we mean by committing experiments to Git. Persist may also mean sending these to a common repository (dvc exp push), for example. What we’re trying to figure out is how to convey the meaning of the dvc exp apply operation, basically moving the Git ref in .git/refs/exps/ABCD123... as a commit from the HEAD.

DVC experiments require Git to run, so the terms had better reflect that connection too. So, although I understand the concern about “commit”, among the candidates I tried to produce above, it looked like the best.

1 reaction
omesser commented, Mar 10, 2022

@jorgeorpinel - Good point calling me out on mentioning “persist” + “track” as equals here. I see your point doubting my usage of “persist” for experiments. Indeed, it’s not widely used; it’s too narrow.

When I wrote:

If I were to read/skim these docs and compare to other experiment tracking solutions I would look for (search / grep) the terms “persist” and “track” (=what) and not “commit” (=how) - all the more important when it comes to titles!

…“Persisting” was probably my backend instincts kicking in for understanding where something is saved (persisted, if you will) if I weren’t to find the term “track”. But you’re right, the correct thing for me to have done would be to search for “track[ing]”. So while I definitely see “persist” used in the way I mentioned (to save to persistent storage; I stand behind this being familiar 😉), it’s indeed not used in the context of ML experiments. The conventional term (it’s in the name) is experiment tracking.

Let me refine my claim above: the what is to track; the how is by committing to Git.

“In my mind “committing” is not an implementation detail but actually the user’s goal (=what)”

Could this be a little iterative/dvc “biased”, though? Meaning, your “what” here already assumes a Git-centric approach, which is “unique” to us (the user knows they want to commit to Git).

What I’m referring to is familiarity/discoverability for people who don’t know how this works. Most (all?) other tools for “tracking” experiments don’t revolve around Git as the persistence/tracking mechanism. So I would say committing to Git is our “how” (how we track experiments). The “layman” DS/ML engineer wouldn’t necessarily know of, or look for, a way to “commit” their experiments, but rather a way to track them. Obviously, tracking is wider than just persisting: it can include discoverability / comparability / monitoring / insights… Committing to Git satisfies some of those to some extent, for sure, maybe not all. And that’s why I (currently) perceive it as “how” experiments are tracked.
