Model versioning
See original GitHub issue

I'm curious whether the following feature is possible in Metaflow. I don't think so, because of some pretty low-level Python limitations, but I figured I would ask the community and see if I'm missing something.
A pretty wonderful use case for Metaflow would be to standardize the validation of models over time. To do that, you would need to load a previously saved version of the model and predict on some new data.
However, saving and loading artifacts in Metaflow relies on pickle, which requires the artifact's dependencies (for example, the library that defines the model's class) to be importable, at compatible versions, whenever the artifact is loaded.
So, am I right that this is an unavoidable problem?
train/1/x - model v1.0.0
train/2/x - model v1.0.1
... time passes ...
validate/1/x - model v1.0.1, so the flow might fail to load v1.0.0, or load it incorrectly
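For concreteness, here is a minimal sketch of how the failure surfaces when a later flow reads the trained artifact back through the client API (the flow name TrainFlow and the artifact name model follow the example discussed in the replies below; the exact exception depends on how the model's library changed):

```python
from metaflow import Run

# Load the artifact produced by the first training run. The client API
# unpickles it in the *current* process, so whatever library defines the
# model object must be importable here, at a compatible version.
train_run = Run("TrainFlow/1")   # the run that produced model v1.0.0
model = train_run.data.model     # may raise (e.g. ModuleNotFoundError)
                                 # or deserialize incorrectly once the
                                 # library has moved on to v1.0.1
```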
Can metaflow help circumvent this issue in any way?
Hi. To provide a bit more detail, as @savingoyal mentioned, we are actually working on a way to access artifacts produced in a conda environment. As you rightly pointed out, an artifact that was saved within a specific environment (the model in your TrainFlow) needs to be accessed within that environment again. This is true whether you access it from a notebook or via the client API in another flow (as you do in your ValidateFlow). Currently, for your case, the solution would be to have the validate step run in the same environment as the train step. This is clearly not ideal, in large part because we cannot expect the writer of the validate flow to know what the train flow did.
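In the meantime, that shared environment can at least be pinned explicitly with Metaflow's @conda decorator. A minimal sketch, assuming the train step pinned scikit-learn 0.22.1 (the Python and library versions here are illustrative, not prescribed):

```python
from metaflow import FlowSpec, Flow, conda, step


class ValidateFlow(FlowSpec):

    # Pin this step to the same dependencies the train step declared,
    # so unpickling the model finds the library it expects.
    @conda(python="3.7.4", libraries={"scikit-learn": "0.22.1"})
    @step
    def start(self):
        train_run = Flow("TrainFlow").latest_successful_run
        self.model = train_run.data.model  # same environment: loads cleanly
        self.next(self.end)

    @step
    def end(self):
        pass


if __name__ == "__main__":
    ValidateFlow()
```

Run the flow with --environment=conda so the decorator takes effect.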
The solution we are working on would spin up a separate Python interpreter in the proper environment (that is, the conda environment that was used to create the artifact in the first place) and forward all method calls on that artifact to that sub-interpreter. There are lots of details and caveats to this, but the basic principle has been tested for a different use case within Netflix (it is essentially a form of RPC).
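To illustrate just the principle (this is not Metaflow's implementation; the use of conda run, the worker snippet, and the picklability assumptions are all mine), a method call can be forwarded to a child interpreter living in the artifact's original environment:

```python
import pickle
import subprocess

# Code the child interpreter runs: read a request from stdin, unpickle
# the artifact, invoke the requested method, pickle the result to stdout.
_WORKER = """
import pickle, sys
path, method, args = pickle.load(sys.stdin.buffer)
with open(path, "rb") as f:
    artifact = pickle.load(f)
result = getattr(artifact, method)(*args)
sys.stdout.buffer.write(pickle.dumps(result, protocol=2))
"""


def call_in_env(env_name, artifact_path, method, *args):
    """Run getattr(artifact, method)(*args) inside conda env `env_name`,
    where the artifact's dependencies actually exist.

    Among the caveats alluded to above: the arguments and the result must
    themselves be picklable, with a protocol both interpreters understand
    (protocol 2 is used here for maximum compatibility).
    """
    request = pickle.dumps((artifact_path, method, args), protocol=2)
    proc = subprocess.run(
        ["conda", "run", "-n", env_name, "python", "-c", _WORKER],
        input=request,
        capture_output=True,
        check=True,
    )
    return pickle.loads(proc.stdout)


# e.g.: preds = call_in_env("train-env", "/tmp/model.pkl", "predict", X_new)
```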
We are hoping to develop something in Q2 which would make our Conda solution more complete (right now, artifacts that are pickled in a specific conda environment are hard to access unless you are in that same conda environment).
We are actually working on a feature this quarter (Q2 2020) to essentially enable what you are seeking: run.model.score will execute inside its own environment, the one that was defined when the model was first generated. @romain-intel can share more details on that front.
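A hedged sketch of what that might look like from today's client API, where artifacts hang off run.data (the feature does not exist yet, and load_new_observations is a hypothetical helper standing in for however you fetch fresh inputs):

```python
from metaflow import Flow

run = Flow("TrainFlow").latest_successful_run
new_data = load_new_observations()  # hypothetical helper: fresh inputs

# The idea: this call would be proxied to a sub-interpreter running in
# the conda environment that produced the model, so it succeeds even if
# the current environment cannot import the model's library.
predictions = run.data.model.score(new_data)
```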