question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

[FR] [Roadmap] Add infrastructure examples

See original GitHub issue

MLflow Roadmap Item

This is an MLflow Roadmap item that has been prioritized by the MLflow maintainers. We’ve identified this feature as a highly requested addition to the MLflow package based on community feedback. We’re seeking a community contribution for the implementation of this feature and will enthusiastically support the development and review of a submitted PR for this.

Contribution Note

As with other roadmap items, there may be a desire for multiple contributors to work on an issue. While we don’t discourage collaboration, we strongly encourage that a primary contributor is assigned to roadmap issues to simplify the merging process. The items on the roadmap are of a high priority. Due to the wide-spread demand of roadmap features, we encourage potential contributors to only agree to take on the work of creating a PR, making changes, and ensuring that test coverage is adequately created for the feature if they are willing and able to see the implementation through to a merged state.

Feature scope

This roadmap feature’s complexity is classified as:

  • good-first-issue: This feature is limited in complexity and effort required to implement.
  • simple: This feature does not require a large amount of effort to implement and / or is clear enough to not need a design discussion with maintainers.
  • involved: This feature will require a substantial amount of development effort but does not require an agreed-upon design from the maintainers. The feedback given during the PR phase may be involved and necessitate multiple iterations before approval. (Please bear with us as we collaborate with you to make a great contribution)
  • design-recommended: This is a substantial feature that should have a design document approved prior to working on an implementation (to save your time, not ours). After agreeing to work on this feature, a maintainer will be assigned to support you throughout the development process.

Proposal Summary

This is a ‘meta FR’ that is intended to track several implementation PRs (one for each item). Please do not file a single PR with all of these changes. In order to work on one of these items, please request assignment to one (or several) and maintainers will assign it to you so that there isn’t duplicated work by others.

Each element in here should have two contributions associated with a PR for it:

  1. A documented script, with README, and any required files that illustrate and create a reproducible example concept in a folder within examples that is structured by example content type similar to mlflow/examples/mlflow_artifacts/
  2. Accompanying updates to the official documentation with a linked page that provides a ‘walkthrough guide’ that explains how to accomplish the task with accompanying example code that shows modifications and usage of the provided example.

The infrastructure examples that need to be built are:

  • End to end tracking server configuration for MLflow with SQLite scenario 2.
  • End to end tracking server configuration for MLflow on localhost with Tracking Server scenario 3
  • End to end tracking server configuration for MLflow Tracking Server used exclusively as proxied access host for artifact storage access scenario 6
  • A full example of using child runs to track a hyperparameter optimization run (logging to scenario 2 above) using either hyperopt or optuna (preferred)
  • A full example of setting up a tracking server with nginx and basic http authentication with .htaccess (using env vars MLFLOW_TRACKING_USERNAME and MLFLOW_TRACKING_PASSWORD)

These examples should include ‘real-world-like’ usage patterns of creating multiple experiments, multiple runs per experiment, using nested child runs, setting tags (on experiments and runs), populating descriptions, logging artifacts (as well as metrics and parameters). Showing the use of autologging and the model evaluate API are highly encouraged.

Motivation

What is the use case for this feature?

To support new users and help to simplify the developer experience with well-documented guides on how to use MLflow.

Why is this use case valuable to support for MLflow users in general? ^

What component(s), interfaces, languages, and integrations does this feature affect?

Components

  • area/artifacts: Artifact stores and artifact logging
  • area/build: Build and test infrastructure for MLflow
  • area/docs: MLflow documentation pages
  • area/examples: Example code
  • area/model-registry: Model Registry service, APIs, and the fluent client calls for Model Registry
  • area/models: MLmodel format, model serialization/deserialization, flavors
  • area/projects: MLproject format, project running backends
  • area/scoring: MLflow Model server, model deployment tools, Spark UDFs
  • area/server-infra: MLflow Tracking server backend
  • area/tracking: Tracking Service, tracking client APIs, autologging

Interfaces

  • area/uiux: Front-end, user experience, plotting, JavaScript, JavaScript dev server
  • area/docker: Docker use across MLflow’s components, such as MLflow Projects and MLflow Models
  • area/sqlalchemy: Use of SQLAlchemy in the Tracking Service or Model Registry
  • area/windows: Windows support

Languages

  • language/r: R APIs and clients
  • language/java: Java APIs and clients
  • language/new: Proposals for new client languages

Integrations

  • integrations/azure: Azure and Azure ML integrations
  • integrations/sagemaker: SageMaker integrations
  • integrations/databricks: Databricks integrations

Issue Analytics

  • State:open
  • Created a year ago
  • Reactions:1
  • Comments:11 (5 by maintainers)

github_iconTop GitHub Comments

1reaction
BenWilson2commented, Nov 22, 2022

@Rusteam Sure thing! Please ping me when you file your PR 😃 Thank you for volunteering!

0reactions
Ransakacommented, Dec 20, 2022

Looking forward to contributing to this.

Read more comments on GitHub >

github_iconTop Results From Across the Web

[FR] [Roadmap] Add infrastructure examples - GitHub
Open source platform for the machine learning lifecycle - [FR] [Roadmap] Add infrastructure examples · mlflow/mlflow@790bd94.
Read more >
Build Your Infrastructure Roadmap - Info-Tech Research Group
Demonstrate your understanding to the business and let them validate or correct. Infrastructure must position itself as the broker for new ...
Read more >
Roadmapping: Your starter guide - AHA.io
A roadmap is a visual representation of your strategic plans. In this guide you'll learn how roadmaps can help turn your team's aspirations...
Read more >
Technology roadmap template & examples to organize projects
Plan and coordinate technology developments, etc. Some organizations use a technology roadmap template to streamline internal processes and systems, apart from ...
Read more >
9 business roadmap examples for scaling your organization
Discover business roadmap examples for scaling your organization. Plan and build your company roadmap using one of our free templates.
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found