Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Development friendly Kubeflow experience

See original GitHub issue

/kind feature

Why you need this feature: Say I have a local package containing my application logic (i.e cleaning, feature generation, ML model training, etc…). This local package contains modules and functions used in my component.

I want to make changes to the application logic (i.e change a feature scaling method), then run my pipeline and 1) make sure the pipeline works or 2) see an improvement in my offline metrics.

My component image needs to have all the dependencies on the image, so this seems to mean that if I want to run my kubeflow pipeline with new code, I need to re-build and submit an image each time. This is a pretty slow process, and prevents us from wanting to make smaller components (better to develop pipelines in Python and run them as a bigger component via a CLI command).

I’m imagining one solution with a local Kubeflow instance, that has the component images pointing to locally built docker images that have the local application code mounted, you can get a much faster iteration cycle.

Is there a better way to develop faster with Kubeflow? It says it’s experimentation friendly, but I haven’t felt that from working with Kubeflow so far (it is nice that it has experiment management/tracking in the UI though!). I don’t feel like I can swap my current experimentation workflow out for Kubeflow.

Maybe a user guide on developing locally could be a good solution? Something equivalent to pip install -e . for Kubeflow components would be great!

Issue Analytics

State:
Created 3 years ago
Comments:9 (6 by maintainers)

Top GitHub Comments

1reaction

DavidSpekcommented, Jan 19, 2021

/area pipelines Ping @Bobgy. Seeing as this is related to Pipelines specifically maybe it can be moved to the kubeflow/pipelines repo.

0reactions

stale[bot]commented, Apr 28, 2022

This issue has been automatically closed because it has not had recent activity. Please comment “/reopen” to reopen it.

Top Results From Across the Web

Accelerating Machine Learning App Development ... - YouTube

Find out how running Kubeflow on Google Cloud helped GOJEK to dramatically accelerate the speed at which they could deliver machine learning ...

Kubeflow Pipelines: Helping developers from prototyping to ...

In this episode of Google Cloud AI Huddle, Soroush Radpour, Software Engineer on the Google Cloud AI platform team, goes over Kubeflow ......

Kubeflow: Not Yet Ready for Production? - Data Revenue

This article describes our Kubeflow experience. ... Model training and development: training and preparing models based on the processed data; ...

Build flexible and scalable distributed training architectures ...

Build flexible and scalable distributed training architectures using Kubeflow on AWS and Amazon SageMaker ... Machine learning (ML) development ...

Building an ML Pipeline with Kubeflow - Manning Publications

Your challenges will include restructuring a complex deep learning project to make it Kubeflow-friendly, and developing reusable components that can be ...