question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Development friendly Kubeflow experience

See original GitHub issue

/kind feature

Why you need this feature: Say I have a local package containing my application logic (i.e cleaning, feature generation, ML model training, etc…). This local package contains modules and functions used in my component.

I want to make changes to the application logic (i.e change a feature scaling method), then run my pipeline and 1) make sure the pipeline works or 2) see an improvement in my offline metrics.

My component image needs to have all the dependencies on the image, so this seems to mean that if I want to run my kubeflow pipeline with new code, I need to re-build and submit an image each time. This is a pretty slow process, and prevents us from wanting to make smaller components (better to develop pipelines in Python and run them as a bigger component via a CLI command).

I’m imagining one solution with a local Kubeflow instance, that has the component images pointing to locally built docker images that have the local application code mounted, you can get a much faster iteration cycle.

Is there a better way to develop faster with Kubeflow? It says it’s experimentation friendly, but I haven’t felt that from working with Kubeflow so far (it is nice that it has experiment management/tracking in the UI though!). I don’t feel like I can swap my current experimentation workflow out for Kubeflow.

Maybe a user guide on developing locally could be a good solution? Something equivalent to pip install -e . for Kubeflow components would be great!

Issue Analytics

  • State:closed
  • Created 3 years ago
  • Comments:9 (6 by maintainers)

github_iconTop GitHub Comments

1reaction
DavidSpekcommented, Jan 19, 2021

/area pipelines Ping @Bobgy. Seeing as this is related to Pipelines specifically maybe it can be moved to the kubeflow/pipelines repo.

0reactions
stale[bot]commented, Apr 28, 2022

This issue has been automatically closed because it has not had recent activity. Please comment “/reopen” to reopen it.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Accelerating Machine Learning App Development ... - YouTube
Find out how running Kubeflow on Google Cloud helped GOJEK to dramatically accelerate the speed at which they could deliver machine learning ...
Read more >
Kubeflow Pipelines: Helping developers from prototyping to ...
In this episode of Google Cloud AI Huddle, Soroush Radpour, Software Engineer on the Google Cloud AI platform team, goes over Kubeflow ......
Read more >
Kubeflow: Not Yet Ready for Production? - Data Revenue
This article describes our Kubeflow experience. ... Model training and development: training and preparing models based on the processed data; ...
Read more >
Build flexible and scalable distributed training architectures ...
Build flexible and scalable distributed training architectures using Kubeflow on AWS and Amazon SageMaker ... Machine learning (ML) development ...
Read more >
Building an ML Pipeline with Kubeflow - Manning Publications
Your challenges will include restructuring a complex deep learning project to make it Kubeflow-friendly, and developing reusable components that can be ...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found