Long term solution for AutoML CI/CD test infrastructure

/kind feature

We need to figure out a long-term plan for running the CI test infrastructure for the AutoML projects. Right now, we use Travis to run our unit and integration tests, and the kubeflow-ci and automl-ci projects to run our e2e tests via Prow. A Prow test run follows these steps:

  • We submit an Argo workflow from a ksonnet app to the kubeflow-testing cluster in the kubeflow-ci project. Do we want to refactor this workflow using Tekton, for example?

  • For pre-submit, we use the gcr.io/automl-ci registry and the automl-ci project to build and push images. For post-submit, we use the gcr.io/kubeflow-images-public registry and the kubeflow-ci project to build and push images. As mentioned here: https://github.com/kubeflow/katib/pull/1326#issuecomment-689512234 we can switch to apps-cd to release our images instead of using post-submit. @gaocegege @johnugeorge Do we want to use our own registry under the automl-ci project or continue to use gcr.io/kubeflow-images-public?

  • For the e2e test, we create a Kubernetes cluster under the automl-ci project, deploy the manifests, and submit the required Experiments.
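
The pre-submit/post-submit registry split described above can be sketched as a small helper. This is only an illustration, not the project's actual tooling: the two registry names come from this issue, while the function name, the placeholder image name `example-image`, and the idea of reading the job type from a `JOB_TYPE` environment variable are assumptions.

```python
import os

# Registries named in the issue; the mapping itself is an illustrative assumption.
REGISTRY_BY_JOB_TYPE = {
    "presubmit": "gcr.io/automl-ci",
    "postsubmit": "gcr.io/kubeflow-images-public",
}


def image_ref(job_type: str, image: str, tag: str) -> str:
    """Return the full image reference for a pre-submit or post-submit job."""
    try:
        registry = REGISTRY_BY_JOB_TYPE[job_type]
    except KeyError:
        raise ValueError(f"unsupported job type: {job_type!r}") from None
    return f"{registry}/{image}:{tag}"


if __name__ == "__main__":
    # JOB_TYPE is assumed to be provided by the CI system; default to presubmit.
    job_type = os.environ.get("JOB_TYPE", "presubmit")
    # "example-image" and "latest" are hypothetical placeholders.
    print(image_ref(job_type, "example-image", "latest"))
```

Keeping the mapping in one place would make a later registry change (for example, moving releases to a separate process) a one-line edit.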

Since kubeflow-ci is being maintained at only 20% (https://github.com/kubeflow/testing/issues/737), we need to figure out how to deploy our test infra. Some possible solutions:

  • Use Kind with GitHub Actions to build images and run our e2e tests.
  • Switch to the AWS common test infra that @Jeffwan is currently working on.
  • Continue to maintain the automl-ci project and create a cluster there to trigger Prow jobs.
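
For the first option (Kind with GitHub Actions), here is a minimal sketch of what an e2e run might look like when driven from a script that a workflow step calls. It is not an existing script in the repository. The cluster name, image tag, Dockerfile path, manifest directory, and Experiment file are hypothetical placeholders, and it assumes the manifests are a kustomize directory. Only the `kind`, `docker`, and `kubectl` subcommands themselves are real.

```python
import subprocess
from typing import List


def e2e_commands(cluster: str, image: str, dockerfile: str,
                 manifests_dir: str, experiment: str) -> List[List[str]]:
    """Build the ordered list of CLI commands for one e2e run on Kind."""
    return [
        # Create a throwaway local cluster; kind also switches the kubectl context to it.
        ["kind", "create", "cluster", "--name", cluster],
        # Build the image locally instead of pushing it to a remote registry.
        ["docker", "build", "-t", image, "-f", dockerfile, "."],
        # Make the locally built image available to the cluster nodes.
        ["kind", "load", "docker-image", image, "--name", cluster],
        # Deploy the manifests (assumed to be a kustomize directory).
        ["kubectl", "apply", "-k", manifests_dir],
        # Submit the Experiment under test.
        ["kubectl", "apply", "-f", experiment],
    ]


def run_e2e(cluster: str, image: str, dockerfile: str,
            manifests_dir: str, experiment: str) -> None:
    """Run the commands in order and always attempt to delete the cluster."""
    try:
        for cmd in e2e_commands(cluster, image, dockerfile, manifests_dir, experiment):
            subprocess.run(cmd, check=True)
    finally:
        # Best-effort cleanup, even if an earlier step failed.
        subprocess.run(["kind", "delete", "cluster", "--name", cluster], check=False)


if __name__ == "__main__":
    # All arguments below are hypothetical placeholders.
    run_e2e("e2e-test", "example-image:e2e", "Dockerfile",
            "manifests/", "examples/experiment.yaml")
```

Because the image is loaded straight into the Kind cluster, a run like this would not need push access to any GCR registry, which is the main appeal of this option given the reduced kubeflow-ci support.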

/cc @gaocegege @johnugeorge @jlewi

Issue Analytics

  • State: closed
  • Created 3 years ago
  • Comments: 13 (11 by maintainers)

Top GitHub Comments

2 reactions
jlewi commented, Oct 26, 2020

@andreyvelich it looks like the cloudbuild service account 593963025935@cloudbuild.gserviceaccount.com for project kubeflow-ci should also have had its permissions removed as part of the lockdown of kubeflow-images-public. I have updated it to expire on 11/02

2 reactions
andreyvelich commented, Sep 21, 2020

@jlewi October 17 as a deadline looks good to us. We verified that GitHub Actions can work.

If we haven't switched to the AWS test infra by the deadline, we will use GitHub Actions. /cc @johnugeorge @gaocegege
