InferenceService Status restructuring


Currently, ModelMesh has its own set of status fields that it populates in the InferenceService status since we allow unknown fields there. Similarly, KServe has its own status fields that are predominantly based on knative fields.

The goal here is to converge on a common set of status fields that would work with the various different deployment modes.

As an overview, here are the current status fields:

InferenceServiceStatus

InferenceServiceStatus
  ├──── duckv1.Status `json:",inline"`
  │        │    // The 'Generation' of the Service that was last processed by the controller.
  │        ├─── ObservedGeneration int64 `json:"observedGeneration,omitempty"`
  │        │
  │        │    // Conditions the latest available observations of a resource's current state.
  │        │    // Condition defines a readiness condition for a Knative resource.
  │        ├─── Conditions Conditions `json:"conditions,omitempty" patchStrategy:"merge" patchMergeKey:"type"`
  │        │
  │        │    // Additional Status fields for the Resource to save some additional State as well as convey more information to the user.
  │        ├─── Annotations map[string]string `json:"annotations,omitempty"`
  │
  ├──── Address *duckv1.Addressable `json:"address,omitempty"`
  ├──── URL *apis.URL `json:"url,omitempty"`
  ├──── Components map[ComponentType]ComponentStatusSpec `json:"components,omitempty"`

The ComponentType keys are predictor, explainer, and transformer; each maps to a ComponentStatusSpec.

An example of an actual status can be found here: https://pastebin.com/wkCrZyxk
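
As a rough illustration of how the inlined Knative status and the component map serialize together, here is a minimal Go sketch. It uses simplified local stand-ins rather than the real `duckv1.Status`, `apis.URL`, and `ComponentStatusSpec` types: `Address` is omitted, `URL` is a plain string, the single `URL` field on `ComponentStatusSpec` is an assumption, and the hostnames are made up for the example.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Condition is a simplified stand-in for the Knative condition type.
type Condition struct {
	Type   string `json:"type"`
	Status string `json:"status"`
	Reason string `json:"reason,omitempty"`
}

// Status is a simplified stand-in for duckv1.Status.
type Status struct {
	ObservedGeneration int64             `json:"observedGeneration,omitempty"`
	Conditions         []Condition       `json:"conditions,omitempty"`
	Annotations        map[string]string `json:"annotations,omitempty"`
}

// ComponentType names one component of an InferenceService.
type ComponentType string

const (
	PredictorComponent   ComponentType = "predictor"
	ExplainerComponent   ComponentType = "explainer"
	TransformerComponent ComponentType = "transformer"
)

// ComponentStatusSpec is hypothetical here: the listing above does not
// show its fields, so a single URL field is assumed for illustration.
type ComponentStatusSpec struct {
	URL string `json:"url,omitempty"`
}

// InferenceServiceStatus embeds Status, so encoding/json promotes the
// embedded fields to the top level of the JSON object.
type InferenceServiceStatus struct {
	Status     `json:",inline"`
	URL        string                                `json:"url,omitempty"`
	Components map[ComponentType]ComponentStatusSpec `json:"components,omitempty"`
}

// exampleStatus builds an illustrative status with made-up hostnames.
func exampleStatus() InferenceServiceStatus {
	return InferenceServiceStatus{
		Status: Status{
			ObservedGeneration: 1,
			Conditions:         []Condition{{Type: "Ready", Status: "True"}},
		},
		URL: "http://sklearn-iris.default.example.com",
		Components: map[ComponentType]ComponentStatusSpec{
			PredictorComponent: {URL: "http://sklearn-iris-predictor.default.example.com"},
		},
	}
}

// encodeStatus returns the compact JSON form of the status.
func encodeStatus(s InferenceServiceStatus) (string, error) {
	b, err := json.Marshal(s)
	return string(b), err
}

func main() {
	out, err := encodeStatus(exampleStatus())
	if err != nil {
		fmt.Println("encode error:", err)
		return
	}
	fmt.Println(out)
}
```

The point of the sketch is that `observedGeneration` and `conditions` sit beside `url` and `components` in one flat object, which is why any replacement for the Knative types would have to keep those top-level keys to stay compatible.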

ModelMesh Predictor Status

PredictorStatus
  ├──── Available bool `json:"available"`
  │
  │     // One of 'UpToDate', 'InProgress', 'BlockedByFailedLoad', or 'InvalidSpec'
  ├──── TransitionStatus TransitionStatus `json:"transitionStatus"`
  │
  │     // High level state string: Pending, Standby, Loading, Loaded, FailedToLoad
  ├──── ActiveModelState ModelState `json:"activeModelState"`
  ├──── TargetModelState ModelState `json:"targetModelState"`
  │
  │     // Details of last failure, when load of target model is failed or blocked
  ├──── LastFailureInfo *FailureInfo `json:"lastFailureInfo,omitempty"`
  │
  │     // Addressable endpoint for the deployed trained model. This will be "static" and will not change when the model is mutated
  ├──── HTTPEndpoint string `json:"httpEndpoint"`
  ├──── GrpcEndpoint string `json:"grpcEndpoint"`
  │
  │     // How many copies of this predictor's models failed to load recently
  ├──── FailedCopies int `json:"failedCopies"`

An example of an actual predictor status can be found here: https://pastebin.com/wBM3WgFW
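
For reference, the tree above could be written as a plain Go struct roughly as follows. This is a sketch based only on the fields and comments listed in this issue: the constant names are derived from the comment strings, the `FailureInfo` fields are hypothetical (its definition is not shown here), and the sample JSON values are invented for the example.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// TransitionStatus values, taken from the comment in the listing above.
type TransitionStatus string

const (
	UpToDate            TransitionStatus = "UpToDate"
	InProgress          TransitionStatus = "InProgress"
	BlockedByFailedLoad TransitionStatus = "BlockedByFailedLoad"
	InvalidSpec         TransitionStatus = "InvalidSpec"
)

// ModelState values, taken from the comment in the listing above.
type ModelState string

const (
	Pending      ModelState = "Pending"
	Standby      ModelState = "Standby"
	Loading      ModelState = "Loading"
	Loaded       ModelState = "Loaded"
	FailedToLoad ModelState = "FailedToLoad"
)

// FailureInfo is hypothetical: its fields are not shown in the issue.
type FailureInfo struct {
	Reason  string `json:"reason,omitempty"`
	Message string `json:"message,omitempty"`
}

// PredictorStatus mirrors the fields listed in the tree above.
type PredictorStatus struct {
	Available        bool             `json:"available"`
	TransitionStatus TransitionStatus `json:"transitionStatus"`
	ActiveModelState ModelState       `json:"activeModelState"`
	TargetModelState ModelState       `json:"targetModelState"`
	LastFailureInfo  *FailureInfo     `json:"lastFailureInfo,omitempty"`
	HTTPEndpoint     string           `json:"httpEndpoint"`
	GrpcEndpoint     string           `json:"grpcEndpoint"`
	FailedCopies     int              `json:"failedCopies"`
}

// parsePredictorStatus decodes a JSON status document.
func parsePredictorStatus(data []byte) (PredictorStatus, error) {
	var ps PredictorStatus
	err := json.Unmarshal(data, &ps)
	return ps, err
}

func main() {
	// Sample values are invented for illustration only.
	raw := []byte(`{
		"available": true,
		"transitionStatus": "UpToDate",
		"activeModelState": "Loaded",
		"targetModelState": "",
		"httpEndpoint": "",
		"grpcEndpoint": "grpc://modelmesh-serving:8033",
		"failedCopies": 0
	}`)
	ps, err := parsePredictorStatus(raw)
	if err != nil {
		fmt.Println("decode error:", err)
		return
	}
	fmt.Printf("available=%t transition=%s active=%s\n",
		ps.Available, ps.TransitionStatus, ps.ActiveModelState)
}
```

Unlike the KServe status, nothing here is expressed as a condition list; readiness is spread across `available`, `transitionStatus`, and the two model states.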

Action item: Decide how we can restructure the fields in a way that makes sense for all the deployment modes. Do we still rely on the Knative types?
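
To make the question more concrete, here is a sketch of one possible direction: keep a Knative-style condition list and derive a `Ready` condition from the ModelMesh predictor fields. None of this comes from KServe or ModelMesh code. The `readyCondition` function, the mapping rules, and the reason strings are all assumptions for discussion, and the types are trimmed local stand-ins.

```go
package main

import "fmt"

type TransitionStatus string
type ModelState string

const (
	UpToDate            TransitionStatus = "UpToDate"
	InProgress          TransitionStatus = "InProgress"
	BlockedByFailedLoad TransitionStatus = "BlockedByFailedLoad"
	InvalidSpec         TransitionStatus = "InvalidSpec"

	Loading ModelState = "Loading"
	Loaded  ModelState = "Loaded"
)

// PredictorStatus is trimmed to the fields this sketch needs.
type PredictorStatus struct {
	Available        bool
	TransitionStatus TransitionStatus
	ActiveModelState ModelState
}

// Condition is a simplified stand-in for a Knative-style condition.
type Condition struct {
	Type   string
	Status string // "True", "False", or "Unknown"
	Reason string
}

// readyCondition is a hypothetical mapping, not an existing API.
// Rule order is an assumption: a predictor that is still serving a
// loaded model is reported Ready even if a newer target failed to load.
func readyCondition(ps PredictorStatus) Condition {
	switch {
	case ps.Available && ps.ActiveModelState == Loaded:
		return Condition{Type: "Ready", Status: "True"}
	case ps.TransitionStatus == InvalidSpec,
		ps.TransitionStatus == BlockedByFailedLoad:
		return Condition{Type: "Ready", Status: "False", Reason: string(ps.TransitionStatus)}
	default:
		// Still pending or loading: readiness is not yet known.
		return Condition{Type: "Ready", Status: "Unknown", Reason: string(ps.TransitionStatus)}
	}
}

func main() {
	examples := []PredictorStatus{
		{Available: true, TransitionStatus: UpToDate, ActiveModelState: Loaded},
		{TransitionStatus: InvalidSpec},
		{TransitionStatus: InProgress, ActiveModelState: Loading},
	}
	for _, ps := range examples {
		fmt.Printf("%+v\n", readyCondition(ps))
	}
}
```

A mapping like this would let both deployment modes report readiness through the same `conditions` field, while fields with no Knative equivalent (such as `failedCopies`) would still need a home elsewhere in the status.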


Top GitHub Comments

chinhuang007 commented, Feb 17, 2022

For both KServe and ModelMesh to update and understand the status/state of an inference service, I would suggest continuing to use the Knative types. That probably means that InferenceServiceStatus needs to cover the ModelMesh status requirements, and ModelMesh needs to include the Knative packages and change its code accordingly.

pvaneck commented, Feb 17, 2022

FYI @yuzisun, @njhill, @chinhuang007
