[Quantum-Cloud] Helm Chart for easy deployment of API-Server to "Deploy and Serve" Hybrid Models
See original GitHub issueFeature details
Benefits of Deploying and Serving Hybrid Model using REST API:
- End User would be able to get model results on any device, even inside Android/IOS applications if required.
- Hybrid models will be easy to use and test inside other applications/Models without much hassle.
- An ecosystem for hybrid models.
- Each model’s results could be easily tested for different goals and improved.
Query the Model using REST API
curl -d '{"input-data": [image-1.png, image-2.png, image-3.png, image-4.png]}' \
-X POST http://localhost:8080/v1/models/hybrid-model-torch:predict
OUTPUT: ["bee","ant","bee","deer"]
Other Projects are doing it.
Tensorflow: https://www.tensorflow.org/tfx/serving/docker PyTorch: https://github.com/pytorch/serve/
Why using Helm: https://helm.sh/
- To create packaging of configurations for different flavours of Pennylane(Different Interfaces or Plugins)
- Easy to Deploy on Kubernetes Cluster and Use like Quantum-Cloud.
- Easy to create and manage different Quantum-Nodes and communication between Quantum Nodes if required.
Implementation
By using Flask-Restful to create communication between Client and Hybrid-model running on Cloud. https://flask-restful.readthedocs.io/en/latest/
Creating volume Node for models which needs to be served. https://kubernetes.io/docs/concepts/storage/volumes/
Flask+Kubernetes: https://kubernetes.io/blog/2019/07/23/get-started-with-kubernetes-using-python/
And Finally Helm for one command Deployment and Maintenance.
How important would you say this feature is?
2: Somewhat important. Needed this quarter.
Additional information
AWSlabs: https://github.com/awslabs/multi-model-server
ForesFlow: https://github.com/ForestFlow/ForestFlow
Triton Inference server: https://github.com/triton-inference-server/server
Issue Analytics
- State:
- Created 2 years ago
- Comments:13 (13 by maintainers)
Thanks @arshpreetsingh for the clarification (and the initial request 🙂). Admittedly this is not something we have much experience with internally on the PennyLane team, so we have not thought much about it before. We might have to discuss internally if/how we can support
@arshpreetsingh The community calls are on the Unitary Fund Discord server. You can join here https://discord.com/invite/JqVGmpkP96
There on the left you will community-call within the voice channels. That’s where we meet on Thursdays at 11am ET!