Long prediction using KFServing API 504 Gateway Timeout
/kind bug
What steps did you take and what happened: We ran a custom container. With short, small inputs that respond within 10 seconds, it works fine.
However, when we ran the custom container with a long input that takes more than 30 seconds to process, curl returned 504 Gateway Timeout. The queue-proxy reports a 127.0.0.1:8080 connection refused error.
What did you expect to happen: The request should wait for a response that takes longer, and the probe should be readable.
Environment:
- Knative Version: 0.11.1
- KFServing Version: 0.2.2
- Kubeflow version: 1.0.0
- Microk8s: 1.18.3
- Kubernetes version (use kubectl version): 1.18.3
- OS (e.g. from /etc/os-release): RHEL
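To tell a gateway-side cutoff from a client-side one, it can help to time the request with a client timeout well above the suspected limit. The following is a minimal sketch using only the Python standard library; the function name `timed_post`, the `host_header` parameter, and the example URL and payload are illustrative assumptions, not values taken from this issue.

```python
import json
import time
import urllib.error
import urllib.request


def timed_post(url, payload, host_header=None, timeout=120.0):
    """POST a JSON payload and return (HTTP status, elapsed seconds).

    `timeout` is the client-side socket timeout. Keeping it well above the
    suspected gateway limit means a 504 seen here came from the server side.
    """
    headers = {"Content-Type": "application/json"}
    if host_header:
        # Optional Host override (hypothetical; often needed behind an ingress).
        headers["Host"] = host_header
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers=headers,
        method="POST",
    )
    start = time.monotonic()
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            status = response.status
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses, including 504, are raised as HTTPError.
        status = err.code
    return status, time.monotonic() - start


# Example call (hypothetical URL, host, and payload):
# status, elapsed = timed_post(
#     "http://<ingress-ip>/v1/models/<model-name>:predict",
#     {"instances": [[1.0, 2.0]]},
#     host_header="<service-hostname>",
# )
```

If the status is 504 and the elapsed time is consistently close to one fixed value regardless of input size, that points to a configured timeout somewhere between the client and the container rather than a failure in the model code.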
Issue Analytics
- State:
- Created 3 years ago
- Comments: 13 (5 by maintainers)
Top GitHub Comments
@yuzisun How can we change the default timeout value using the V1alpha2 specs? I understand that the “timeout” option in V1beta1PredictorSpec allows changing the default timeout value. We are currently using the KFServing 0.4.1 release and the kfserving pipeline component to deploy services.
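For reference, the v1beta1 "timeout" option mentioned in the comment above can be sketched as a manifest built in Python. This is only an illustration under stated assumptions: the `serving.kubeflow.org/v1beta1` API group, the container name `kfserving-container`, the service name, and the image are assumed values, and the helper `inference_service_manifest` is hypothetical. It does not answer the v1alpha2 question, which the comment leaves open.

```python
import json


def inference_service_manifest(name, image, timeout_seconds):
    """Build a v1beta1 InferenceService manifest with a predictor timeout.

    Assumptions (not from the issue): the API group/version string, the
    container name, and the custom-container layout under `predictor`.
    """
    return {
        "apiVersion": "serving.kubeflow.org/v1beta1",  # assumed API group
        "kind": "InferenceService",
        "metadata": {"name": name},
        "spec": {
            "predictor": {
                # Request timeout for the predictor component, in seconds.
                "timeout": timeout_seconds,
                "containers": [
                    {"name": "kfserving-container", "image": image}
                ],
            }
        },
    }


manifest = inference_service_manifest(
    "long-predict",                      # hypothetical service name
    "example.com/custom-model:latest",   # hypothetical image
    120,
)
print(json.dumps(manifest, indent=2))
```

The printed JSON could be saved to a file and applied with kubectl, since kubectl accepts JSON manifests as well as YAML. Whether the value takes effect depends on the installed KFServing and Knative versions.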