Observing "no healthy upstream" for new deployments until ambassador pods restarted

Description of the problem: I am facing a very strange problem. Our IT department wants us to migrate our application testing pipeline to a new cluster. After deploying Ambassador with Helm (originally version 1.12.0), I tested the deployments of our applications. All the deployments succeeded, but every attempt to access an application returned the error “no healthy upstream” (the same deployment works in the old cluster).

At some point I learned that 1.12.1 had been released and upgraded Ambassador to 1.12.1 with “helm upgrade”. After that, all the previously broken application deployments started to work without any additional changes. But every new deployment had the same issue: the error “no healthy upstream”. Eventually Ambassador was upgraded to 1.12.2 with the same effect: the previously broken deployments started to work without any changes, and every new deployment failed with “no healthy upstream”.

Investigating connectivity confirmed that the application is reachable with curl from the Ambassador pod, both through the app Service and by connecting to the app pod directly. However, external requests to the application always ended with “no healthy upstream”.
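The in-cluster check described above could be scripted roughly as follows. This is only a sketch under assumptions: the namespace and Deployment name `ambassador` are guessed from the Helm release shown further down, the service URL `http://my-app.apps:8080/` is a hypothetical placeholder, and the `run` helper with its `DRY_RUN` switch is introduced here purely for illustration.

```shell
#!/usr/bin/env bash
# Sketch: probe an application Service from inside the Ambassador pod.
# Namespace, Deployment name and URL are assumptions, not taken from the issue.

# Print the command when DRY_RUN=1, otherwise execute it.
run() {
  if [ "${DRY_RUN:-0}" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

# probe_from_ambassador <namespace> <deployment> <url>
# Runs curl inside one pod of the Deployment and prints the HTTP status code.
probe_from_ambassador() {
  run kubectl -n "$1" exec "deploy/$2" -- \
    curl -sS -o /dev/null -w '%{http_code}' "$3"
}

# Example call (hypothetical service "my-app" in namespace "apps"):
# probe_from_ambassador ambassador ambassador http://my-app.apps:8080/
```

A successful status code here combined with “no healthy upstream” on external requests would match the symptom reported in this issue: the backend is reachable, but the proxy has not picked it up.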

Now, if the Ambassador pod is killed (the replica count was reduced to 1 to simplify log analysis) and the Deployment/ReplicaSet replaces it with a new pod, the issue is resolved: all the broken deployments start working (this was tested 3 times).
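As a stopgap, the pod replacement described above can be triggered with a rolling restart instead of killing the pod by hand. The sketch below is a workaround, not a fix, and rests on assumptions: the namespace and Deployment are both assumed to be named `ambassador` (matching the Helm release), the 120s timeout is arbitrary, and the `run` helper with `DRY_RUN` is a hypothetical convenience for previewing the commands.

```shell
#!/usr/bin/env bash
# Sketch: replace the Ambassador pods so a fresh pod picks up new deployments.
# Namespace and Deployment name are assumptions, not taken from the issue.

# Print the command when DRY_RUN=1, otherwise execute it.
run() {
  if [ "${DRY_RUN:-0}" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

# restart_ambassador <namespace> <deployment>
# Triggers a rolling restart, then waits until the new pods are ready.
restart_ambassador() {
  run kubectl -n "$1" rollout restart "deployment/$2" &&
    run kubectl -n "$1" rollout status "deployment/$2" --timeout=120s
}

# Example call:
# restart_ambassador ambassador ambassador
```

Setting `DRY_RUN=1` before calling the function prints the two kubectl commands without touching the cluster, which is handy for checking the names first.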

Details on the current deployment:

$ helm -n ambassador list
NAME            NAMESPACE       REVISION        UPDATED                                 STATUS          CHART                   APP VERSION
ambassador      ambassador      14              2021-03-31 10:20:18.8370383 -0400 EDT   deployed        ambassador-6.6.2        1.12.2

Is there something that I might be missing during the deployment of Ambassador?

Expected behavior: All new application deployments start working without the need to restart the Ambassador pods.

Versions:

  • Ambassador: 1.12.2 (1.12.0, 1.12.1)
  • Kubernetes environment: Azure Kubernetes Service (AKS), privatelink cluster (i.e. no access from the public internet and only internal LBs; the annotation “service.beta.kubernetes.io/azure-load-balancer-internal” is set to “true”)
  • Kubernetes version: v1.18.14

Additional context: None. I am not sure whether this is a bug. I would appreciate any workaround for our environment.

Issue Analytics

  • State: closed
  • Created 2 years ago
  • Reactions: 1
  • Comments: 15 (4 by maintainers)

Top GitHub Comments

1 reaction
wissam-launchtrip commented, Jun 19, 2021

No, actually it’s a different issue. Upstream services get disconnected for no clear reason, and we get a “no healthy upstream” error. This happens a few hours after the last deployment in the cluster. If we make a deployment in the cluster, the error disappears.

0 reactions
esmet commented, May 24, 2021

@wissam-launchtrip can you go into a bit more detail? Are you seeing this exact issue or something similar? Anything that can help us verify the report and reproduce the issue for a possible fix 👍
