[Bug] All Kafka resources unexpectedly removed when upgrading with Helm 3
Describe the bug I'm using Strimzi operator v0.19.0 and tried to upgrade to 0.20.0. When I ran the helm upgrade procedure, all my resources (users, topics, clusters) were removed. I tried to reproduce the problem on a freshly installed cluster, and the same thing happened.
To Reproduce Steps to reproduce the behavior:
1. helm install strimzi-kafka strimzi/strimzi-kafka-operator --namespace kafka --set watchNamespaces="{kafka,test-kafka}" --version=0.19.0
2. create cluster, users and topics from manifests (apiVersion: v1beta1)
3. helm upgrade strimzi-kafka strimzi/strimzi-kafka-operator --namespace kafka --set watchNamespaces="{kafka,test-kafka}"
kubectl get crd | grep kafka | wc -l
0
After the steps above, my cluster, users, and topics were removed. The operator pod tries to start and crashes with the following error:
2020-10-26 14:35:47 WARN WatchConnectionManager:198 - Exec Failure: HTTP 404, Status: 404 - 404 page not found
java.net.ProtocolException: Expected HTTP 101 response but was '404 Not Found'
at okhttp3.internal.ws.RealWebSocket.checkResponse(RealWebSocket.java:229) [com.squareup.okhttp3.okhttp-3.12.6.jar:?]
at okhttp3.internal.ws.RealWebSocket$2.onResponse(RealWebSocket.java:196) [com.squareup.okhttp3.okhttp-3.12.6.jar:?]
at okhttp3.RealCall$AsyncCall.execute(RealCall.java:203) [com.squareup.okhttp3.okhttp-3.12.6.jar:?]
at okhttp3.internal.NamedRunnable.run(NamedRunnable.java:32) [com.squareup.okhttp3.okhttp-3.12.6.jar:?]
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) [?:?]
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) [?:?]
at java.lang.Thread.run(Thread.java:834) [?:?]
2020-10-26 14:35:47 WARN WatchConnectionManager:198 - Exec Failure: HTTP 404, Status: 404 - 404 page not found
java.net.ProtocolException: Expected HTTP 101 response but was '404 Not Found'
at okhttp3.internal.ws.RealWebSocket.checkResponse(RealWebSocket.java:229) [com.squareup.okhttp3.okhttp-3.12.6.jar:?]
at okhttp3.internal.ws.RealWebSocket$2.onResponse(RealWebSocket.java:196) [com.squareup.okhttp3.okhttp-3.12.6.jar:?]
at okhttp3.RealCall$AsyncCall.execute(RealCall.java:203) [com.squareup.okhttp3.okhttp-3.12.6.jar:?]
at okhttp3.internal.NamedRunnable.run(NamedRunnable.java:32) [com.squareup.okhttp3.okhttp-3.12.6.jar:?]
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) [?:?]
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) [?:?]
at java.lang.Thread.run(Thread.java:834) [?:?]
Expected behavior The operator should be updated without removing resources.
Environment (please complete the following information):
- Strimzi version: 0.19.0
- Installation method: Helm chart
- Kubernetes cluster: v1.18.8
- Infrastructure: Rancher2 on Amazon EC2 instances
Top GitHub Comments
A quick workaround we found with our team:
There is also the option to edit the data in the Helm release secret instead of deleting it.
Remove the CRD data inside the templates and manifest sections and upload the secret again. Then the upgrade to 0.20.0 will leave the CRDs alone…
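
The secret edit described in this comment could be sketched roughly as follows in Python. This is a minimal sketch, not the commenter's actual procedure. It assumes the usual Helm 3 storage layout, where the secret's release field, as returned by the Kubernetes API, is base64 text wrapping Helm's own base64-encoded, gzip-compressed JSON, and where each chart template is stored as base64 data. The function names (decode_release, encode_release, strip_crds) and the plain string match on "kind: CustomResourceDefinition" are illustrative choices, not part of Helm or Strimzi. Back up the secret before trying anything like this.

```python
import base64
import gzip
import json
import re


def decode_release(secret_value: str) -> dict:
    """Decode the secret's `release` field (base64 text from the Kubernetes API)."""
    helm_payload = base64.b64decode(secret_value)  # undo the Kubernetes Secret encoding
    gzipped = base64.b64decode(helm_payload)       # undo Helm's own base64 layer (assumed)
    return json.loads(gzip.decompress(gzipped))


def encode_release(release: dict) -> str:
    """Inverse of decode_release: JSON -> gzip -> base64 -> base64."""
    gzipped = gzip.compress(json.dumps(release).encode("utf-8"))
    return base64.b64encode(base64.b64encode(gzipped)).decode("ascii")


def is_crd(yaml_text: str) -> bool:
    # Simple textual check; a real tool might parse the YAML instead.
    return "kind: CustomResourceDefinition" in yaml_text


def strip_crds(release: dict) -> dict:
    """Drop CRD documents from the chart templates and the rendered manifest."""
    chart = release.get("chart", {})
    kept = []
    for template in chart.get("templates", []):
        body = base64.b64decode(template["data"]).decode("utf-8")
        if not is_crd(body):
            kept.append(template)
    chart["templates"] = kept

    # The manifest is one string of YAML documents separated by '---' lines.
    docs = re.split(r"^---[ \t]*\n", release.get("manifest", ""), flags=re.MULTILINE)
    release["manifest"] = "".join(
        "---\n" + doc for doc in docs if doc.strip() and not is_crd(doc)
    )
    return release
```

The input would be the value of .data.release from the release secret, which Helm 3 typically names sh.helm.release.v1.<release>.v<revision> in the release namespace. The output of encode_release(strip_crds(decode_release(value))) would then be written back to that same field before running helm upgrade.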